Commit c1ddf13
phase A-2: answer-question alignment + neighbor research
Added to verifier:
- Temporal alignment: year/date questions must have year in answer
- Event alignment: battle questions must name a battle
- Definition alignment: "what does X mean" must have definition
- These catch "related but wrong" answers (RLV's core differentiator)
Added to researcher:
- Neighbor-first retry: try adjacent chunks (±1, ±2) before
searching a completely different article region
- Human pattern: "not on this page → check next page"
Result: 15/20 unchanged — 5 failures have answers that pass all
alignment checks (contain proper nouns, years, etc.) but are from
the wrong section of the same article. Verifier correctly identifies
these as "related" but cannot distinguish "related" from "correct"
without LLM-based semantic verification.
Remaining gap to 100%: requires LLM answer-question coherence check,
which adds ~15s per question. Speed vs accuracy tradeoff.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 93ad892 commit c1ddf13
2 files changed
Lines changed: 69 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
16 | | - | |
| 16 | + | |
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
| |||
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
55 | | - | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
56 | 75 | | |
57 | 76 | | |
58 | 77 | | |
59 | 78 | | |
60 | | - | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
61 | 96 | | |
62 | 97 | | |
63 | 98 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
165 | 165 | | |
166 | 166 | | |
167 | 167 | | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
168 | 199 | | |
169 | 200 | | |
170 | 201 | | |
| |||
0 commit comments