Commit 0b7a524
correction #10: 2-bit Pareto claim withdrawn + k128 FP32 parity validated
3970-token eval (BPE O(n log n)):
turbo_kv_4b + k128: PPL 19.39 (-0.1% vs FP32) at 3.2% FP32 ✅
uniform_2b + k512: PPL 26.53 (+36.7% vs FP32) ❌ — quality collapse
The "2-bit+k512 Pareto-dominates flat 4-bit" claim was an artifact of
957-token eval where k512 = 53% FP32. At honest long context, 2-bit
is vastly worse. Claim withdrawn.
Real S1 finding: 128 FP32 tokens achieve context-length-invariant
quality recovery at 4-bit compression.
Honest correction track: 10 of 10 self-found.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 4cdf81d commit 0b7a524
1 file changed
Lines changed: 18 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
34 | | - | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
35 | 52 | | |
36 | 53 | | |
37 | 54 | | |
| |||
0 commit comments