Commit 0fa8cec
docs: update README with measured 128K context data (M1 Pro 16GB)
Replace theoretical context table with REAL measured RSS:
- Llama 3.2 3B + 128K context = 9.5 GB (6.4x compression)
- FP32 at 128K would need ~30 GB → OOM on 16GB Mac
- Generation speed: 6.6 tok/s at 16K (same as FP32)
- Hero stat: "128K context on 16GB Mac"
All numbers measured with /usr/bin/time -l on Apple M1 Pro.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent 5c79c8d commit 0fa8cec
1 file changed
Lines changed: 12 additions & 12 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| |||
82 | 82 | | |
83 | 83 | | |
84 | 84 | | |
85 | | - | |
| 85 | + | |
86 | 86 | | |
87 | | - | |
| 87 | + | |
88 | 88 | | |
89 | | - | |
90 | | - | |
91 | | - | |
92 | | - | |
93 | | - | |
94 | | - | |
95 | | - | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
96 | 97 | | |
97 | 98 | | |
98 | | - | |
99 | | - | |
| 99 | + | |
100 | 100 | | |
101 | 101 | | |
102 | 102 | | |
| |||
0 commit comments