- "Zero dependencies" → "No external libraries" (pthreads is a system dep)
- "5 architectures" → accurate description (3 code paths: Llama and Qwen3.5
share model_type=0, Gemma 3/4 share model_type=1, plus Qwen2-MoE)
- "4x longer context" → "~4x" with footnote that numbers are estimates
based on KV memory reduction, not actual measurements
- Dependencies table: "Zero (libc only)" → "libc + pthreads only"
- Apply same fixes to README.ko.md
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
quant.cpp is not a fork. It's a standalone engine built from scratch for one goal: **LLM inference you can understand, customize, and ship inside your own product.**