Commit 71ee7a4

unamedkr and claude committed
README: clarify Gemma 4 status as experimental (non-standard GGUF)
The gemma4 GGUF format is not yet supported by llama.cpp itself (error: "unknown model architecture: gemma4"). Our implementation produces semantically relevant tokens but has repetition issues. This is likely a GGUF conversion limitation, not an inference bug.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
1 parent bfde618 commit 71ee7a4
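The "unknown model architecture" error arises when llama.cpp reads the `general.architecture` key from the GGUF header and finds a string it does not recognize. As a minimal sketch of where that string lives (a hypothetical standalone parser, not code from this repo; it assumes a GGUF v3 header whose metadata values are all strings):

```python
import struct

GGUF_MAGIC = b"GGUF"
GGUF_TYPE_STRING = 8  # GGUF value-type tag for strings

def read_architecture(data: bytes) -> str:
    """Return the general.architecture string from a GGUF header.

    Minimal sketch: handles only string-valued metadata entries,
    enough to locate the architecture key that llama.cpp checks.
    """
    if data[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file")
    # Header: uint32 version, uint64 tensor count, uint64 metadata KV count
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    off = 4 + 4 + 8 + 8
    for _ in range(kv_count):
        # Key: uint64 length followed by UTF-8 bytes
        (klen,) = struct.unpack_from("<Q", data, off); off += 8
        key = data[off:off + klen].decode(); off += klen
        # Value: uint32 type tag, then (for strings) uint64 length + bytes
        (vtype,) = struct.unpack_from("<I", data, off); off += 4
        if vtype != GGUF_TYPE_STRING:
            raise ValueError(f"unhandled value type {vtype} for key {key!r}")
        (vlen,) = struct.unpack_from("<Q", data, off); off += 8
        val = data[off:off + vlen].decode(); off += vlen
        if key == "general.architecture":
            return val
    raise KeyError("general.architecture not found")
```

A file whose architecture string is "gemma4" parses fine at this level; the failure happens later, when llama.cpp tries to map that string to a known model graph.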

1 file changed: README.md (1 addition & 1 deletion)
@@ -111,7 +111,7 @@ Cross-model (4b K + Q4 V): SmolLM2 1.7B (-1.6%), Qwen3.5 0.8B (+0.9%), Qwen3.5 4
 | Qwen3.5-4B | Qwen3.5 (DeltaNet) | 4B | PPL verified |
 | Qwen3.5-35B-A3B | Qwen2-MoE | 35B (3B active) | Working |
 | Gemma 3 270M | Gemma 3 | 270M | Working |
-| Gemma 4 E2B | Gemma 4 | 2B | WIP |
+| Gemma 4 E2B | Gemma 4 | 2B | Experimental (non-standard GGUF) |
 
 Architectures: Llama/Qwen3.5 (shared path), Gemma 3/4 (sliding + full attention), Qwen2-MoE.
