fix: drop unused tokenizer vocab assets #18

Merged: gianni-cor merged 1 commit into 2026-07-03 from qvac/slim-tokenizer-assets on Jul 4, 2026
@gianni-cor

Summary

  • Stop embedding Gemma2 and GPT-OSS vocab/merge headers in the 2026-07-03 sd.cpp source branch.
  • Keep the existing loader function symbols, but return runtime errors if unsupported Gemma2/GPT-OSS paths are used.
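
The second bullet could look roughly like the sketch below. This is an illustration only: the function names `load_gemma2_merges` and `load_gpt_oss_merges`, their return type, and the choice to throw `std::runtime_error` are all assumptions, not taken from the actual diff. The point is that the symbols stay defined so existing callers still link, while no generated vocab/merge header is included.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper: builds the error raised by every stubbed loader.
static std::runtime_error unsupported_tokenizer(const std::string& name) {
    return std::runtime_error(name + " tokenizer assets are not bundled in this build");
}

// Hypothetical loader names; the real symbols in the sd.cpp branch may differ.
// Each function keeps its declaration and definition, but no longer pulls in
// a generated vocab/merge array. Calling it fails at runtime instead.
std::string load_gemma2_merges() {
    throw unsupported_tokenizer("Gemma2");
}

std::string load_gpt_oss_merges() {
    throw unsupported_tokenizer("GPT-OSS");
}
```

With stubs of this shape, a caller that accidentally reaches a Gemma2 or GPT-OSS path gets an explicit error message rather than a link failure or an empty vocabulary.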

Why

QVAC diffusion-cpp does not expose the upstream Lens/PiD paths that require these tokenizers, but their generated vocab arrays were linked into every static prebuild.

Test plan

  • Ran .
  • Not run: full native build.

Made with Cursor

@gianni-cor merged commit 5832f9a into 2026-07-03 on Jul 4, 2026
15 checks passed