[2/N] add cuda imatrix support for custom RL model by yiakwy-xpu-ml-framework-team · Pull Request #377 · antirez/ds4

yiakwy-xpu-ml-framework-team · 2026-06-10T03:46:41Z

Introduction

Previously we imatrix was assumed to be tuned on Metal machine (yes we have one M3/2 Ultra) but it is handy to tune the file directly on Hopper platform to avoid network traffices for a 250 GB model.

This is a follow up of #368

Usage:

DS4_CUDA_IMATRIX_GPU_COLLECT=1 ./ds4 --backend cuda
-m gguf/DeepSeek-V4-Flash-Q4KExperts-F16HC-F16Compressor-F16Indexer-Q8Attn-Q8Shared-Q8Out-chat-v2-imatrix.gguf
--backend cuda
--imatrix-dataset gguf-tools/imatrix/dataset/rendered_prompts.txt
--imatrix-out gguf/DeepSeek-V4-Flash-chat-v2-routed-moe-ds4.dat \

Note DS4_CUDA_IMATRIX_GPU_COLLECT=0 , means we will use legacy imatrix logics without gpu acceleration to compute imatrix scores.

yiakwy-xpu-ml-framework-team · 2026-06-10T03:48:11Z

@antirez I guess this is a useful feature , wish your feedback!

add cuda imatrix support

a9299c9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[2/N] add cuda imatrix support for custom RL model#377

[2/N] add cuda imatrix support for custom RL model#377
yiakwy-xpu-ml-framework-team wants to merge 1 commit into
antirez:mainfrom
yiakwy-xpu-ml-framework-team:add_cuda_imatrix_support

yiakwy-xpu-ml-framework-team commented Jun 10, 2026 •

edited

Loading

Uh oh!

yiakwy-xpu-ml-framework-team commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yiakwy-xpu-ml-framework-team commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Introduction

Usage:

Uh oh!

yiakwy-xpu-ml-framework-team commented Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yiakwy-xpu-ml-framework-team commented Jun 10, 2026 •

edited

Loading