Skip to content

Commit 06f6b1b

Browse files
committed
refactor(compute): split gpu_engine.go into focused files
Extract methods from gpu_engine.go (3521 -> 2245 lines) into three focused files to improve navigability: - gpu_engine_elementwise.go (400 lines): Add/Sub/Mul/Div, scalar ops, Exp/Log/Sin/Cos/Tanh/Pow, Sqrt/Rsqrt, fused RoPE/SwiGLU/RMSNorm, CosineSimilarity, HadamardTransform - gpu_engine_reduction.go (221 lines): Sum, Softmax, ReduceSum/Max/Mean, GPUArgmax, GPUScaledSoftmax, GPUFusedSoftmaxVMul - gpu_engine_memory.go (695 lines): Transpose, Zero/Zeros/Copy, Gather, ScatterAdd, Fill, Split/Concat/Repeat/RepeatInterleave, Reshape, OneHot, ConvertFP16ToF32 Zero behavioral changes. All method signatures identical. Build, vet, and race-detector tests pass.
1 parent 6e4c241 commit 06f6b1b

4 files changed

Lines changed: 1316 additions & 1276 deletions

File tree

0 commit comments

Comments
 (0)