Commit ad2faee
committed
fix: pre-compile vllm-metal paged_ops extension in tarball build
vllm-metal v0.2.0 JIT-compiles a paged_ops C++ extension using clang++
at runtime. This fails inside the macOS sandbox which blocks compiler
invocations. Instead, compile the extension during the tarball build
(where Xcode CLT is available) and ship the .so in a prebuilt/ dir.
At install time, model-runner copies the pre-built .so into the user's
~/.cache/vllm-metal/ cache directory. vllm-metal's build.py sees the
cached .so is newer than the sources and skips JIT compilation.
This also reverts the include/ directory preservation since the Python
headers are only needed for compilation, which now happens at build time.1 parent 7e37085 commit ad2faee
2 files changed
Lines changed: 33 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
176 | 176 | | |
177 | 177 | | |
178 | 178 | | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
179 | 202 | | |
180 | 203 | | |
181 | 204 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
70 | 70 | | |
71 | 71 | | |
72 | 72 | | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
73 | 82 | | |
74 | 83 | | |
75 | | - | |
76 | | - | |
| 84 | + | |
77 | 85 | | |
78 | 86 | | |
79 | 87 | | |
| |||
0 commit comments