Skip to content

feat(agent-profile): snap vendor-locked harnesses to native model#303

Merged
drewstone merged 1 commit into
mainfrom
feat/expand-profile-axes-snap
Jul 2, 2026
Merged

feat(agent-profile): snap vendor-locked harnesses to native model#303
drewstone merged 1 commit into
mainfrom
feat/expand-profile-axes-snap

Conversation

@drewstone

Copy link
Copy Markdown
Contributor

What

expandProfileAxes previously dropped harness×model cells for vendor-locked harnesses (e.g. kimi-code) that cannot run an arbitrary model. This adds a harness→model snap: such harnesses resolve to their native model via the exported HARNESS_NATIVE_MODEL='default' sentinel instead of being silently dropped.

Why

Cross-harness leaderboards (opencode vs kimi-code) need every harness to produce a runnable cell. Dropping the vendor-locked ones left the matrix incomplete. With the snap, kimi-code runs on its native Kimi model while portable harnesses run the requested model.

Change shape

Additive: new exported sentinel + new snap branch in expandProfileAxes. No existing signature changes. Version trio bumped to 0.102.0.

Verification

  • pnpm exec vitest run = 5320 passed / 4 skipped
  • pnpm build rc=0, biome check clean, tsc --noEmit rc=0
  • src/agent-profile.test.ts spot-check: 28 tests pass

…el instead of dropping

expandProfileAxes dropped (harness, model) pairs a vendor-locked harness
couldn't run, so sweeping a single deepseek model silently excluded kimi-code
/codex/claude-code — no head-to-head. Now a universal harness (opencode/pi/
claudish) runs every swept model; a vendor-locked harness runs the swept models
in its family, or snaps to its native default (HARNESS_NATIVE_MODEL sentinel,
resolved by the harness at runtime) when it supports none — so every requested
harness appears in the sweep on a model it can actually run. keepIncompatible
still forces every pair verbatim.

@tangletools tangletools left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

✅ Auto-approved drewstone PR — e8a7550d

This PR was opened by the trusted drewstone account.
The full PR reviewer audit still runs separately and will publish findings if it detects issues.

tangletools · auto-approval · reason: drewstone_author · 2026-07-02T02:28:13Z

@drewstone drewstone merged commit 1a588ab into main Jul 2, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants