Expand Memory.swift evals with metadata-aware smoke coverage and grouped gates #15

Open
zac wants to merge 4 commits into main from agent-memory-improvements

zac (Member) commented Jun 11, 2026

Summary

  • Added shared eval row metadata and stricter dataset validation across storage, recall, query expansion, and agent-memory suites.
  • Introduced a small committed synthetic smoke dataset plus a focused baseline for category-aware gating.
  • Updated reports, reduced metrics, and Markdown output to surface grouped metrics by category, source family, and difficulty.
  • Refined memory extraction, debug, storage migration, and CoreML default configuration behavior to support the new eval coverage.
  • Added/updated reusable skills for designing memory evals and synthetic datasets.
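
To make the grouped, category-aware gating described above concrete, here is a minimal sketch in Swift. Every name in it (`EvalRowMetadata`, `EvalRowResult`, `groupedPassRate`, `failingGroups`) is hypothetical and chosen for illustration; none is taken from this PR's diff. It also assumes that a group missing from the current run counts as a pass rate of 0, which may not match the real gate.

```swift
// Hypothetical types for illustration only; the PR's actual names may differ.
struct EvalRowMetadata {
    let category: String
    let sourceFamily: String
    let difficulty: String
}

struct EvalRowResult {
    let metadata: EvalRowMetadata
    let passed: Bool
}

/// Groups results by one metadata field and returns the pass rate per group.
func groupedPassRate(
    _ results: [EvalRowResult],
    by field: KeyPath<EvalRowMetadata, String>
) -> [String: Double] {
    let groups = Dictionary(grouping: results) { $0.metadata[keyPath: field] }
    return groups.mapValues { rows in
        Double(rows.filter { $0.passed }.count) / Double(rows.count)
    }
}

/// Returns the groups whose pass rate is below the baseline minimum.
/// Assumption: a group absent from `current` is treated as a rate of 0.
func failingGroups(current: [String: Double], baseline: [String: Double]) -> [String] {
    baseline
        .filter { (current[$0.key] ?? 0) < $0.value }
        .keys
        .sorted()
}
```

Under these assumptions, the same `groupedPassRate` call can be reused with `\.sourceFamily` or `\.difficulty` to produce the other two groupings, and a gate fails when `failingGroups` returns a non-empty list.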

Testing

  • Added and updated unit coverage for the new metadata, migration, and eval-reporting paths.
  • Validated the updated eval dataset structure and baseline shape.
  • Test suite not run (not requested).
