Caching AI SDK Models #317

mattpocock · 2025-11-10T17:58:29Z

Fixes #309

changeset-bot · 2025-11-10T17:58:33Z

🦋 Changeset detected

Latest commit: b8a1dfc

The changes in this PR will be included in the next version bump.

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

vercel · 2025-11-10T17:58:37Z

The latest updates on your projects. Learn more about Vercel for GitHub.

2 Skipped Deployments

Project	Deployment	Preview	Comments	Updated (UTC)
evalite	Ignored			Nov 14, 2025 11:26am
evalite-beta-docs	Ignored			Nov 14, 2025 11:26am

pkg-pr-new · 2025-11-10T17:59:25Z

Open in StackBlitz

npm i https://pkg.pr.new/mattpocock/evalite@317

commit: 0e0fcc0

mattpocock · 2025-11-11T09:21:01Z

There are a couple of things I'm not happy with about this PR.

Overall, the implementation is solid, and the UI is nice. But:

Scorers should show cache hits individually, since this gives a better breakdown to users
We should consolidate tracing and caching into a single wrapAISDKModel call

It doesn't really make sense to me to have caching without tracing, or tracing without caching. If you want a great AI SDK integration, you really just want to call one function, wrap your model, and you're good to go. We can have options to disable tracing or disable caching, but having a single function seems indispensable.

My thinking is that inside scorers, we would want to prevent tracing but allow caching, since scorer traces don't make sense in the trace view.

mattpocock changed the base branch from main to v1 November 10, 2025 17:58

mattpocock changed the title ~~matt/caching ai sdk models~~ Caching AI SDK Models Nov 10, 2025

mattpocock marked this pull request as draft November 10, 2025 17:59

mattpocock marked this pull request as ready for review November 10, 2025 21:29

mattpocock marked this pull request as draft November 10, 2025 21:29

mattpocock force-pushed the matt/caching-ai-sdk-models branch from c1a2208 to 86b625c Compare November 11, 2025 09:18

mattpocock force-pushed the matt/caching-ai-sdk-models branch from 9bf49cd to 0061595 Compare November 14, 2025 10:34

mattpocock added 10 commits November 14, 2025 10:41

Added the server implementation and added the types to storage

095a0dc

Server will now attempt to find another port if 3006 is unavailable.

a50b983

Added AI SDK cache functions

84c928c

Added caching to all built-in scorers

99b70db

Updates

a1a45de

Showed cache hits in the UI

88dd5b0

Phase 1 of integration

93aea3c

Got it half working

19d0a6f

Updates

49efdba

Fixed bugs and added tests

61ece64

mattpocock force-pushed the matt/caching-ai-sdk-models branch from 0061595 to 61ece64 Compare November 14, 2025 10:41

mattpocock added 6 commits November 14, 2025 10:49

Updates to fix CI

753691f

Updates

cbfb562

Tweak to changeset

44c1f0b

Bugfix for table header

e5db053

Updated docs

3df7791

Updates

730afb1

mattpocock marked this pull request as ready for review November 14, 2025 11:14

Updates to test reliability

b8a1dfc

mattpocock merged commit 4fd065e into v1 Nov 14, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Caching AI SDK Models #317

Caching AI SDK Models #317

mattpocock commented Nov 10, 2025 •

edited

Loading

Uh oh!

changeset-bot bot commented Nov 10, 2025 •

edited

Loading

Uh oh!

vercel bot commented Nov 10, 2025 •

edited

Loading

Uh oh!

pkg-pr-new bot commented Nov 10, 2025

Uh oh!

mattpocock commented Nov 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Caching AI SDK Models #317

Caching AI SDK Models #317

Conversation

mattpocock commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot bot commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

vercel bot commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pkg-pr-new bot commented Nov 10, 2025

Uh oh!

mattpocock commented Nov 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mattpocock commented Nov 10, 2025 •

edited

Loading

changeset-bot bot commented Nov 10, 2025 •

edited

Loading

vercel bot commented Nov 10, 2025 •

edited

Loading