
feat: Add prompt caching to OpenAI-compatible custom models #1587


Merged · 2 commits merged into RooCodeInc:main from the cache branch on Mar 12, 2025

Conversation

@dleen commented Mar 12, 2025

Context

Builds on PR #1562 to add cache control support to the OpenAI-compatible provider. That PR updated the UI with an option to specify that a model supports prompt caching.

Implementation

The OpenRouter provider already implements adding the cache control key to OpenAI messages. Acknowledging the risk of duplication, this PR copies that implementation nearly wholesale; there is probably an opportunity to unify the OpenAI-compatible and OpenRouter implementations in the future.
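
For context, a condensed sketch of the borrowed approach: the system prompt and the last couple of user messages (two in this sketch) get ephemeral cache_control markers so the stable prefix of the conversation can be cached. Names and exact placement are illustrative, assuming the official openai SDK types; see the merged openai.ts for the real code.

```typescript
import OpenAI from "openai"

type Message = OpenAI.Chat.Completions.ChatCompletionMessageParam

function withCacheBreakpoints(systemPrompt: string, messages: Message[]): Message[] {
	// Mark the system prompt as cacheable.
	const system: Message = {
		role: "system",
		content: [
			{
				type: "text",
				text: systemPrompt,
				// @ts-ignore-next-line -- cache_control is not in the OpenAI types
				cache_control: { type: "ephemeral" },
			},
		],
	}

	// Add a breakpoint to the last text part of the two most recent user
	// messages so the growing conversation prefix stays cacheable.
	const userIndices = messages.flatMap((m, i) => (m.role === "user" ? [i] : []))
	const targets = new Set(userIndices.slice(-2))

	const marked = messages.map((message, index) => {
		if (!targets.has(index) || message.role !== "user" || typeof message.content === "string") {
			return message
		}
		const parts = [...message.content]
		const lastText = parts.map((p) => p.type).lastIndexOf("text")
		if (lastText !== -1) {
			// @ts-ignore-next-line
			parts[lastText] = { ...parts[lastText], cache_control: { type: "ephemeral" } }
		}
		return { ...message, content: parts }
	})

	return [system, ...marked]
}
```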

Screenshots

[Screenshot: the "Prompt Caching" option in the OpenAI-compatible model settings]

How to Test

  1. Select the prompt caching option in the UI and enter a gateway server as the base URL.
  2. Start a chat request.
  3. On the gateway server, observe the cache control key in the messages: 'type': 'text', 'cache_control': {'type': 'ephemeral'}
  4. On the gateway server, observe the usage response (a sketch for interpreting it follows this list):
'usage': {'cacheReadInputTokenCount': 16974, 'cacheReadInputTokens': 16974, 'cacheWriteInputTokenCount': 4438, 'cacheWriteInputTokens': 4438, 'inputTokens': 4, 'outputTokens': 222, 'totalTokens': 21638}
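
As a sanity check on step 4: cacheReadInputTokens counts tokens served from cache and cacheWriteInputTokens counts tokens newly written to it; together with inputTokens and outputTokens they add up to totalTokens. A small sketch using the logged values (illustrative only, not part of the PR):

```typescript
// Compute how much of the prompt was served from cache, using the logged
// usage payload from step 4. Field names follow the log above.
const usage = {
	cacheReadInputTokens: 16974,
	cacheWriteInputTokens: 4438,
	inputTokens: 4,
	outputTokens: 222,
}

const promptTokens = usage.cacheReadInputTokens + usage.cacheWriteInputTokens + usage.inputTokens
const cacheHitRate = usage.cacheReadInputTokens / promptTokens
console.log(`cache hit rate: ${(cacheHitRate * 100).toFixed(1)}%`) // ~79.3% of 21,416 prompt tokens
```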

Get in Touch

Roo Code Discord handle: dleen


Important

Adds prompt caching support for OpenAI-compatible custom models, including UI options and message handling in openai.ts.

  • Behavior:
    • Adds prompt caching support to OpenAI-compatible custom models in openai.ts.
    • Implements cache control in createMessage() for models with supportsPromptCache.
    • Copies logic from OpenRouter to add cache_control to user messages.
  • UI Changes:
    • Adds "Prompt Caching" checkbox in ApiOptions.tsx for OpenAI-compatible models.
    • Allows configuration of cache read/write prices when prompt caching is enabled (a sketch of how these fields could feed cost accounting appears after this summary).
  • Misc:
    • Updates .changeset/thin-fans-deliver.md to document the feature addition.

This description was created by Ellipsis for 10c1e7d. It will automatically update as commits are pushed.
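
The cache read/write price options in the summary above map to per-token cost accounting. A rough sketch of how those fields could fit together; the field names are inferred from the summary, and the helper is hypothetical rather than the merged code:

```typescript
// Inferred shape of the custom-model settings these UI options populate.
// Field names follow the summary above and may not match the codebase exactly.
interface CustomModelInfo {
	supportsPromptCache: boolean // toggled by the "Prompt Caching" checkbox
	inputPrice: number // $ per 1M uncached input tokens
	outputPrice: number // $ per 1M output tokens
	cacheReadsPrice?: number // $ per 1M tokens read from cache
	cacheWritesPrice?: number // $ per 1M tokens written to cache
}

// Hypothetical cost estimate for a usage payload like the one in "How to Test".
function estimateCostUsd(
	model: CustomModelInfo,
	usage: {
		inputTokens: number
		outputTokens: number
		cacheReadInputTokens: number
		cacheWriteInputTokens: number
	},
): number {
	return (
		(usage.inputTokens * model.inputPrice +
			usage.outputTokens * model.outputPrice +
			usage.cacheReadInputTokens * (model.cacheReadsPrice ?? model.inputPrice) +
			usage.cacheWriteInputTokens * (model.cacheWritesPrice ?? model.inputPrice)) /
		1_000_000
	)
}
```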


changeset-bot bot commented Mar 12, 2025

🦋 Changeset detected

Latest commit: 10c1e7d

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package
Name       Type
roo-cline  Patch


@dosubot bot added the size:L (This PR changes 100-499 lines, ignoring generated files) and enhancement (New feature or request) labels Mar 12, 2025
type: "text",
text: systemPrompt,
// @ts-ignore-next-line
cache_control: { type: "ephemeral" },
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Avoid using // @ts-ignore-next-line to bypass type errors for the cache_control property. Consider extending the type definitions instead, and also evaluate extracting this caching logic into a shared helper to reduce duplication with the OpenRouter implementation.
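
One way to implement this suggestion, as a minimal sketch assuming the official openai SDK types (not the merged code):

```typescript
import OpenAI from "openai"

// Local extension of the OpenAI SDK's text content part carrying the
// Anthropic-style cache_control field, so no @ts-ignore is needed.
type CacheControlTextPart = OpenAI.Chat.Completions.ChatCompletionContentPartText & {
	cache_control?: { type: "ephemeral" }
}

// Shared helper that both the OpenAI-compatible and OpenRouter providers
// could call to build a cacheable text part.
function cacheableText(text: string): CacheControlTextPart {
	return { type: "text", text, cache_control: { type: "ephemeral" } }
}
```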

@mrubens (Collaborator) left a comment:

Thank you!

@dosubot bot added the lgtm (This PR has been approved by a maintainer) label Mar 12, 2025
@mrubens merged commit 9b5ee27 into RooCodeInc:main Mar 12, 2025
18 checks passed
@github-project-automation bot moved this from New to Done in Roo Code Roadmap Mar 12, 2025
@dleen deleted the cache branch March 12, 2025 17:53