Add provider_metadata to Vercel UI adapter, event stream and certain models handling #3754
Conversation
tool_call_id=tool_call_id,
content=output,
provider_name=provider_name,
provider_details=provider_details,
Hmm, in Pydantic AI we can have separate provider metadata on the call and return parts, but Vercel represents them as a single part. So we need to make sure they don't accidentally get merged, somehow, so that all the fields make it back to the correct part.
Not sure I'm getting it right, but I did move retrieving these details to the top, so both call and return parts should end up with the same values.
I might be confused; I can also remove this.
The problem is that in Pydantic AI, (Builtin)ToolCallPart and (Builtin)ToolReturnPart both have their own provider_details, but Vercel AI combines both into one part (ToolOutputAvailablePart etc).
We need to make sure that when we translate the 2 Pydantic AI parts into 1 Vercel AI part and back into 2 Pydantic AI parts, they end up with the original provider_details, not with all details on one of the 2 parts, or with both parts duplicating the same details. So I think in this case we should store 2 dicts on the Vercel AI part, so that we can separately store and extract the details for the call and those for the result.
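For illustration, a minimal sketch of keeping the two dicts separate on the single Vercel AI part; the key names below are hypothetical, not the ones the PR settles on:

# Hypothetical layout of the providerMetadata stored on the combined Vercel AI
# tool part; 'call_provider_details' / 'return_provider_details' are made-up
# key names used only to illustrate keeping the two dicts apart.
provider_metadata = {
    'pydantic_ai': {
        'call_provider_details': {'tool_type': 'web_search_preview'},  # from the call part
        'return_provider_details': {'status': 'completed'},  # from the return part
    }
}

# On the way back, each dict is restored to its own Pydantic AI part:
call_details = provider_metadata['pydantic_ai'].get('call_provider_details')
return_details = provider_metadata['pydantic_ai'].get('return_provider_details')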
return output.get('return_value', output)


def form_provider_metadata(**kwargs: ProviderDetailsDelta | str) -> ProviderMetadata | None:
Can we make this a method on VercelAIAdapter like _get_pydantic_ai_meta? To match the load/dump_messages methods, I'd call them _load_part_metadata and _dump_part_metadata
I created those 2 methods under VercelAIAdapter: _get_pydantic_ai_meta became _load_part_metadata, and I added _dump_part_metadata.
However, does that mean we should remove this form_provider_metadata function? I think it'd be good to reuse the same function to standardize. If so, I need to make the _dump_part_metadata function public and use it in _event_stream.py.
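A rough sketch of what those two adapter methods might look like; the signatures here are illustrative guesses, not the PR's actual code:

class VercelAIAdapter:
    @staticmethod
    def _dump_part_metadata(
        *,
        id: str | None = None,
        provider_name: str | None = None,
        provider_details: dict[str, Any] | None = None,
    ) -> ProviderMetadata | None:
        # Pack a Pydantic AI part's metadata into Vercel AI providerMetadata,
        # returning None when there is nothing to store.
        ...

    @staticmethod
    def _load_part_metadata(
        metadata: ProviderMetadata | None,
    ) -> tuple[str | None, dict[str, Any] | None]:
        # Reverse of _dump_part_metadata: extract (provider_name, provider_details)
        # from a Vercel AI part's providerMetadata.
        ...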
Yeah I'd rather use the same _dump_part_metadata here. I'm OK with the event stream using private methods from the adapter; I'd rather not have this be user-facing public
Thanks for the thorough review, I'll try to get it all right.

> Yeah I'd rather use the same _dump_part_metadata here. I'm OK with the event stream using private methods from the adapter; I'd rather not have this be user-facing public

Just realized this would cause a circular import (the adapter imports VercelAIEventStream). I could move it to _utils.py.
I could also keep this function in _event_stream.py. I don't think it's user-facing, since it's not exported in the __all__ declaration.
I've decided to move both functions to _utils.py, and moved some code around to _models.py to prevent circular import errors. Let me know if this is unwanted.
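For reference, the dependency direction this implies might look like the following; the contents listed per module are assumptions based on this thread, not the PR's exact layout:

# _utils.py        -- shared helpers such as the metadata dump/load functions
# _models.py       -- Vercel AI part/chunk models; imports neither module below
# _event_stream.py -- imports _utils (and _models), not _adapter
# _adapter.py      -- imports _utils, _models and _event_stream; no cycle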
builtin_return_meta = VercelAIAdapter._dump_part_metadata(
    id=part.id,
    provider_name=builtin_return.provider_name,
    provider_details=builtin_return.provider_details,
)
I made this change because I think it's more accurate, but it does differ from the old behaviour. See 9748bbb for the corresponding change to the test.
See above; we should store the data of both the call and the return, we can't lose any piece of data (like the 'tool_type': 'web_search_preview')
tool_call_id=tool_call_id,
args=args,
id=part_id,
provider_details=provider_details,
Hmm ToolCallPart doesn't have provider_name, but in some places we only use ToolCallPart.provider_details if ModelResponse.provider_name has the expected value:
pydantic-ai/pydantic_ai_slim/pydantic_ai/models/google.py, lines 914 to 918 in b5dee9b:

if (
    item.provider_details
    and (thought_signature := item.provider_details.get('thought_signature'))
    and m.provider_name == provider_name
):
So to make this work properly with Google thought signatures, we need to make sure the provider_name is also stored on the ModelResponse we build here from the Vercel AI data. Can you see how we would get that from Pydantic AI to Vercel AI format and back onto the ModelResponse?
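One illustrative way to achieve that round-trip (key names hypothetical): include the response-level provider_name in the metadata dumped for the part, and read it back when the ModelResponse is rebuilt:

# Dumping (Pydantic AI -> Vercel AI): also record the enclosing response's provider.
part_metadata = {
    'pydantic_ai': {
        'provider_name': response.provider_name,  # 'response' is the source ModelResponse
        'provider_details': item.provider_details,
    }
}

# Loading (Vercel AI -> Pydantic AI): restore it onto the rebuilt ModelResponse.
meta = part_metadata.get('pydantic_ai', {})
model_response = ModelResponse(parts=[part], provider_name=meta.get('provider_name'))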
I don't think we are passing this data in _message_builder when converting to Pydantic AI messages:

@dataclass
class MessagesBuilder:
    """Helper class to build Pydantic AI messages from request/response parts."""

    messages: list[ModelMessage] = field(default_factory=list)

    def add(self, part: ModelRequestPart | ModelResponsePart) -> None:
        """Add a new part, creating a new request or response message if necessary."""
        last_message = self.messages[-1] if self.messages else None
        if isinstance(part, get_union_args(ModelRequestPart)):
            part = cast(ModelRequestPart, part)
            if isinstance(last_message, ModelRequest):
                last_message.parts = [*last_message.parts, part]
            else:
                self.messages.append(ModelRequest(parts=[part]))  # <-- We don't pass additional data here
        else:
            part = cast(ModelResponsePart, part)
            if isinstance(last_message, ModelResponse):
                last_message.parts = [*last_message.parts, part]
            else:
                self.messages.append(ModelResponse(parts=[part]))  # <-- And here

We could infer the details from the part:

    def add(self, part: ModelRequestPart | ModelResponsePart) -> None:
        """Add a new part, creating a new request or response message if necessary."""
        ...
        else:
            part = cast(ModelResponsePart, part)
            if isinstance(last_message, ModelResponse):
                last_message.parts = [*last_message.parts, part]
            else:
                self.messages.append(ModelResponse(parts=[part], provider_name=part.provider_name))  # <-- here

and build the metadata as parts come in.
Another option is to modify the add function to accept more arguments and pass them through to ModelResponse 🤔. Anyway, I'd prefer this to be another PR.
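A sketch of that second option, for concreteness; the extra keyword-only argument is hypothetical, not what the PR implements:

def add(
    self,
    part: ModelRequestPart | ModelResponsePart,
    *,
    provider_name: str | None = None,  # hypothetical extra argument
) -> None:
    """Add a part, passing provider_name through when a new response message is created."""
    ...
    self.messages.append(ModelResponse(parts=[part], provider_name=provider_name))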
I think it's worth fixing this in this PR, because the goal here is to make encrypted thinking work reliably when data goes from Pydantic AI to Vercel AI and back to Pydantic AI. That means making all fields that matter to thinking survive that round-trip, which includes this one :(
One way to do that would be to see if we can add extra fields to add(), making sure it works even if there's no part on ModelResponse.parts that actually has the provider_name, so it would need to be read off ModelResponse.provider_name itself.
But it may be better to add a provider_name field everywhere we currently have provider_details (ToolCallPart, TextPart, etc.), and then make sure it's always set wherever we set provider_details. That may touch more code but is a lot cleaner, so please give that a try.
@DouweM, I assume you mean modifying the part like this:

class ToolCallPart(BaseToolCallPart):
    """A tool call from a model."""

    _: KW_ONLY

    provider_name: str | None = None  # <---- NEW
    """The name of the provider that generated the response."""
    ...

I can make the changes for _event_stream.py, _adapter.py and the Vercel AI related code; the surface area is not large. But doing the wiring for all the providers is, imo, going to be very large.

Example for the issue you mentioned:

> ToolCallPart doesn't have provider_name, but in some places we only use ToolCallPart.provider_details

I'm guessing we would modify google.py like this, or something similar:

if (
    item.provider_details
    and (thought_signature := item.provider_details.get('thought_signature'))
    and (m.provider_name == provider_name or item.provider_name == provider_name)  # <-- check item provider
):
@Light2Dark Yep, adding provider_name to ToolCallPart and TextPart and any others that currently have provider_details.

> But doing the wiring for all the providers is imo going to be very large.

I believe it's just one instance in OpenAIResponsesModel:

items.append(TextPart(content.text, id=item.id, provider_details=part_provider_details))

And a bunch in GoogleModel (some of which may require changes to methods like handle_tool_call_delta, but that should be straightforward):

part=FilePart(content=BinaryContent.narrow_type(content), provider_details=provider_details),

provider_details=provider_details,

pydantic-ai/pydantic_ai_slim/pydantic_ai/models/google.py, lines 827 to 834 in fbe0dfe:

    for event in self._parts_manager.handle_thinking_delta(
        vendor_part_id=None, content=part.text, provider_details=provider_details
    ):
        yield event
else:
    for event in self._parts_manager.handle_text_delta(
        vendor_part_id=None, content=part.text, provider_details=provider_details
    ):

item.provider_details = {**(item.provider_details or {}), **provider_details}
Really appreciate you working on this, we're almost there!
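As an illustration of the wiring being asked for here, a sketch assuming the model can reach its provider's name via self._provider.name (the exact accessor may differ):

# Hypothetical: wherever a model currently sets provider_details on a part,
# also set the new provider_name field so both survive the round-trip.
items.append(
    TextPart(
        content.text,
        id=item.id,
        provider_details=part_provider_details,
        provider_name=self._provider.name,  # assumed accessor, for illustration
    )
)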
Force-pushed from ae631b4 to 81bf84c:

* add provider_name and wiring for models
* only set provider_name if provider_details exists
* don't set obj parts
Fixes #3748. Pulls provider details from reasoning-end blocks and passes them on as provider_metadata.
Should fix the Anthropic issues: https://platform.claude.com/docs/en/build-with-claude/extended-thinking#extended-thinking-with-tool-use
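For context, a sketch of the kind of stream chunk this concerns; the signature value and exact layout below are illustrative, not captured output:

# Illustrative 'reasoning-end' chunk carrying provider metadata, so that e.g.
# Anthropic's extended-thinking signature survives the trip through the UI.
chunk = {
    'type': 'reasoning-end',
    'id': 'reasoning-0',
    'providerMetadata': {
        'anthropic': {'signature': '<opaque-signature>'},
    },
}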
Affected chunks (event-stream)
Affected parts (adapter)
These messages have been modified to add provider_name