Migrating Messages to AI SDK v5 UIMessage Format #7988

khanhduyvt0101 · 2025-08-12T13:21:48Z

khanhduyvt0101
Aug 12, 2025

Problem

After upgrading to AI SDK v5, I need to migrate existing chat messages in PostgreSQL. My messages currently have both the old-format fields (content, annotations) and the new parts array, but tool invocations still use the old state: "result" instead of one of v5's four states. The messages are stored in a jsonb column.

Current Message Format (Mixed)

{
  "id": "msg-S8wpAp3QWIHXEuufOxeA2vEt",
  "role": "assistant",
  "content": "can you find papers...",  // Old field - should be removed
  "annotations": [{"type": "file", "hasFile": false}],  // Old field
  "parts": [
    {
      "type": "tool-invocation",
      "toolInvocation": {
        "state": "result",  // Old state - should be "output-available"
        "result": { "results": [...] },
        "toolName": "semantic_scholar_search"
      }
    },
    {
      "type": "text", 
      "text": "I found several papers..."
    }
  ]
}

Target AI SDK v5 Format

interface UIMessage {
  id: string;
  role: "user" | "assistant" | "system";
  parts: MessagePart[];
  metadata?: Record<string, any>;
}

// Tool states: "input-streaming" | "input-available" | "output-available" | "output-error"
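
For reference, the state mapping could be sketched roughly as follows. This is a minimal illustration, not the SDK's own code: the helper name toV5ToolPart and the V4ToolInvocationPart type are hypothetical, and it assumes the three v4 states (partial-call, call, result) map onto the first three v5 states, with output-error having no v4 counterpart.

```typescript
// Hypothetical types and helper, for illustration only.
type V4ToolState = "partial-call" | "call" | "result";
type V5ToolState = "input-streaming" | "input-available" | "output-available" | "output-error";

// Assumed mapping; v4 has no state that corresponds to "output-error".
const V4_TO_V5_STATE: Record<V4ToolState, V5ToolState> = {
  "partial-call": "input-streaming",
  call: "input-available",
  result: "output-available",
};

interface V4ToolInvocationPart {
  type: "tool-invocation";
  toolInvocation: {
    state: V4ToolState;
    toolCallId: string;
    toolName: string;
    args?: unknown;
    result?: unknown;
  };
}

function toV5ToolPart(part: V4ToolInvocationPart) {
  const { state, toolCallId, toolName, args, result } = part.toolInvocation;
  const base = {
    type: `tool-${toolName}`, // v5 encodes the tool name in the part type
    toolCallId,
    state: V4_TO_V5_STATE[state],
    input: args, // v4 "args" becomes v5 "input"
  };
  // Only finished invocations carry an output (v4 "result" becomes v5 "output").
  return state === "result" ? { ...base, output: result } : base;
}
```
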

Questions

Best migration approach for ~500k messages?
- Batch migration script vs on-the-fly conversion?
- How to handle the state mapping (result → output-available)?
Should I remove legacy fields (content, annotations) or keep for backward compatibility?
How to handle tool invocations that might fail during migration?

Looking for advice on:

PostgreSQL JSONB migration best practices
Handling large-scale migrations without downtime
Rollback strategies

Thanks! 🙏


Theonlyhamstertoh · 2025-08-25T01:28:47Z

Theonlyhamstertoh
Aug 25, 2025

Same issue here, any update on this?

0 replies

bakeyevrus · 2025-09-10T10:17:58Z

bakeyevrus
Sep 10, 2025

I have the same problem and, honestly, I'm a little surprised that the Vercel AI SDK migration docs don't mention it.

@khanhduyvt0101, how did you proceed with the issue?

3 replies

khanhduyvt0101 Sep 10, 2025
Author

I tried my best to migrate all the old message JSON to the new message JSON, and when the AI SDK throws an error indicating that a message is invalid, I force the user to create a new chat. I think this is the best approach for now.
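
A best-effort load with a "start a new chat" fallback might look roughly like the sketch below. It is only an illustration under assumptions: StoredChat, LoadResult, and loadChat are hypothetical names, and the check is done up front with a caller-supplied converter rather than by catching an error from the SDK.

```typescript
// Hypothetical shapes and names, for illustration only.
interface StoredChat {
  id: string;
  messages: unknown[];
}

type LoadResult<M> =
  | { kind: "ok"; messages: M[] }
  | { kind: "needs-new-chat"; chatId: string; reason: string };

function loadChat<M>(
  chat: StoredChat,
  convert: (raw: unknown) => M | null, // returns null when a message cannot be converted
): LoadResult<M> {
  const messages: M[] = [];
  for (let i = 0; i < chat.messages.length; i++) {
    const converted = convert(chat.messages[i]);
    if (converted === null) {
      // One bad message makes the whole history unusable, so ask for a new chat.
      return { kind: "needs-new-chat", chatId: chat.id, reason: `message ${i} could not be converted` };
    }
    messages.push(converted);
  }
  return { kind: "ok", messages };
}
```

The caller can then show the history when kind is "ok" and otherwise prompt the user to start a new chat.
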

Answer selected by khanhduyvt0101

bakeyevrus Sep 10, 2025

Would it be too much to ask you to share a snippet of JS (or SQL) that converts an old JSON format into a new one? I think this could be useful for many people as well.

khanhduyvt0101 Sep 10, 2025
Author

Unfortunately, I deleted it after having a terrible migration 🥲

ian · 2025-09-16T00:35:27Z

ian
Sep 16, 2025

We just got through our 4->5 migration and it was a nightmare, took about 3-4 weeks. Happy to share what worked for us.

We renamed message.parts -> message.parts_old and added a new JSON field, message.parts. From there we wrote a script that reads message.content, message.attachments, and message.parts_old and writes the result to message.parts.

This preserves the original fields, so we also wrote an audit script that checks message.parts and verifies that content, attachments (where we stored them), and parts_old were mapped correctly.
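
An audit check along these lines could be sketched as follows. This is not the script described above, only a guess at its general shape: the MessageRow type and auditRow function are hypothetical, and the check is limited to comparing text and counting tool parts.

```typescript
// Hypothetical row shape modeled on the description above.
interface Part {
  type: string;
  text?: string;
}

interface MessageRow {
  id: string;
  content: string;
  parts_old: Part[] | null;
  parts: Part[];
}

// Concatenate the text of all text parts.
const joinText = (parts: Part[]): string =>
  parts
    .filter((p) => p.type === "text")
    .map((p) => p.text ?? "")
    .join("");

// Returns a list of problems; an empty list means the row passed the audit.
function auditRow(row: MessageRow): string[] {
  const problems: string[] = [];
  const oldParts = row.parts_old ?? [];

  // Text should survive unchanged; fall back to content when there were no old parts.
  const expectedText = oldParts.length > 0 ? joinText(oldParts) : row.content;
  if (joinText(row.parts) !== expectedText) problems.push("text mismatch");

  // Every old tool invocation should have become exactly one tool-<name> part.
  const oldTools = oldParts.filter((p) => p.type === "tool-invocation").length;
  const newTools = row.parts.filter(
    (p) => p.type.startsWith("tool-") && p.type !== "tool-invocation",
  ).length;
  if (oldTools !== newTools) problems.push(`tool count ${oldTools} -> ${newTools}`);

  // No v4-style part should remain in the new field.
  if (row.parts.some((p) => p.type === "tool-invocation")) problems.push("legacy tool-invocation part left over");
  return problems;
}
```
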

Can share some snippets if it's helpful.

1 reply

nicoalbanese Sep 16, 2025
Maintainer

Hey @ian! Sorry to hear it was tricky but happy it's resolved. Would love to check out those scripts!

Theonlyhamstertoh · 2025-09-23T01:37:27Z

Theonlyhamstertoh
Sep 23, 2025

I just went through the v4 -> v5 migration as well. I created a 1-to-1 mapping of the v4 and v5 vercel/ai types to zod schemas, then transformed each v4 schema into its v5 counterpart so that the conversion is completely type-safe and validated. Once the conversion is done, I use the zod schema for the v5 UIMessage to validate that all parts, attachments, and reasoning are correct. Take a look at the functions below and edit them for your data structure, especially V4ToV5UIMessageTransform, where I skip some checks on deprecated fields. Hope this helps someone! Here is the code, with the types, schema, and demo from the AI SDK included.

import { z } from "zod";
import { inferMediaTypeFromUrl } from "./helper";

/**
 *
 * BASE SCHEMA
 *
 *
 */

/**
A JSON value can be a string, number, boolean, object, array, or null.
JSON values can be serialized and deserialized by the JSON.stringify and JSON.parse methods.
 */
type JSONValue =
    | null
    | string
    | number
    | boolean
    | {
          [value: string]: JSONValue;
      }
    | Array<JSONValue>;

// Base schemas
const JSONValueSchema: z.ZodType<JSONValue> = z.lazy(() =>
    z.union([z.string(), z.number(), z.boolean(), z.null(), z.array(JSONValueSchema), z.record(JSONValueSchema)])
);

const ProviderMetadataSchema = z.record(z.record(JSONValueSchema));

/**
 *
 * PARTS SCHEMA
 *
 *
 */

// SourceUIPart
const V4SourceUIPartSchema = z.object({
    type: z.literal("source"),
    source: z.object({
        sourceType: z.literal("url"),
        id: z.string(),
        url: z.string(),
        title: z.string().optional(),
        providerMetadata: ProviderMetadataSchema.optional(),
    }),
});

// skips source-document, since all v4 sourceUIPart were of source type url
const V4ToV5SourceUIPartTransform = V4SourceUIPartSchema.transform((v4) => ({
    type: "source-url",
    sourceId: v4.source.id,
    url: v4.source.url,
    title: v4.source.title,
    providerMetadata: v4.source.providerMetadata,
}));

// TextUIPart
const V4TextUIPartSchema = z.object({
    type: z.literal("text"),
    text: z.string(),
});

const V4ToV5TextUIPartTransform = V4TextUIPartSchema.transform((v4) => ({
    type: "text",
    text: v4.text,
    // Did not add state - v4 didn't have it
    // Did not add providerMetadata - v4 didn't have it
}));

// ReasoningUIPart
const V4ReasoningUIPartSchema = z.object({
    type: z.literal("reasoning"),
    reasoning: z.string(),
    details: z.array(
        z.union([
            z.object({
                type: z.literal("text"),
                text: z.string(),
                signature: z.string().optional(),
            }),
            z.object({
                type: z.literal("redacted"),
                data: z.string(),
            }),
        ])
    ),
});

const V4ToV5ReasoningUIPartTransform = V4ReasoningUIPartSchema.transform((v4) => ({
    type: "reasoning",
    text: v4.reasoning,
    // Did not add state - v4 didn't have it
    // Did not add providerMetadata - v4 didn't have it
}));

// The data message, not used...
const V4DataMessageSchema = z.object({
    id: z.string(),
    role: z.literal("data"),
    content: z.string(),
    data: JSONValueSchema.optional(),
});

// FileUIPart
const V4FileUIPartSchema = z.object({
    type: z.literal("file"),
    mimeType: z.string(),
    data: z.string(),
});
const V4ToV5FileUIPartTransform = V4FileUIPartSchema.transform((v4) => ({
    type: "file",
    mediaType: v4.mimeType,
    url: v4.data, // data url from v4
    // Did not add filename - v4 didn't have it
    // Did not add providerMetadata - v4 didn't have it
}));

// Step start
const V4StepStartUIPartSchema = z.object({
    type: z.literal("step-start"),
});

const V4ToV5StepStartUIPartTransform = V4StepStartUIPartSchema.transform((_v4) => ({
    type: "step-start",
}));

// Attachment -> File part
const V4AttachmentSchema = z.object({
    name: z.string().optional(),
    contentType: z.string().optional(),
    url: z.string(),
});

// There are some issues here: v5 requires mediaType, but the v4 equivalent (contentType) is optional.
const AttachmentToV5FileTransform = V4AttachmentSchema.transform((v4) => {
    // application/octet-stream is a generic mediaType for unknown file
    const mediaType = v4.contentType?.trim() || inferMediaTypeFromUrl(v4.url) || "application/octet-stream";
    return {
        type: "file",
        mediaType,
        filename: v4.name,
        url: v4.url,
        // providerMetadata will be undefined since attachments don't have it
    };
});

// ToolInvocation schemas
const V4ToolInvocationUnionSchema = z.union([
    z.object({
        state: z.literal("partial-call"),
        step: z.number().optional(),
        toolCallId: z.string(),
        toolName: z.string(),
        args: JSONValueSchema,
    }),
    z.object({
        state: z.literal("call"),
        step: z.number().optional(),
        toolCallId: z.string(),
        toolName: z.string(),
        args: JSONValueSchema,
    }),
    z.object({
        state: z.literal("result"),
        step: z.number().optional(),
        toolCallId: z.string(),
        toolName: z.string(),
        args: JSONValueSchema,
        result: JSONValueSchema,
    }),
]);

const V4ToolInvocationSchema = z.object({
    type: z.literal("tool-invocation"),
    toolInvocation: V4ToolInvocationUnionSchema,
});

const V4ToV5ToolUIPartTransform = V4ToolInvocationSchema.transform((v4) => {
    const tool = v4.toolInvocation;
    if (tool.state === "result") {
        return {
            type: `tool-${tool.toolName}`,
            toolCallId: tool.toolCallId,
            state: "output-available",
            input: tool.args,
            output: tool.result,
            // skipped providerExecuted, v4 didn't have it
            // skipped callProviderMetadata, v4 didn't have it
            // skipped errorText, it only applies to the output-error state
        };
    }
    if (tool.state === "call") {
        return {
            type: `tool-${tool.toolName}`,
            toolCallId: tool.toolCallId,
            state: "input-available",
            input: tool.args,
        };
    }
    // skipped output-error, v4 toolInvocation didn't have it
    // partial-call
    return {
        type: `tool-${tool.toolName}`,
        toolCallId: tool.toolCallId,
        state: "input-streaming",
        input: tool.args,
    };
});

const V4PartsSchema = z.discriminatedUnion("type", [
    V4SourceUIPartSchema,
    V4TextUIPartSchema,
    V4ReasoningUIPartSchema,
    V4FileUIPartSchema,
    V4StepStartUIPartSchema,
    V4ToolInvocationSchema,
]);

const V4UIMessageSchema = z.object({
    id: z.string(),
    // z.coerce.date() also accepts the ISO strings that a Date becomes once stored as JSON
    createdAt: z.coerce.date().optional(),
    content: z.string(),
    reasoning: z.string().optional(), // deprecated
    experimental_attachments: z.array(V4AttachmentSchema).optional(),
    role: z.enum(["system", "user", "assistant", "data"]),
    data: JSONValueSchema.optional(), // deprecated
    annotations: z.array(JSONValueSchema).optional(),
    toolInvocations: z.array(V4ToolInvocationUnionSchema).optional(),
    parts: z.array(V4PartsSchema).optional(),
});

/**
 * UNIVERSAL PART CONVERTER
 */

// Helper function to convert a single V4 part to V5
const convertV4PartToV5Part = (part?: any): any => {
    if (!part) return null;
    const transforms = {
        text: V4ToV5TextUIPartTransform,
        reasoning: V4ToV5ReasoningUIPartTransform,
        "tool-invocation": V4ToV5ToolUIPartTransform,
        source: V4ToV5SourceUIPartTransform,
        file: V4ToV5FileUIPartTransform,
        "step-start": V4ToV5StepStartUIPartTransform,
    };

    type TransformKey = keyof typeof transforms;
    const key = (part as { type?: unknown })?.type;
    const isTransformKey = (val: unknown): val is TransformKey => typeof val === "string" && val in transforms;

    if (isTransformKey(key)) {
        const result = transforms[key].safeParse(part);
        if (result.success) {
            return result.data;
        }
    }

    return null;
};

/**
 * MESSAGE CONVERTER
 */

const V4ToV5UIMessageTransform = V4UIMessageSchema.transform((v4) => {
    const convertedParts = [];

    if (v4.role === "data") {
        // I didn't have time to look into where this maps to in v5,
        // so return null and skip it.
        return null;
    }

    // Convert content to text part if it exists

    if (v4.content) {
        // not needed in my use case but please change this if this applies to you
    }

    // Convert parts array if it exists
    if (v4.parts) {
        const partResults = v4.parts
            .map(convertV4PartToV5Part)
            .filter((part): part is NonNullable<typeof part> => part !== null);

        convertedParts.push(...partResults);
    }

    // Convert attachments to file parts if they exist
    if (v4.experimental_attachments) {
        const attachmentResults = v4.experimental_attachments
            .map((attachment) => {
                const result = AttachmentToV5FileTransform.safeParse(attachment);
                return result.success ? result.data : null;
            })
            .filter((part) => part !== null);

        convertedParts.push(...attachmentResults);
    }

    // Convert toolInvocations to tool parts if they exist
    // if (v4.toolInvocations) {
    //     const toolResults = v4.toolInvocations
    //         .map((toolInvocation) => {
    //             const result = V4ToV5ToolUIPartTransform.safeParse({
    //                 type: "tool-invocation",
    //                 toolInvocation,
    //             });
    //             return result.success ? result.data : null;
    //         })
    //         .filter((part): part is V5_ToolUIPart => part !== null);

    //     convertedParts.push(...toolResults);
    // }

    return {
        id: v4.id,
        role: v4.role,
        parts: convertedParts,
        // edit for your use case
    };
});

/**
 * MAIN CONVERSION FUNCTIONS
 */

export function convertV4MessageToV5(v4Message: unknown) {
    const result = V4ToV5UIMessageTransform.safeParse(v4Message);
    return result.success ? result.data : null;
}

/**
 *
 * V5 Schema
 *
 */

const V5TextUIPartSchema = z.object({
    type: z.literal("text"),
    text: z.string(),
    state: z.enum(["streaming", "done"]).optional(),
    providerMetadata: ProviderMetadataSchema.optional(),
});

const V5ReasoningUIPartSchema = z.object({
    type: z.literal("reasoning"),
    text: z.string(),
    state: z.enum(["streaming", "done"]).optional(),
    providerMetadata: ProviderMetadataSchema.optional(),
});

const V5SourceURLUIPartSchema = z.object({
    type: z.literal("source-url"),
    sourceId: z.string(),
    url: z.string(),
    title: z.string().optional(),
    providerMetadata: ProviderMetadataSchema.optional(),
});

const V5SourceDocumentUIPartSchema = z.object({
    type: z.literal("source-document"),
    sourceId: z.string(),
    mediaType: z.string(),
    title: z.string(),
    filename: z.string().optional(),
    providerMetadata: ProviderMetadataSchema.optional(),
});

const V5ToolTypeSchema: z.ZodType<`tool-${string}`> = z.custom(
    (val) => typeof val === "string" && val.startsWith("tool-")
);
const V5BaseToolSchema = z.object({
    type: V5ToolTypeSchema,
    toolCallId: z.string(),
    providerExecuted: z.boolean().optional(),
});
const V5ToolUIPartSchema = z.discriminatedUnion("state", [
    V5BaseToolSchema.extend({
        state: z.literal("input-streaming"),
        input: JSONValueSchema.optional(), // may be undefined while the input is still streaming
    }),
    V5BaseToolSchema.extend({
        state: z.literal("input-available"),
        input: JSONValueSchema,
        callProviderMetadata: ProviderMetadataSchema.optional(),
    }),
    V5BaseToolSchema.extend({
        state: z.literal("output-available"),
        input: JSONValueSchema,
        output: JSONValueSchema,
        callProviderMetadata: ProviderMetadataSchema.optional(),
        preliminary: z.boolean().optional(),
    }),
    V5BaseToolSchema.extend({
        state: z.literal("output-error"),
        input: JSONValueSchema,
        rawInput: JSONValueSchema.optional(),
        callProviderMetadata: ProviderMetadataSchema.optional(),
        errorText: z.string(),
    }),
]);

const V5DynamicToolSchema = z.object({
    type: z.literal("dynamic-tool"),
    toolName: z.string(),
    toolCallId: z.string(),
});

// zod issue, if you don't use `as type`, zod will complain that input: z.unknown() = unknown | undefined, instead of just unknown.
const V5DynamicToolUIPartSchema = z.discriminatedUnion("state", [
    V5DynamicToolSchema.extend({
        state: z.literal("input-streaming"),
        input: z.unknown(),
    }),
    V5DynamicToolSchema.extend({
        state: z.literal("input-available"),
        input: z.unknown(),
        callProviderMetadata: ProviderMetadataSchema.optional(),
    }),
    V5DynamicToolSchema.extend({
        state: z.literal("output-available"),
        input: z.unknown(),
        output: z.unknown(),
        callProviderMetadata: ProviderMetadataSchema.optional(),
        preliminary: z.boolean().optional(),
    }),
    V5DynamicToolSchema.extend({
        state: z.literal("output-error"),
        input: z.unknown(),
        callProviderMetadata: ProviderMetadataSchema.optional(),
        errorText: z.string(),
    }),
]);

const V5FileUIPartSchema = z.object({
    type: z.literal("file"),
    mediaType: z.string(),
    filename: z.string().optional(),
    url: z.string(),
    providerMetadata: ProviderMetadataSchema.optional(),
});

const V5StepStartUIPartSchema = z.object({
    type: z.literal("step-start"),
});

const V5DataTypeSchema: z.ZodType<`data-${string}`> = z.custom(
    (val) => typeof val === "string" && val.startsWith("data-")
);
const V5DataUIPartSchema = z.object({
    type: V5DataTypeSchema,
    id: z.string().optional(),
    data: JSONValueSchema,
});

const V5UIPartSchema = z.union([
    V5TextUIPartSchema,
    V5ReasoningUIPartSchema,
    V5SourceURLUIPartSchema,
    V5SourceDocumentUIPartSchema,
    V5ToolUIPartSchema,
    V5DynamicToolUIPartSchema,
    V5FileUIPartSchema,
    V5StepStartUIPartSchema,
    V5DataUIPartSchema,
]);

const V5UIMessageSchema = z.object({
    id: z.string(),
    role: z.enum(["system", "user", "assistant"]),
    parts: z.array(V5UIPartSchema),
    metadata: JSONValueSchema.optional(),
});

export { convertV4PartToV5Part, V4ToV5UIMessageTransform, V5UIMessageSchema };

// helper.ts (imported above as "./helper")
// AI-generated list of extension mappings; customize for your purposes.
export const inferMediaTypeFromUrl = (url: string): string | null => {
    try {
        if (url.startsWith("data:")) {
            const match = url.match(/^data:([^;,]+)/);
            return match ? match[1] : null;
        }

        let pathname = "";
        try {
            pathname = new URL(url).pathname.toLowerCase();
        } catch {
            pathname = url.toLowerCase();
        }
        const ext = pathname.split(".").pop() || "";

        const map: Record<string, string> = {
            pdf: "application/pdf",
            txt: "text/plain",
            md: "text/markdown",
            html: "text/html",
            css: "text/css",
            js: "text/javascript",
            ts: "text/typescript",
            json: "application/json",
            xml: "application/xml",
            csv: "text/csv",
            jpg: "image/jpeg",
            jpeg: "image/jpeg",
            png: "image/png",
            gif: "image/gif",
            svg: "image/svg+xml",
            webp: "image/webp",
            ico: "image/x-icon",
            mp3: "audio/mpeg",
            wav: "audio/wav",
            mp4: "video/mp4",
            webm: "video/webm",
            zip: "application/zip",
            gz: "application/gzip",
        };

        return map[ext] ?? null;
    } catch {
        return null;
    }
};
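
For a table of ~500k messages, a converter like convertV4MessageToV5 above could be driven in batches. The sketch below is only one possible shape, under assumptions: Row, BatchDeps, and migrateInBatches are hypothetical names, the database access is injected as callbacks, and rows are assumed to be paged by an ordered id (keyset pagination) so each batch is a short, independent unit of work.

```typescript
// Hypothetical row shape and callback names; wire these to your own database layer.
interface Row {
  id: string;
  message: unknown;
}

interface BatchDeps<M> {
  // Must return up to `limit` rows with id > lastId, ordered by id.
  fetchAfter: (lastId: string | null, limit: number) => Promise<Row[]>;
  // Returns null when a message cannot be converted.
  convert: (raw: unknown) => M | null;
  save: (id: string, converted: M) => Promise<void>;
}

async function migrateInBatches<M>(deps: BatchDeps<M>, batchSize = 1000) {
  let lastId: string | null = null;
  let migrated = 0;
  const failedIds: string[] = [];

  for (;;) {
    const rows = await deps.fetchAfter(lastId, batchSize);
    if (rows.length === 0) break;

    for (const row of rows) {
      const converted = deps.convert(row.message);
      if (converted === null) {
        failedIds.push(row.id); // leave the row untouched and report it
      } else {
        await deps.save(row.id, converted);
        migrated++;
      }
    }
    lastId = rows[rows.length - 1].id; // resume after the last row seen
  }
  return { migrated, failedIds };
}
```

Collecting failedIds instead of throwing lets the run finish, and the failed rows can be inspected or retried afterwards.
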

0 replies

2025-11-01T20:47:27Z

github-actions[bot]
bot Nov 1, 2025

This discussion was automatically locked because it has not been updated in over 30 days. If you still have questions about this topic, please ask us at community.vercel.com/ai-sdk

0 replies
