Add multimodal embeddings support by virgildotcodes · Pull Request #351 · laravel/ai

virgildotcodes · 2026-04-05T06:04:27Z

Closes #308

Summary

Laravel AI currently only supports text inputs for embeddings. Prism added multimodal embeddings support, but the Laravel SDK did not expose it cleanly and had a few provider-specific gaps.

This PR adds multimodal embeddings input support for embeddings, including:

images
audio
documents
video

It also validates unsupported provider / model combinations early, preserves the original media source when mapping Prism embeddings inputs, and avoids fetching remote media when generating embeddings cache keys.

Examples

Gemini multimodal embeddings

use Laravel\Ai\Embeddings;
use Laravel\Ai\Files\Image;

$response = Embeddings::for([
    Image::fromPath('/path/to/image.png'),
])->generate(provider: 'gemini', model: 'gemini-embedding-2-preview');

Voyage AI image embeddings

use Laravel\Ai\Embeddings;
use Laravel\Ai\Files\Image;

$response = Embeddings::for([
    Image::fromUrl('https://example.com/image.png'),
])->dimensions(1024)->generate(
    provider: 'voyageai',
    model: 'voyage-multimodal-3',
);

Changes

widen embeddings inputs to accept text, images, audio, documents, and video
add explicit validation for unsupported provider / input combinations
resolve Gemini provider-backed files to file URIs before sending them to Prism
preserve remote, local, stored, and base64 media sources when converting embeddings inputs
avoid remote fetches when building cache keys for remote embeddings inputs

Notes

Gemini multimodal embeddings require gemini-embedding-2-preview
Voyage AI currently supports text and image embeddings inputs

Add multimodal embeddings support

d87bd94

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multimodal embeddings support#351

Add multimodal embeddings support#351
virgildotcodes wants to merge 1 commit intolaravel:0.xfrom
virgildotcodes:gemini-multimodal-embeddings

virgildotcodes commented Apr 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

virgildotcodes commented Apr 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Examples

Gemini multimodal embeddings

Voyage AI image embeddings

Changes

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

virgildotcodes commented Apr 5, 2026 •

edited

Loading