
feat: Add Multimodal RAG cookbook #242


Closed
wants to merge 6 commits

Conversation

@sjrl (Contributor) commented Jul 15, 2025

review-notebook-app bot: Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

@sjrl sjrl marked this pull request as ready for review July 21, 2025 12:06
@sjrl sjrl requested a review from a team as a code owner July 21, 2025 12:06
@sjrl (Contributor, Author) commented Jul 21, 2025

@bilgeyucel this is ready for review!

@anakin87 anakin87 self-requested a review July 21, 2025 12:27
review-notebook-app bot commented Jul 22, 2025

anakin87 commented on 2025-07-22T13:40:17Z
----------------------------------------------------------------

I would also explain that in the following Pipeline we are:

  • computing embeddings based on images for image files
  • converting PDF files to textual Documents and then computing embeddings based on the text
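For illustration, the branching described above boils down to routing files by MIME type before the two embedding paths. A minimal sketch of that decision logic (`route_by_type` is a hypothetical helper, not the actual Haystack component used in the Pipeline):

```python
import mimetypes

def route_by_type(paths):
    """Split file paths into image files and PDFs, mirroring the
    two indexing branches described above (hypothetical helper)."""
    images, pdfs = [], []
    for path in paths:
        mime, _ = mimetypes.guess_type(path)
        if mime and mime.startswith("image/"):
            images.append(path)
        elif mime == "application/pdf":
            pdfs.append(path)
    return images, pdfs

images, pdfs = route_by_type(["apple.jpg", "paper.pdf", "chart.png"])
print(images, pdfs)  # ['apple.jpg', 'chart.png'] ['paper.pdf']
```

In the real Pipeline this routing is done by a dedicated component; the sketch only shows which files go down which branch.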

sjrl commented on 2025-07-23T11:27:00Z
----------------------------------------------------------------

sounds good!

review-notebook-app bot commented Jul 22, 2025

anakin87 commented on 2025-07-22T13:40:17Z
----------------------------------------------------------------

Is it worth mentioning that other models (Jina Embeddings 4) might work better?


sjrl commented on 2025-07-23T11:27:23Z
----------------------------------------------------------------

Yeah I was thinking about that. I'll mention that here.

review-notebook-app bot commented Jul 22, 2025

anakin87 commented on 2025-07-22T13:40:18Z
----------------------------------------------------------------

Something seems off in the first sentence.


@anakin87 (Member) left a comment


Nice work! I left some comments (while waiting on a review from @bilgeyucel)

We should also create a new entry in index.toml.

@anakin87 anakin87 requested a review from bilgeyucel July 22, 2025 13:42

@sjrl sjrl requested a review from anakin87 July 24, 2025 07:45
review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:08:58Z
----------------------------------------------------------------

As a future note, let's add docs link for these components


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:08:59Z
----------------------------------------------------------------

"Next, we load our embedders with the sentence-transformers/clip-ViT-L-14 model that maps text and images to a shared vector space. It's important that we use the same CLIP model for both text and images to calculate the similarity between them."


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:08:59Z
----------------------------------------------------------------

Let's run the embedders and create vector embeddings for images to see how semantically similar our query is to the two images
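For intuition, the comparison itself is just cosine similarity in the shared embedding space. A toy sketch with made-up vectors standing in for real CLIP embeddings of the query and the two images:

```python
import numpy as np

def cosine_similarity(a, b):
    # Cosine similarity between two embedding vectors.
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Made-up stand-ins for CLIP embeddings; real ones come from the embedders.
query_emb = [1.0, 0.2, 0.0]
image_embs = {"apple.jpg": [0.9, 0.3, 0.1], "cat.jpg": [0.0, 0.1, 1.0]}

scores = {name: cosine_similarity(query_emb, emb) for name, emb in image_embs.items()}
best = max(scores, key=scores.get)
print(best)  # apple.jpg
```

With real CLIP embeddings the same score computation ranks the images by semantic closeness to the query text.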


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:00Z
----------------------------------------------------------------

As we can see, the text is most similar to our Apple image, as expected! So, the CLIP model can create correct representations for images and text.


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:01Z
----------------------------------------------------------------

Let's create an indexing pipeline to process our image and PDF files at once and write them to our Document Store.


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:01Z
----------------------------------------------------------------

Line #1.    #indexing_pipe.show()

This line creates a problem in the tutorial test, so let's comment it out.


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:02Z
----------------------------------------------------------------

"Run the indexing pipeline with a pdf and an image file"


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:03Z
----------------------------------------------------------------

Extra description here: Let's now set up our search and retrieve relevant data from our document store by passing a query.


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:03Z
----------------------------------------------------------------

Do we need the "but" here?


sjrl commented on 2025-07-28T09:26:04Z
----------------------------------------------------------------

Yes, I think so; otherwise users might think we only send the text version of the image to the LLM and not the image itself.

sjrl commented on 2025-07-28T09:26:22Z
----------------------------------------------------------------

But we can drop the "but" and leave the rest ;)

review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:04Z
----------------------------------------------------------------

Line #1.    indexing_pipe.show()

Again, commenting out


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:05Z
----------------------------------------------------------------

Can you explain what is happening in this pipeline? This is also the first time we have introduced this new prompt type, so it requires more explanation.

I think the important thing to mention here is that we do retrieval based on the image caption we created during indexing, but then we use the image itself for the generation part of the RAG pipeline. It's worth mentioning that we convert the image into a base64 string with DocumentToImageContent and render it in the prompt before we pass the prompt to a language model that can process both text and images.
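For reference, the base64 step mentioned here conceptually amounts to the following stdlib snippet (a sketch of the idea, not DocumentToImageContent's actual implementation):

```python
import base64

def image_to_base64(path):
    # Read the raw image bytes and encode them as a base64 string,
    # which can then be rendered into the prompt for a multimodal LLM.
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode("utf-8")

# The prompt would then embed the string, e.g. as a data URL:
#   f"data:image/jpeg;base64,{image_to_base64('apple.jpg')}"
```

The component additionally handles things like Document metadata; the sketch only shows the encoding itself.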


review-notebook-app bot commented Jul 25, 2025

bilgeyucel commented on 2025-07-25T10:09:05Z
----------------------------------------------------------------

Line #1.    #pipe.show()

Same comment, we should delete the output cell though, the image should be there.


@bilgeyucel (Contributor) left a comment

Left my comments, @sjrl 🙌 We should have this as a tutorial, so I will make further adjustments when we transfer it to the haystack-tutorials repo.


@sjrl (Contributor, Author) commented Jul 28, 2025

Closing since we are moving this to the tutorial repo deepset-ai/haystack-tutorials#409

@sjrl sjrl closed this Jul 28, 2025
Successfully merging this pull request may close these issues.

Create Image Indexing Example
3 participants