Would you consider supporting the use of a model's multimodal capabilities to read PDFs? #4729

isCopyman · 2025-12-30T16:33:35Z

isCopyman
Dec 30, 2025

Gemini web and aistudio can both use multimodal capabilities to directly read PDFs instead of relying on text parsing. This would save more tokens and allow direct reading of images within the PDF. Claude Code actually supports reading PDFs via the readfile tool, and Gemini CLI seems to have a similar functionality.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Would you consider supporting the use of a model's multimodal capabilities to read PDFs? #4729

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Would you consider supporting the use of a model's multimodal capabilities to read PDFs? #4729

Uh oh!

isCopyman Dec 30, 2025

Replies: 0 comments

isCopyman
Dec 30, 2025