LlamaParse Roadmap #19
Replies: 8 comments 3 replies
-
|
When will LlamaParse support Chinese? |
Beta Was this translation helpful? Give feedback.
-
|
Is the source code of the parsing logic itself available, or will it be made available? As far as I can see, currently only the code which accesses the remote point is available, but not any of the parsing logic. |
Beta Was this translation helpful? Give feedback.
-
|
Testing on Academic Papers Hi, I very much appreciate the initiative on LlamaParse as pdf-parsing is indeed a challenge. Putting this as an API is also cool and solves many installation issues. The idea of using LLM instructions to steer the parser is great, as is the separation of objects for tables and figures.
In most of these aspects, parsing by Nougat works much better (though this has problems on its own...e.g. with references or left out pages). |
Beta Was this translation helpful? Give feedback.
-
|
Will there be LlamaParse on prem, i.e. a version which can be run locally without API calls? This would make its use für confidential documents possible. Also a "No"-answer would be helpful for decision making. |
Beta Was this translation helpful? Give feedback.
-
|
I have tried Llama Parse to parse complex financial documents. I found it to be a great and innovative tool. |
Beta Was this translation helpful? Give feedback.
-
Any plans to integrate this in the near future? Especially with GPT-4o being multimodal, filling the missing data in parsed charts/images shouldn't be an issue. |
Beta Was this translation helpful? Give feedback.
-
|
My PDF only has images :) I hope LlamaParse can read image :) |
Beta Was this translation helpful? Give feedback.
-
|
A few things that would be high-value additions to LlamaParse from an agent pipeline perspective: Structured extraction with schema — beyond OCR and layout parsing, the ability to extract into a specified Pydantic/JSON Schema directly. "Parse this invoice and return a Incremental / delta parsing — when a document is updated (new page added, table modified), parse only the changed sections and return a delta. Avoids re-parsing stable pages. Crucial for long-lived knowledge bases where documents evolve. Parse quality confidence — a confidence score per extracted element (this table was cleanly parsed: 0.97; this blurry scan section: 0.61). Agents can use this to decide whether to re-fetch the source, flag for human review, or proceed with reduced confidence. Extraction lineage — for agent systems, knowing "this fact came from page 4, table 2, row 3 of document X (version 2026-01-15)" is essential for citation and audit. Parse results should carry enough metadata for the agent to build a provenance trail. Batch prioritization — when an agent submits 50 documents to parse, it should be able to flag 5 as urgent (needed for the current task) and 45 as background. Priority queue semantics rather than FIFO. We've been building document-aware agents in KinthAI that need reliable parsing as input to their knowledge consolidation step: https://blog.kinthai.ai/why-character-ai-forgets-you-persistent-memory-architecture covers how parsing quality affects downstream memory reliability. Are the roadmap priorities driven more by enterprise document processing needs or by developer/agent pipeline needs? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Currently LlamaParse supports complex PDF documents as input. We will extend LlamaParse in the coming weeks / months to support the following:
Of course, we will also work hard to address any issue that may arise. Please drop in the Issues tab!
Beta Was this translation helpful? Give feedback.
All reactions