Skip to content

fix: robustify parse of broken pdfs#228

Merged
cau-git merged 4 commits intomainfrom
fix/robustify-parse-of-broken-pdfs
Feb 23, 2026
Merged

fix: robustify parse of broken pdfs#228
cau-git merged 4 commits intomainfrom
fix/robustify-parse-of-broken-pdfs

Conversation

@PeterStaar-IBM
Copy link
Member

command

uv run python ./perf/run_perf.py -p docling ./broken_pdfs

before:

Summary for parser=docling
 - files:        96
 - pages total:  96
 - pages ok:     1
 - pages failed: 95
 - total sec:    0.026838
 - avg sec/page: 0.026838
 - p50: 0.026838  p90: 0.026838  p95: 0.026838  p99: 0.026838
 - min: 0.026838  max: 0.026838

after:

Summary for parser=docling
 - files:        96
 - pages total:  96
 - pages ok:     94
 - pages failed: 2
 - total sec:    12.991205
 - avg sec/page: 0.138204
 - p50: 0.053474  p90: 0.264782  p95: 0.495950  p99: 1.683184
 - min: 0.004947  max: 2.489215

broken_pdfs.zip

Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
Signed-off-by: Peter Staar <taa@zurich.ibm.com>
@github-actions
Copy link
Contributor

DCO Check Passed

Thanks @PeterStaar-IBM, all your commits are properly signed off. 🎉

@PeterStaar-IBM PeterStaar-IBM changed the title Fix/robustify parse of broken pdfs fix: robustify parse of broken pdfs Feb 23, 2026
@mergify
Copy link

mergify bot commented Feb 23, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?(!)?:

Signed-off-by: Peter Staar <taa@zurich.ibm.com>
@cau-git cau-git merged commit e0264dd into main Feb 23, 2026
34 checks passed
@cau-git cau-git deleted the fix/robustify-parse-of-broken-pdfs branch February 23, 2026 09:27
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants