Why use pypdf2 to split the pdf first? #3

bryanyzhu · 2023-03-13T18:59:21Z

Hi, thanks for the code, it works perfect! I have a quick question, why use pypdf2 to split the pdf first? I think pdfminer can work with multiple pages and extract the content as well. Will the additional dependency, pdf splitting, pdf writing, text merging make the pipeline more complicated? I'm very curious to understand the design, thank you.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why use pypdf2 to split the pdf first? #3

Why use pypdf2 to split the pdf first? #3

bryanyzhu commented Mar 13, 2023

Why use pypdf2 to split the pdf first? #3

Why use pypdf2 to split the pdf first? #3

Comments

bryanyzhu commented Mar 13, 2023