Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to enable GPU usage for docling? #423

Closed
Stdev95 opened this issue Nov 25, 2024 · 1 comment
Closed

How to enable GPU usage for docling? #423

Stdev95 opened this issue Nov 25, 2024 · 1 comment
Labels
question Further information is requested

Comments

@Stdev95
Copy link

Stdev95 commented Nov 25, 2024

Question

Hi,

I'm exploring docling and tried doing some experiments but I see that it takes a lot of time to parse and convert the doc to markdown or json format.
I'm trying to convert pdf to markdown with these config:
pipeline_options = PdfPipelineOptions() pipeline_options.do_ocr = True pipeline_options.do_table_structure = True pipeline_options.ocr_options.use_gpu = True pipeline_options.table_structure_options.do_cell_matching = True

I have enabled the option to use GPU but it seems that GPU is not being used.

my observation:
it is taking 650+ sec to process a pdf file of size 18MB (400+ pages) using above configurations for the pipeline.

I have two questions here:

  1. for EasyOCR, do we need to enable the flag for GPU usage or default is True?
  2. Do we need to follow any steps to use local GPU for processing?
@Stdev95 Stdev95 added the question Further information is requested label Nov 25, 2024
@cau-git
Copy link
Contributor

cau-git commented Nov 25, 2024

@Stdev95 By default, Docling already uses a GPU when detected on most models (TableFormer, EasyOCR). You don't need to enable anything specific. We have work-in-progress on complete GPU support for all models, including the layout model (see DS4SD/docling-ibm-models#50) and will inform on our README and Changelog when it is available.

@cau-git cau-git closed this as completed Nov 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants