from docling.datamodel.base_models import InputFormat
from docling.datamodel.pipeline_options import PdfPipelineOptions, EasyOcrOptions
from docling.document_converter import PdfFormatOption, DocumentConverter
# Reference API address
# Configure the PDF pipeline: set the path to the Docling models
pdf_artifacts_path = "/docling-models"
pdf_pipeline_options = PdfPipelineOptions(artifacts_path=pdf_artifacts_path)
# Build the converter
converter = DocumentConverter(
format_options={
InputFormat.PDF: PdfFormatOption(pipeline_options=pdf_pipeline_options)
}
)
source = "庞氏骗局 - 中国投资者网.pdf"
result = converter.convert(source)
print(result.document.export_to_markdown())

python test.py
Downloading detection model, please wait. This may take several minutes depending upon your network connection.
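
Since the log above shows a detection model being downloaded even though artifacts_path is set, one quick sanity check is to confirm what the configured directory contains before running the conversion. The following is only a minimal diagnostic sketch using the standard library: list_artifacts is a hypothetical helper, not part of Docling, and it assumes nothing about the directory layout Docling or EasyOCR expects.

```python
from pathlib import Path


def list_artifacts(artifacts_path: str) -> list[str]:
    """Hypothetical helper: return the sorted names of the top-level
    entries in artifacts_path, or an empty list if it is not a directory."""
    root = Path(artifacts_path)
    if not root.is_dir():
        return []
    return sorted(entry.name for entry in root.iterdir())


if __name__ == "__main__":
    # "/docling-models" is the path used in the script above.
    print(list_artifacts("/docling-models"))
```

An empty list means the path is missing or empty, which would be worth ruling out before looking at the OCR configuration itself.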