Skip to content

banianzr/doc_parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

document parser

This project aims to parse documents in .txt, .docx, .pdf formats.

Installation

TODO

Utils

magic-pdf

Modified from MinerU reference

Steps:

  1. run download_models_hf_magic_pdf.py
  2. config the downloaded magic-pdf.json

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages