Extract text with embedded fonts #4117
              
                Unanswered
              
          
                  
                    
                      maivan-hoa
                    
                  
                
                  asked this question in
                Q&A
              
            Replies: 0 comments
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I have a pdf file. When extracting text, the resulting text has font errors and the text cannot be obtained as displayed. It seems that the text in the pdf file is displayed with embedded fonts. Is there any way I can extract text from such pdf files effectively?
This is the pdf file from which I want to extract data:
Page_test.pdf
Text in file:

Extracted text:
Beta Was this translation helpful? Give feedback.
All reactions