Replies: 1 comment
Hi @yamuna83, unfortunately that's a known limitation, and the version of KM you are using has been archived. That said, the fix should be simple enough: patch this file, kernel-memory/archived/km-v1/service/Core/Handlers/TextExtractionHandler.cs, lines 186 to 235 in 94b69d3.
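For illustration only, here is a rough sketch of the kind of change such a patch might make. It assumes the limitation is that PDF extraction reads only the embedded text layer, so a page made entirely of images yields an empty string. It is written in Python to stay language-agnostic and is not the C# code in TextExtractionHandler.cs. The function name `extract_with_ocr_fallback`, the page dictionary shape, and the `ocr` callable are all hypothetical.

```python
from typing import Callable, Iterable, List, Optional


def extract_with_ocr_fallback(
    pages: Iterable[dict],
    ocr: Optional[Callable[[bytes], str]] = None,
) -> List[str]:
    """Return one text string per page, using OCR when a page has no text layer.

    Each page is a hypothetical dict: {"text": str, "images": [bytes, ...]}.
    `ocr` is a hypothetical callable that turns image bytes into text.
    """
    results = []
    for page in pages:
        text = (page.get("text") or "").strip()
        if not text and ocr is not None:
            # No embedded text on this page: OCR each image and join the output.
            text = "\n".join(ocr(img) for img in page.get("images", [])).strip()
        results.append(text)
    return results
```

With no OCR engine supplied, image-only pages still come back empty, which matches the "file is empty" warning in the question. With one supplied, those pages get text, so later pipeline steps have something to save.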
I am using Kernel Memory to process a set of documents (pdf, docx, images, ppt, pptx). That works fine.
When I process a PDF file that contains only images, however, no OCR text is extracted.
I get these warnings:
warn: Microsoft.KernelMemory.DocumentStorage.AzureBlobs.AzureBlobsStorage[0] The file user-documents/000d0000-ac13-0242-b807-08de3cc174d8/Digital_Strategy.pdf.extract.txt is empty
warn: Microsoft.KernelMemory.Handlers.SaveRecordsHandler[0] Pipeline 'user-documents/000d0000-ac13-0242-b807-08de3cc174d8': step save_records: no records found, cannot save, moving to next pipeline step.
My current OCR settings:
"ImageOcrType": "AzureAIDocIntel",
Any help would be appreciated.