Update ocr-overview.md #13662

Merged
merged 1 commit into from
Oct 22, 2024
6 changes: 4 additions & 2 deletions microsoft-365/syntex/ocr-overview.md
@@ -35,8 +35,10 @@ For example, you enable the OCR service and then add image files to your documen

|Endpoint |Supported file types |
|---------|---------|
|SharePoint and OneDrive |.bmp, .png, .jpeg, .jpg, .jfif, .arw, .cr2, .crw, .erf, .gif, .mef, .mrw, .nef, .nrw, .orf, .pef, .raw, .rw2, .rw1, .sr2, .tif, .tiff, .heic, .heif, .ari, .bay, .cap, .cr3, .dcs, .dcr, .drf, .eip, .fff, .iiq, .k25, .kdc, .mef, .mos, .ptx, .pxn, .raf, .rwl, .sr2, .srf, .srw, .x3f, .dng, .tiff, and .pdf (image only) |
|Teams, Exchange, and Windows devices |.bmp, .png, .jpeg, .jpg, .tiff, and .pdf (image only) |
|SharePoint and OneDrive |.bmp, .png, .jpeg, .jpg, .jfif, .arw, .cr2, .crw, .erf, .gif, .mef, .mrw, .nef, .nrw, .orf, .pef, .raw, .rw2, .rw1, .sr2, .tif, .tiff, .heic, .heif, .ari, .bay, .cap, .cr3, .dcs, .dcr, .drf, .eip, .fff, .iiq, .k25, .kdc, .mef, .mos, .ptx, .pxn, .raf, .rwl, .sr2, .srf, .srw, .x3f, .dng, .tiff, and .pdf |
|Teams, Exchange, and Windows devices |.bmp, .png, .jpeg, .jpg, .tiff, and .pdf |

In addition to image-based PDFs, Syntex OCR supports hybrid PDFs (PDFs that contain both text and images) starting November 1, 2024. Hybrid PDFs uploaded after November 1, 2024 are processed by the OCR service.

> [!NOTE]
> When you apply OCR to an image file, the text is stored in the **Extracted text** metadata column. When you apply OCR to a PDF or TIFF file, the extracted text is indexed in search but not available in the metadata column.
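
As an illustration of how the updated table and the note could be applied, here is a minimal Python sketch. It is not part of the Syntex product or any Microsoft API: the endpoint keys, `is_ocr_supported`, and `extracted_text_location` are hypothetical names, the extension sets are copied from the updated table rows (with repeated entries collapsed), and treating `.tif` as a TIFF file for the purposes of the note is an assumption.

```python
from pathlib import PurePath

# Extensions listed for both endpoint groups in the updated table.
_COMMON = {".bmp", ".png", ".jpeg", ".jpg", ".tiff", ".pdf"}

# Hypothetical endpoint keys; the sets mirror the updated table rows.
SUPPORTED_EXTENSIONS = {
    "sharepoint_onedrive": _COMMON | {
        ".jfif", ".arw", ".cr2", ".crw", ".erf", ".gif", ".mef", ".mrw",
        ".nef", ".nrw", ".orf", ".pef", ".raw", ".rw2", ".rw1", ".sr2",
        ".tif", ".heic", ".heif", ".ari", ".bay", ".cap", ".cr3", ".dcs",
        ".dcr", ".drf", ".eip", ".fff", ".iiq", ".k25", ".kdc", ".mos",
        ".ptx", ".pxn", ".raf", ".rwl", ".srf", ".srw", ".x3f", ".dng",
    },
    "teams_exchange_windows": set(_COMMON),
}

# Assumption: ".tif" is handled like ".tiff" under the note's PDF/TIFF rule.
_SEARCH_ONLY = {".pdf", ".tif", ".tiff"}


def _extension(filename: str) -> str:
    """Return the lowercased file extension, including the leading dot."""
    return PurePath(filename).suffix.lower()


def is_ocr_supported(endpoint: str, filename: str) -> bool:
    """Check a file name against the table for the given endpoint key."""
    return _extension(filename) in SUPPORTED_EXTENSIONS[endpoint]


def extracted_text_location(filename: str) -> str:
    """Per the note: PDF/TIFF text is only indexed in search;
    text from other image files goes to the Extracted text column."""
    if _extension(filename) in _SEARCH_ONLY:
        return "search index"
    return "Extracted text column"
```

For example, `is_ocr_supported("teams_exchange_windows", "photo.heic")` evaluates to `False` because `.heic` appears only in the SharePoint and OneDrive row, and `extracted_text_location("scan.pdf")` evaluates to `"search index"`.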