Releases: pymupdf/PyMuPDF

New features for text searching and more

27 Oct 12:09

This resolves

and removes the hit_max parameter from text searching. In addition, hyphenated words around line breaks are still found.
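The release note does not say how the search handles hyphenation internally. As a rough, library-independent illustration of the idea, the sketch below joins lines whose last word was split by a trailing hyphen before searching. The helper names `join_hyphenated_lines` and `contains` are hypothetical and are not part of PyMuPDF.

```python
def join_hyphenated_lines(lines):
    """Join text lines into one string, merging words split by a trailing hyphen.

    Hypothetical helper for illustration only; not PyMuPDF code.
    """
    text = ""
    for line in lines:
        line = line.strip()
        if text.endswith("-"):
            # Drop the hyphen and continue the word with the next line.
            text = text[:-1] + line
        elif text:
            # Ordinary line break: treat it like a space.
            text = text + " " + line
        else:
            text = line
    return text


def contains(lines, needle):
    """Return True if needle occurs in the text, even across a hyphenated line break."""
    return needle in join_hyphenated_lines(lines)


lines = ["The docu-", "ment was searched"]
print(contains(lines, "document"))
```

Note that this naive version also merges genuinely hyphenated compounds that happen to end a line, which a real implementation would need to consider.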

With the clip parameter, text searches and text extractions now include only characters whose bboxes are fully contained in the clip rectangle.
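The containment rule can be illustrated without the library. The following is a minimal sketch, assuming each character comes with an axis-aligned bbox (x0, y0, x1, y1); the `Rect` tuple and the functions `fully_inside` and `clip_chars` are hypothetical stand-ins, not PyMuPDF's own classes or implementation.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    """Axis-aligned rectangle (hypothetical stand-in for a bbox)."""
    x0: float
    y0: float
    x1: float
    y1: float


def fully_inside(inner, outer):
    """True only if every edge of inner lies within outer."""
    return (
        outer.x0 <= inner.x0
        and outer.y0 <= inner.y0
        and inner.x1 <= outer.x1
        and inner.y1 <= outer.y1
    )


def clip_chars(chars, clip):
    """Keep characters whose bbox is fully contained in clip."""
    return "".join(c for c, bbox in chars if fully_inside(bbox, clip))


chars = [
    ("A", Rect(10, 10, 20, 22)),
    ("B", Rect(20, 10, 30, 22)),
    ("C", Rect(30, 10, 40, 22)),  # straddles the right edge of the clip
]
clip = Rect(10, 5, 35, 30)
print(clip_chars(chars, clip))  # prints "AB"
```

A character that merely intersects the clip, like "C" here, is dropped, so a clip drawn tightly around text may need a small margin.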

Important fixes, some improvements for drawing extraction

18 Oct 16:54

  • fixed #692
  • fixed #686
  • Added transparency options for various methods in classes Shape and Page.

Support MuPDF v1.18.0

08 Oct 07:17

This version fixes the following issues:

  • #519 - method Page.cleanContents() should no longer destroy the PDF page's appearance. In earlier versions, this upstream bug occurred in rare cases.
  • #675 - unsuccessful storage allocations (e.g. for extremely large pixmaps) could occasionally lead to interpreter crashes. This should now always be prevented (fingers crossed).
  • #668 - the specification of line dashes in PDF is now correctly documented.
  • #669 - fixed a major cause of memory leakage in method Document.insertPDF.

The following new features or improvements are included:

  • Text extraction method Page.getText() now also works for annotations: Annot.getText().
  • Text from within a rectangle can now be extracted directly via Page.getTextbox(rect). This may make extra scripts unnecessary in many cases.
  • When applying redactions on PDF pages, the handling of images can now be finely controlled via a new parameter.
  • The DPI (resolution) of PNG images created from pixmaps is now automatically set from the Pixmap.xres and Pixmap.yres values.
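For context on the DPI item: the PNG format stores resolution in a pHYs chunk as pixels per metre, not dots per inch. The notes do not describe how PyMuPDF writes this value, so the following is only a sketch of the unit conversion and chunk layout as defined by the PNG standard; the function names are hypothetical.

```python
import struct
import zlib

INCH_IN_METRES = 0.0254


def dpi_to_pixels_per_metre(dpi):
    """Convert dots per inch to the pixels-per-metre unit used by PNG."""
    return round(dpi / INCH_IN_METRES)


def phys_chunk(xres, yres):
    """Build a PNG pHYs chunk (length, type, data, CRC) from DPI values."""
    data = struct.pack(
        ">IIB",
        dpi_to_pixels_per_metre(xres),
        dpi_to_pixels_per_metre(yres),
        1,  # unit specifier: 1 means the unit is the metre
    )
    chunk_type = b"pHYs"
    # The CRC covers the chunk type and the data, not the length field.
    crc = zlib.crc32(chunk_type + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + chunk_type + data + struct.pack(">I", crc)


print(dpi_to_pixels_per_metre(72))   # prints 2835
print(len(phys_chunk(96, 96)))       # prints 21
```

Because of the rounding to whole pixels per metre, a viewer reading the file back may report a slightly different DPI than the one that was set (for example 72.009 instead of 72).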

Fixes, performance improvements

14 Sep 11:12

Fixed #651
Fixed #645
Fixed #622
Fixed #653
Fixed #640
Added methods and attributes to speed up TOC maintenance.
Added new page method to extract text from inside a rectangle.
All getText() methods (except (X)HTML and XML) now support a clip parameter.

Bug fixes and more support for font replacements

26 Aug 19:31

  • Fixed #605
  • Fixed #600
  • Added origin key to text span dictionary of Page.getText("dict").
  • Added property buffer to fitz.Font.
  • Added option sanitize to Page.cleanContents().

Several fixes

06 Aug 13:11

  • Fixes #561
  • Fixes #566
  • Fixes #568
  • Correct use of opacity default in TextWriter.writeText().
  • Support for fonts from [pymupdf-fonts](https://github.com/pymupdf/pymupdf-fonts).
  • Some smaller feature improvements.

Font Handling improvements and other fixes

24 Jul 09:45

Fixes #561 - more than 10 TextWriter fonts per page
Fixes #562 - annotation pixmaps no longer derived from the page
Fixes wrong appearance of mono-spaced fonts
Resolves #563 - allow manipulation of PDF property NeedAppearances
Support optional fonts provided via repository pymupdf-fonts

Minor Fixes, more flexible TextWriter Textbox

07 Jul 09:03

Fixes #540
Fixes #548
TextWriter.fillTextbox now supports indenting start of text.

Fix Important issue

24 Jun 14:04

Fixes #533
Adds appearance flexibility to redaction annotations, #535.

Addressing minor bugs

18 Jun 16:32

Fixed: #525, #520
Implemented: #524

Improved interactive help a lot.