Changelog¶
0.2.3 (2020-08-31)¶
Config: make
Config.cfg_path
public attributeDocument: add support for
Path
for loading pdfpyxpdf_data: add 35 base Postscript fonts from ghostscript
Bugs Fixed
Fix #9: segfault using
text()
Fix #8: add checks for file in
Config.add_font_file()
0.2.2 (2020-07-03)¶
Config: add function to add missing fonts
Config.add_font_file()
Introduce
PDFImage
to represent a PDF Image.PDFImageOutput:
get()
returnsPDFImage
instead of PillowImage
0.2.1 (2020-06-12)¶
Bugs Fixed
fix all direct memory leaks
Config: fix
Config.text_encoding
setter, encodings with lowercase characters were not able to set.fix weird bytes encoding problem in python debug builds
0.2.0 (2020-06-11)¶
Python 2.7 support dropped
2 optional dependencies (Pillow, pyxpdf_data) introduced
New Features
Introduce (optional) package pyxpdf_data which add more encoding support.
API: add specialised classes for pdf outputs, PDFOuputDevice.
TextOutput - For Text extraction
RawImageOutput - Render PDF Page as Image
PDFImageOutput - Extract images from PDF
- Config: add new global settings:
Bugs Fixed
pdftotext: extracted text contains clipped text even when explictly discarding it.
Config: fix loading of external xdfrc with
Config.load_file()
0.1.1 (2020-05-10)¶
FIX: default
Config.text_encoding
value i.e UTF-8 does not persistConfig.reset()
and changes to Latin1.pdftotext: remove all parameters that change global
Config
properties.
0.1 (2020-04-20)¶
Initial stable release.