ec16c49707
- update to 20251230: * Eliminated arbitrary code execution vulnerability (CVE-2025-64512) by replacing pickle CMap storage with JSON - users with custom pickle CMaps can use tools/convert_cmaps_to_json.py to convert to JSON format * Support for colored and uncolored tiling patterns per ISO 32000 * Pre-commit hooks for automated code quality checks * Ruff rules for modernized Python syntax * Using makefile instead of nox for local development * Fix struct.error when processing PDFs with odd-length font encoding buffers * PSBaseParser combines tokens split across streams * Improve exception handling in PDFDocument with more precise error propagation * Support for Python 3.14 * Refuse to execute circular references to content streams (including Form XObjects) * IndexError when saving image with no filters * Copying color space scs and ncs * Correct linewidth calculation in PDFPageInterpreter.do_w * Support for Python 3.9
Dirk Mueller 2026-01-02 14:02:38 +00:00
7d224a4c1c
Accepting request 1311146 from home:ecsos:python
Markéta Machová 2025-10-13 15:03:22 +00:00
b915f2557a
Accepting request 1267469 from devel:languages:python
Ana Guerrero 2025-04-16 18:39:13 +00:00
ca58f40036
- Update to 20250327: * Added + Support for Python 3.13 + Support for zipped JPEGs + Fuzzing harnesses for integration into Google's OSS-Fuzz + Support for setuptools-git-versioning version 2.0.0 * Changed + Reduce memory overhead on runlength encoding by using lists + Using pyproject.toml instead of setup.py + Updated Python 3.7 syntax to 3.8 + Updated all Python version specifications to a minimum of 3.8 + Using absolute instead of relative imports + Using standard library functions for ascii85 and asciihex * Fixed + TypeError when CID character widths are not parseable as floats + TypeError raised by extract_text method with compressed PDF file + PSBaseParser can't handle tokens split across end of buffer + TypeError when CropBox is an indirect object reference + Remove redundant line to be able to recognize rectangles + Support indirect objects for filters + Make sure bytes is bytes where it counts + TypeError when corrupt PDF object reference cannot be parsed as int + TypeError when corrupt PDF literal cannot be converted to str + ValueError when corrupt PDF specifies a negative xref location + ValueError when corrupt PDF specifies an invalid mediabox + RecursionError when corrupt PDF specifies a recursive /Pages object + TypeError when corrupt PDF specifies text-positioning operators with invalid values + inline image parsing fails when stream data contains "EI\n" + TypeError when parsing object reference as mediabox
Steve Kowalik 2025-04-07 05:36:55 +00:00
22f203fbba
Accepting request 1137442 from devel:languages:python
Ana Guerrero 2024-01-08 22:44:34 +00:00
ec451d674e
- update to 20231228: * Removed Support for Python 3.6 and 3.7 * Output converter for the hOCR format * Font name aliases for Arial, Courier New and Times New Roman * Documentation on why special characters can sometimes not be extracted * Storing Bezier path and dashing style of line in LTCurve * Broken CI/CD pipeline by setting upper version limit for black, mypy, pip and setuptools * flake8 failures * ValueError when bmp images with 1 bit channel are decoded * ValueError when trying to decrypt empty metadata values * Sphinx errors during building of documentation * TypeError when getting default width of font * Installing typing-extensions on Python 3.6 and 3.7 * TypeError in cmapdb.py when parsing null characters * Color "convenience operators" now (per spec) also set color space * ValueError when extracting images, due to breaking changes in Pillow * Small typos and issues in the documentation * Ignore non-Unicode cmaps in TrueType fonts * Using non-hardcoded version string and setuptools-git-versioning to enable installation from source and building on Python 3.12 * Usage of if __name__ == "__main__" where it was only intended for testing purposes - Option to disable boxes flow layout analysis when using pdf2txt - Exporting images without any specific encoding - Rename PDFTextExtractionNotAllowedError to PDFTextExtractionNotAllowed to revert breaking change
Dirk Mueller 2024-01-07 20:38:38 +00:00
ca85951c1e
Accepting request 1133035 from devel:languages:python
Ana Guerrero 2023-12-14 21:03:40 +00:00
c1462f6c3d
Accepting request 1132937 from home:jonapap
Martin Hauke 2023-12-14 09:40:54 +00:00
5c76a19da8
Accepting request 1105949 from devel:languages:python
Ana Guerrero 2023-08-28 15:13:44 +00:00
4fecc7db54
Accepting request 1105920 from home:ecsos:python
Markéta Machová 2023-08-25 18:45:00 +00:00
914b1f3fd5
- Use pytest to run the testsuite. - Add patch import-from-non-pythonpath-files.patch: * Allow the test suite to find modules not shipped as modules.
Steve Kowalik 2021-11-09 07:33:46 +00:00