41 lines
2.3 KiB
Plaintext
41 lines
2.3 KiB
Plaintext
|
|
-------------------------------------------------------------------
|
||
|
|
Mon Jan 8 15:26:44 UTC 2024 - ecsos <ecsos@opensuse.org>
|
||
|
|
|
||
|
|
- Update to 16.0.4
|
||
|
|
- Fixed some issues for left-to-right text with the new hOCR renderer.
|
||
|
|
It is still not default yet but will be made so soon.
|
||
|
|
Right-to-left text is still in progress.
|
||
|
|
- Added an error to prevent use of several versions of Ghostscript
|
||
|
|
that seem corrupt existing text in input PDFs.
|
||
|
|
Newly generated OCR is not affected.
|
||
|
|
For best results, use Ghostscript 10.02.1 or newer,
|
||
|
|
which contains the fix for the issue.
|
||
|
|
|
||
|
|
-------------------------------------------------------------------
|
||
|
|
Thu Jan 4 10:05:05 UTC 2024 - ecsos <ecsos@opensuse.org>
|
||
|
|
|
||
|
|
- Update to 16.0.3
|
||
|
|
- Changed minimum required Ghostscript to 9.54, to support users of RHEL 9 and its derivatives,
|
||
|
|
since that is the latest version available there.
|
||
|
|
- Removed warning message about CVE-2023-43115, on the assumption that most distributions have backported the patch by now.
|
||
|
|
- Changes from 16.0.2
|
||
|
|
- Temporarily changed PDF text renderer back to sandwich by default to address regressions in macOS Preview.
|
||
|
|
- Changes from 16.0.1
|
||
|
|
- Fixed text rendering issue with new hOCR text renderer - extraneous byte order marks.
|
||
|
|
- Tightened dependencies.
|
||
|
|
- Changes from 16.0.0
|
||
|
|
- Added OCR text renderer, combined the best ideas of Tesseract's PDF generator and the older hOCR transformer renderer.
|
||
|
|
The result is a hopefully permanent fix for wordssmushedtogetherwithoutspaces issues in extracted text, better
|
||
|
|
registration/position of text on skewed baselines :issue:`1009`, fixes to character output when the German Fraktur script
|
||
|
|
is used :issue:`1191`, proper rendering of right to left languages (Arabic, Hebrew, Persian) :issue:`1157`.
|
||
|
|
Asian languages may still have excessive word breaks compared to expectations. The new renderer is the default;
|
||
|
|
the old sandwich renderer is still available using --pdf-renderer sandwich; the old hOCR renderer is no more.
|
||
|
|
- The ocrmypdf.hocrtransform API has changed substantially.
|
||
|
|
- Support for Python 3.9 has been dropped. Python 3.10+ is now required.
|
||
|
|
- pikepdf >= 8.8.0 is now required.
|
||
|
|
|
||
|
|
-------------------------------------------------------------------
|
||
|
|
Fri Dec 15 08:32:05 UTC 2023 - ecsos <ecsos@opensuse.org>
|
||
|
|
|
||
|
|
- Initial version 15.4.4
|