diff --git a/ftfy-6.0.1.tar.gz b/ftfy-6.0.1.tar.gz new file mode 100644 index 0000000..9188fed --- /dev/null +++ b/ftfy-6.0.1.tar.gz @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:46f5cd2ec15469bfea8f7c0169c32210c9091228d37f528d71046e1a43e81b13 +size 278549 diff --git a/python-ftfy.changes b/python-ftfy.changes index 8147180..e292306 100644 --- a/python-ftfy.changes +++ b/python-ftfy.changes @@ -1,3 +1,59 @@ +------------------------------------------------------------------- +Sat Apr 17 18:25:51 UTC 2021 - Ben Greiner + +- Update to 6.0.1 + * The remove_terminal_escapes step was accidentally not being + used. This version restores it. + * Specified in setup.py that ftfy 6 requires Python 3.6 or later. + * Use a lighter link color when the docs are viewed in dark mode. +- Version 6.0 + * New function: ftfy.fix_and_explain() can describe all the + transformations that happen when fixing a string. This is + similar to what ftfy.fixes.fix_encoding_and_explain() did in + previous versions, but it can fix more than the encoding. + * fix_and_explain() and fix_encoding_and_explain() are now in + the top-level ftfy module. + * Changed the heuristic entirely. ftfy no longer needs to + categorize every Unicode character, but only characters that + are expected to appear in mojibake. + * Because of the new heuristic, ftfy will no longer have to + release a new version for every new version of Unicode. It + should also run faster and use less RAM when imported. + * The heuristic ftfy.badness.is_bad(text) can be used to + determine whether there appears to be mojibake in a string. + Some users were already using the old function + sequence_weirdness() for that, but this one is actually + designed for that purpose. + * Instead of a pile of named keyword arguments, ftfy functions + now take in a TextFixerConfig object. The keyword arguments + still work, and become settings that override the defaults in + TextFixerConfig. + * Added support for UTF-8 mixups with Windows-1253 and + Windows-1254. + * Overhauled the documentation: https://ftfy.readthedocs.org +- Version 5.9 + * This version is brought to you by the letter à and the number + 0xC3. + * Tweaked the heuristic to decode, for example, "Ã " as the + letter "à" more often. + * This combines with the non-breaking-space fixer to decode "Ã " + as "à" as well. However, in many cases, the text " Ã " was + intended to be " à ", preserving the space -- the underlying + mojibake had two spaces after it, but the Web coalesced them + into one. We detect this case based on common French and + Portuguese words, and preserve the space when it appears + intended. + * Thanks to @zehavoc for bringing to my attention how common + this case is. + * Updated the data file of Unicode character categories to + Unicode 13, as used in Python 3.9. (No matter what version of + Python you're on, ftfy uses the same data.) +- Version 5.8 + * Improved detection of UTF-8 mojibake of Greek, Cyrillic, + Hebrew, and Arabic scripts. + * Fixed the undeclared dependency on setuptools by removing the + use of pkg_resources. + ------------------------------------------------------------------- Tue May 26 06:34:21 UTC 2020 - Petr Gajdos @@ -13,13 +69,13 @@ Tue Mar 10 08:48:18 UTC 2020 - Tomáš Chvátal Mon Aug 12 12:31:18 UTC 2019 - Marketa Calabkova - Update to version 5.6 - * The unescape_html function now supports all the HTML5 entities - that appear in html.entities.html5, including those with long + * The unescape_html function now supports all the HTML5 entities + that appear in html.entities.html5, including those with long names such as ˝. - * Unescaping of numeric HTML entities now uses the standard library's + * Unescaping of numeric HTML entities now uses the standard library's html.unescape, making edge cases consistent. - * On top of Python's support for HTML5 entities, ftfy will also - convert HTML escapes of common Latin capital letters that are + * On top of Python's support for HTML5 entities, ftfy will also + convert HTML escapes of common Latin capital letters that are (nonstandardly) written in all caps, such as Ñ for Ñ. ------------------------------------------------------------------- diff --git a/python-ftfy.spec b/python-ftfy.spec index b490123..1fcdb3c 100644 --- a/python-ftfy.spec +++ b/python-ftfy.spec @@ -1,7 +1,7 @@ # # spec file for package python-ftfy # -# Copyright (c) 2020 SUSE LLC +# Copyright (c) 2021 SUSE LLC # # All modifications and additions to the file contributed by third parties # remain the property of their copyright owners, unless otherwise agreed @@ -19,18 +19,18 @@ %{?!python_module:%define python_module() python-%{**} python3-%{**}} %define skip_python2 1 Name: python-ftfy -Version: 5.7 +Version: 6.0.1 Release: 0 Summary: Python module for repairing mis-decoded Unicode text License: MIT URL: https://github.com/LuminosoInsight/python-ftfy -Source: https://github.com/LuminosoInsight/python-ftfy/archive/v%{version}.tar.gz +Source: https://github.com/LuminosoInsight/python-ftfy/archive/v%{version}.tar.gz#/ftfy-%{version}.tar.gz BuildRequires: %{python_module setuptools} BuildRequires: fdupes BuildRequires: python-rpm-macros Requires: python-wcwidth Requires(post): update-alternatives -Requires(postun): update-alternatives +Requires(postun):update-alternatives BuildArch: noarch # SECTION test requirements BuildRequires: %{python_module pytest} @@ -54,9 +54,12 @@ put through an encode/decode cycle with different encodings. %python_expand %fdupes %{buildroot}%{$python_sitelib} %check -export PATH="$PATH:%{buildroot}%{_bindir}" -# test_cli: ftfy binary not found (update-alternatives) -%pytest -k 'not test_cli' +%{python_expand # provide u-a controlled cli command for tests +mkdir -p build/testbin +ln -s %{buildroot}%{_bindir}/ftfy-%{python_bin_suffix} build/testbin/ftfy +} +export PATH="build/testbin:$PATH" +%pytest %post %python_install_alternative ftfy diff --git a/v5.7.tar.gz b/v5.7.tar.gz deleted file mode 100644 index 2432811..0000000 --- a/v5.7.tar.gz +++ /dev/null @@ -1,3 +0,0 @@ -version https://git-lfs.github.com/spec/v1 -oid sha256:84f98cde0a4cc5223c00ea18bc49b1fa5686a6e9393c4c8df6f7e2f7620c0fcf -size 69683