15
0
forked from pool/python-ftfy

Accepting request 886559 from devel:languages:python

OBS-URL: https://build.opensuse.org/request/show/886559
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-ftfy?expand=0&rev=7
This commit is contained in:
2021-04-19 19:05:57 +00:00
committed by Git OBS Bridge
4 changed files with 74 additions and 15 deletions

3
ftfy-6.0.1.tar.gz Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:46f5cd2ec15469bfea8f7c0169c32210c9091228d37f528d71046e1a43e81b13
size 278549

View File

@@ -1,3 +1,59 @@
-------------------------------------------------------------------
Sat Apr 17 18:25:51 UTC 2021 - Ben Greiner <code@bnavigator.de>
- Update to 6.0.1
* The remove_terminal_escapes step was accidentally not being
used. This version restores it.
* Specified in setup.py that ftfy 6 requires Python 3.6 or later.
* Use a lighter link color when the docs are viewed in dark mode.
- Version 6.0
* New function: ftfy.fix_and_explain() can describe all the
transformations that happen when fixing a string. This is
similar to what ftfy.fixes.fix_encoding_and_explain() did in
previous versions, but it can fix more than the encoding.
* fix_and_explain() and fix_encoding_and_explain() are now in
the top-level ftfy module.
* Changed the heuristic entirely. ftfy no longer needs to
categorize every Unicode character, but only characters that
are expected to appear in mojibake.
* Because of the new heuristic, ftfy will no longer have to
release a new version for every new version of Unicode. It
should also run faster and use less RAM when imported.
* The heuristic ftfy.badness.is_bad(text) can be used to
determine whether there appears to be mojibake in a string.
Some users were already using the old function
sequence_weirdness() for that, but this one is actually
designed for that purpose.
* Instead of a pile of named keyword arguments, ftfy functions
now take in a TextFixerConfig object. The keyword arguments
still work, and become settings that override the defaults in
TextFixerConfig.
* Added support for UTF-8 mixups with Windows-1253 and
Windows-1254.
* Overhauled the documentation: https://ftfy.readthedocs.org
- Version 5.9
* This version is brought to you by the letter à and the number
0xC3.
* Tweaked the heuristic to decode, for example, "Ã " as the
letter "à" more often.
* This combines with the non-breaking-space fixer to decode "Ã "
as "à" as well. However, in many cases, the text " Ã " was
intended to be " à ", preserving the space -- the underlying
mojibake had two spaces after it, but the Web coalesced them
into one. We detect this case based on common French and
Portuguese words, and preserve the space when it appears
intended.
* Thanks to @zehavoc for bringing to my attention how common
this case is.
* Updated the data file of Unicode character categories to
Unicode 13, as used in Python 3.9. (No matter what version of
Python you're on, ftfy uses the same data.)
- Version 5.8
* Improved detection of UTF-8 mojibake of Greek, Cyrillic,
Hebrew, and Arabic scripts.
* Fixed the undeclared dependency on setuptools by removing the
use of pkg_resources.
-------------------------------------------------------------------
Tue May 26 06:34:21 UTC 2020 - Petr Gajdos <pgajdos@suse.com>
@@ -13,13 +69,13 @@ Tue Mar 10 08:48:18 UTC 2020 - Tomáš Chvátal <tchvatal@suse.com>
Mon Aug 12 12:31:18 UTC 2019 - Marketa Calabkova <mcalabkova@suse.com>
- Update to version 5.6
* The unescape_html function now supports all the HTML5 entities
that appear in html.entities.html5, including those with long
* The unescape_html function now supports all the HTML5 entities
that appear in html.entities.html5, including those with long
names such as &DiacriticalDoubleAcute;.
* Unescaping of numeric HTML entities now uses the standard library's
* Unescaping of numeric HTML entities now uses the standard library's
html.unescape, making edge cases consistent.
* On top of Python's support for HTML5 entities, ftfy will also
convert HTML escapes of common Latin capital letters that are
* On top of Python's support for HTML5 entities, ftfy will also
convert HTML escapes of common Latin capital letters that are
(nonstandardly) written in all caps, such as &NTILDE; for Ñ.
-------------------------------------------------------------------

View File

@@ -1,7 +1,7 @@
#
# spec file for package python-ftfy
#
# Copyright (c) 2020 SUSE LLC
# Copyright (c) 2021 SUSE LLC
#
# All modifications and additions to the file contributed by third parties
# remain the property of their copyright owners, unless otherwise agreed
@@ -19,18 +19,18 @@
%{?!python_module:%define python_module() python-%{**} python3-%{**}}
%define skip_python2 1
Name: python-ftfy
Version: 5.7
Version: 6.0.1
Release: 0
Summary: Python module for repairing mis-decoded Unicode text
License: MIT
URL: https://github.com/LuminosoInsight/python-ftfy
Source: https://github.com/LuminosoInsight/python-ftfy/archive/v%{version}.tar.gz
Source: https://github.com/LuminosoInsight/python-ftfy/archive/v%{version}.tar.gz#/ftfy-%{version}.tar.gz
BuildRequires: %{python_module setuptools}
BuildRequires: fdupes
BuildRequires: python-rpm-macros
Requires: python-wcwidth
Requires(post): update-alternatives
Requires(postun): update-alternatives
Requires(postun):update-alternatives
BuildArch: noarch
# SECTION test requirements
BuildRequires: %{python_module pytest}
@@ -54,9 +54,12 @@ put through an encode/decode cycle with different encodings.
%python_expand %fdupes %{buildroot}%{$python_sitelib}
%check
export PATH="$PATH:%{buildroot}%{_bindir}"
# test_cli: ftfy binary not found (update-alternatives)
%pytest -k 'not test_cli'
%{python_expand # provide u-a controlled cli command for tests
mkdir -p build/testbin
ln -s %{buildroot}%{_bindir}/ftfy-%{python_bin_suffix} build/testbin/ftfy
}
export PATH="build/testbin:$PATH"
%pytest
%post
%python_install_alternative ftfy

View File

@@ -1,3 +0,0 @@
version https://git-lfs.github.com/spec/v1
oid sha256:84f98cde0a4cc5223c00ea18bc49b1fa5686a6e9393c4c8df6f7e2f7620c0fcf
size 69683