Sync from SUSE:SLFO:Main python-charset-normalizer revision aa6f07f269e46028b0f120cf45a8f554

Adrian Schröter 2024-12-13 11:16:33 +01:00
parent 695c58848f
commit 6582fa9b6d
4 changed files with 70 additions and 11 deletions

BIN
charset_normalizer-3.1.0.tar.gz (Stored with Git LFS)

Binary file not shown.

BIN
charset_normalizer-3.4.0.tar.gz (Stored with Git LFS) Normal file

Binary file not shown.

@@ -1,3 +1,62 @@
-------------------------------------------------------------------
Tue Oct 22 16:00:12 UTC 2024 - Dirk Müller <dmueller@suse.com>
- update to 3.4.0:
* Argument `--no-preemptive` in the CLI to prevent the detector
to search for hints.
* Support for Python 3.13
* Relax the TypeError exception thrown when trying to compare a
CharsetMatch with anything else than a CharsetMatch.
* Improved the general reliability of the detector based on
user feedbacks. (#520) (#509) (#498) (#407)
* Declared charset in content (preemptive detection) not
changed when converting to utf-8 bytes.
-------------------------------------------------------------------
Sat Nov 25 14:12:18 UTC 2023 - Dirk Müller <dmueller@suse.com>
- update to 3.3.2:
* Unintentional memory usage regression when using large
payload that match several encoding (#376)
* Regression on some detection case showcased in the
documentation (#371)
* Noise (md) probe that identify malformed arabic
representation due to the presence of letters in isolated
form
* Optional mypyc compilation upgraded to version 1.6.1 for
Python >= 3.8
* Improved the general detection reliability based on reports
from the community
-------------------------------------------------------------------
Mon Oct 2 09:07:47 UTC 2023 - Dirk Müller <dmueller@suse.com>
- update to 3.3.0:
* Allow to execute the CLI (e.g. normalizer) through `python -m
charset_normalizer.cli` or `python -m charset_normalizer`
* Support for 9 forgotten encoding that are supported by Python
but unlisted in `encoding.aliases` as they have no alias
* Optional mypyc compilation upgraded to version 1.5.1 for
Python >= 3.7
* Unable to properly sort CharsetMatch when both chaos/noise
and coherence were close due to an unreachable condition in
\_\_lt\_\_ (#350)
-------------------------------------------------------------------
Tue Jul 11 13:22:52 UTC 2023 - Dirk Müller <dmueller@suse.com>
- update to 3.2.0:
* Typehint for function `from_path` no longer enforce
`PathLike` as its first argument
* Minor improvement over the global detection reliability
* Introduce function `is_binary` that relies on main
capabilities, and optimized to detect binaries
* Propagate `enable_fallback` argument throughout `from_bytes`,
`from_path`, and `from_fp` that allow a deeper control over
the detection (default True)
* Edge case detection failure where a file would contain 'very-
long' camel cased word (Issue #289)
-------------------------------------------------------------------
Fri Apr 21 12:31:23 UTC 2023 - Dirk Müller <dmueller@suse.com>
@@ -15,7 +74,7 @@ Sun Mar 26 20:04:17 UTC 2023 - Dirk Müller <dmueller@suse.com>
-------------------------------------------------------------------
Sat Dec 3 04:13:46 UTC 2022 - Yogalakshmi Arunachalam <yarunachalam@suse.com>
- Update to 3.0.1
- Update to 3.0.1
Fixed
Multi-bytes cutter/chunk generator did not always cut correctly (PR #233)
Changed
@@ -24,7 +83,7 @@ Sat Dec 3 04:13:46 UTC 2022 - Yogalakshmi Arunachalam <yarunachalam@suse.com>
-------------------------------------------------------------------
Thu Oct 27 22:18:02 UTC 2022 - Yogalakshmi Arunachalam <yarunachalam@suse.com>
- Update to 3.0.0
- Update to 3.0.0
Added
* Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results
Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES
@@ -78,7 +137,7 @@ Tue Jul 19 11:38:48 UTC 2022 - Dirk Müller <dmueller@suse.com>
Tue Feb 15 08:42:30 UTC 2022 - Dirk Müller <dmueller@suse.com>
- update to 2.0.12:
* ASCII miss-detection on rare cases (PR #170)
* ASCII miss-detection on rare cases (PR #170)
* Explicit support for Python 3.11 (PR #164)
* The logging behavior have been completely reviewed, now using only TRACE
and DEBUG levels
@@ -97,7 +156,7 @@ Mon Dec 6 20:08:41 UTC 2021 - Dirk Müller <dmueller@suse.com>
- update to 2.0.9:
* Moderating the logging impact (since 2.0.8) for specific
environments
* Wrong logging level applied when setting kwarg `explain` to True
* Wrong logging level applied when setting kwarg `explain` to True
-------------------------------------------------------------------
Mon Nov 29 11:14:37 UTC 2021 - Dirk Müller <dmueller@suse.com>
@@ -120,7 +179,7 @@ Mon Nov 29 11:14:37 UTC 2021 - Dirk Müller <dmueller@suse.com>
-------------------------------------------------------------------
Fri Nov 26 11:35:25 UTC 2021 - Dirk Müller <dmueller@suse.com>
- require lower-case name instead of breaking build
- require lower-case name instead of breaking build
-------------------------------------------------------------------
Thu Nov 25 22:26:52 UTC 2021 - Matej Cepl <mcepl@suse.com>

@@ -1,7 +1,7 @@
#
# spec file for package python-charset-normalizer
#
# Copyright (c) 2023 SUSE LLC
# Copyright (c) 2024 SUSE LLC
#
# All modifications and additions to the file contributed by third parties
# remain the property of their copyright owners, unless otherwise agreed
@@ -18,7 +18,7 @@
%{?sle15_python_module_pythons}
Name: python-charset-normalizer
Version: 3.1.0
Version: 3.4.0
Release: 0
Summary: Python Universal Charset detector
License: MIT
@@ -29,7 +29,7 @@ BuildRequires: %{python_module setuptools}
BuildRequires: fdupes
BuildRequires: python-rpm-macros
Requires(post): update-alternatives
Requires(postun):update-alternatives
Requires(postun): update-alternatives
Suggests: python-unicodedata2
BuildArch: noarch
# SECTION test requirements