python-charset-normalizer

Author	SHA256	Message	Date
dimstar_suse	7e8d58bccf	Accepting request 1344812 from devel:languages:python - update to 3.4.7: * Pre-built optimized version using mypy[c] v1.20. * Relax `setuptools` constraint to `setuptools>=68,<82.1`. * Correctly remove SIG remnant in utf-7 decoded string. (#718) OBS-URL: https://build.opensuse.org/request/show/1344812 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=31	2026-04-08 15:13:29 +00:00
dirkmueller	b55fcd72a9	- update to 3.4.7: * Pre-built optimized version using mypy[c] v1.20. * Relax `setuptools` constraint to `setuptools>=68,<82.1`. * Correctly remove SIG remnant in utf-7 decoded string. (#718) OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=61	2026-04-06 21:44:31 +00:00
anag_factory	e21b5a19ec	Accepting request 1343405 from devel:languages:python - update to 3.4.6: * Flattened the logic in `charset_normalizer.md` for higher performance. Removed `eligible(..)` and `feed(...)` in favor of `feed_info(...)`. * Updated `UNICODE_RANGES_COMBINED` using Unicode blocks v17. * Edge case where noise difference between two candidates can be almost insignificant. * CLI `--normalize` writing to wrong path when passing multiple files in. * Freethreaded pre-built wheels now shipped in PyPI starting with 3.14t. * Update `setuptools` constraint to `setuptools>=68,<=82`. * Raised upper bound of mypyc for the optional pre-built extension to v1.19.1 * Add explicit link to lib math in our optimized build. * Logger level not restored correctly for empty byte sequences. * TypeError when passing bytearray to from_bytes. * Applied safe micro-optimizations in both our noise detector and language detector. * Rewrote the `query_yes_no` function (inside CLI) to avoid using ambiguous licensed code. * Added `cd.py` submodule into mypyc optional compilation to reduce further the performance impact. OBS-URL: https://build.opensuse.org/request/show/1343405 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=30	2026-03-30 16:29:45 +00:00
dirkmueller	f7d33c691b	- update to 3.4.6: * Flattened the logic in `charset_normalizer.md` for higher performance. Removed `eligible(..)` and `feed(...)` in favor of `feed_info(...)`. * Updated `UNICODE_RANGES_COMBINED` using Unicode blocks v17. * Edge case where noise difference between two candidates can be almost insignificant. * CLI `--normalize` writing to wrong path when passing multiple files in. * Freethreaded pre-built wheels now shipped in PyPI starting with 3.14t. * Update `setuptools` constraint to `setuptools>=68,<=82`. * Raised upper bound of mypyc for the optional pre-built extension to v1.19.1 * Add explicit link to lib math in our optimized build. * Logger level not restored correctly for empty byte sequences. * TypeError when passing bytearray to from_bytes. * Applied safe micro-optimizations in both our noise detector and language detector. * Rewrote the `query_yes_no` function (inside CLI) to avoid using ambiguous licensed code. * Added `cd.py` submodule into mypyc optional compilation to reduce further the performance impact. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=59	2026-03-29 10:16:17 +00:00
anag_factory	6b4533885b	Accepting request 1315251 from devel:languages:python OBS-URL: https://build.opensuse.org/request/show/1315251 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=29	2025-11-05 15:17:48 +00:00
dgarcia	ce5ef5f0c0	Accepting request 1314564 from home:glaubitz:branches:devel:languages:python - Update to 3.4.4 * Bound `setuptools` to a specific constraint `setuptools>=68,<=81`. * Raised upper bound of mypyc for the optional pre-built extension to v1.18.2 * `setuptools-scm` as a build dependency. * Enforced hashes in `dev-requirements.txt` and created `ci-requirements.txt` for security purposes. * Additional pre-built wheels for riscv64, s390x, and armv7l architectures. * Restore `multiple.intoto.jsonl` in GitHub releases in addition to individual attestation file per wheel. OBS-URL: https://build.opensuse.org/request/show/1314564 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=57	2025-11-03 09:19:03 +00:00
anag_factory	7e2ef73dfc	Accepting request 1304688 from devel:languages:python - update to 3.4.3: * mypy(c) is no longer a required dependency at build time if `CHARSET_NORMALIZER_USE_MYPYC` isn't set to `1`. (#595) * automatically lower confidence on small bytes samples that are not Unicode in `detect` output legacy function. * Custom build backend to overcome inability to mark mypy as an optional dependency in the build phase. * Support for Python 3.14 * sdist archive contained useless directories. * automatically fallback on valid UTF-16 or UTF-32 even if the md says it's noisy. * SBOM are automatically published to the relevant GitHub release to comply with regulatory changes. * Each published wheel comes with its SBOM. We choose CycloneDX as the format. * Prebuilt optimized wheel are no longer distributed by default for CPython 3.7 due to a change in cibuildwheel. OBS-URL: https://build.opensuse.org/request/show/1304688 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=28	2025-09-15 17:50:31 +00:00
dirkmueller	91d56d242a	- update to 3.4.3: * mypy(c) is no longer a required dependency at build time if `CHARSET_NORMALIZER_USE_MYPYC` isn't set to `1`. (#595) * automatically lower confidence on small bytes samples that are not Unicode in `detect` output legacy function. * Custom build backend to overcome inability to mark mypy as an optional dependency in the build phase. * Support for Python 3.14 * sdist archive contained useless directories. * automatically fallback on valid UTF-16 or UTF-32 even if the md says it's noisy. * SBOM are automatically published to the relevant GitHub release to comply with regulatory changes. * Each published wheel comes with its SBOM. We choose CycloneDX as the format. * Prebuilt optimized wheel are no longer distributed by default for CPython 3.7 due to a change in cibuildwheel. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=55	2025-09-14 21:00:37 +00:00
anag_factory	d9b2ef1a60	Accepting request 1274387 from devel:languages:python OBS-URL: https://build.opensuse.org/request/show/1274387 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=27	2025-05-06 14:38:59 +00:00
mcalabkova	940d389e13	Accepting request 1274349 from home:glaubitz:branches:devel:languages:python - Update to 3.4.2 * Addressed the DeprecationWarning in our CLI regarding `argparse.FileType` by backporting the target class into the package. (#591) * Improved the overall reliability of the detector with CJK Ideographs. (#605) (#587) * Optional mypyc compilation upgraded to version 1.15 for Python >= 3.8 OBS-URL: https://build.opensuse.org/request/show/1274349 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=53	2025-05-05 09:15:39 +00:00
anag_factory	2cb57d39c4	Accepting request 1238022 from devel:languages:python - Use libalternatives instead of update-alternatives, bsc#1235781 OBS-URL: https://build.opensuse.org/request/show/1238022 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=26	2025-01-16 17:31:17 +00:00
dgarcia	ceff5b28c1	- Use libalternatives instead of update-alternatives, bsc#1235781 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=51	2025-01-15 10:38:37 +00:00
anag_factory	a683fc34d0	Accepting request 1236172 from devel:languages:python OBS-URL: https://build.opensuse.org/request/show/1236172 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=25	2025-01-12 10:10:17 +00:00
mcalabkova	561398d356	Accepting request 1236124 from home:glaubitz:branches:devel:languages:python - Update to 3.4.1 * Project metadata are now stored using `pyproject.toml` instead of `setup.cfg` using setuptools as the build backend. * Enforce annotation delayed loading for a simpler and consistent types in the project. * Optional mypyc compilation upgraded to version 1.14 for Python >= 3.8 * Added pre-commit configuration. * Added noxfile. * Removed `build-requirements.txt` as per using `pyproject.toml` native build configuration. * Removed `bin/integration.py` and `bin/serve.py` in favor of downstream integration test (see noxfile). * Removed `setup.cfg` in favor of `pyproject.toml` metadata configuration. * Removed unused `utils.range_scan` function. * Converting content to Unicode bytes may insert `utf_8` instead of preferred `utf-8`. (#572) * Deprecation warning "'count' is passed as positional argument" when converting to Unicode bytes on Python 3.13+ - Drop sed command to remove code coverage flags from pytest OBS-URL: https://build.opensuse.org/request/show/1236124 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=49	2025-01-09 12:21:13 +00:00
anag_factory	0ee7e60e59	Accepting request 1221058 from devel:languages:python Automatic submission by obs-autosubmit OBS-URL: https://build.opensuse.org/request/show/1221058 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=24	2024-11-05 14:39:43 +00:00
dirkmueller	ce57742542	- switch to PEP517 build OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=47	2024-10-28 16:38:02 +00:00
anag_factory	2535bce82a	Accepting request 1217078 from devel:languages:python - update to 3.4.0: * Argument `--no-preemptive` in the CLI to prevent the detector to search for hints. * Support for Python 3.13 * Relax the TypeError exception thrown when trying to compare a CharsetMatch with anything else than a CharsetMatch. * Improved the general reliability of the detector based on user feedbacks. (#520) (#509) (#498) (#407) * Declared charset in content (preemptive detection) not changed when converting to utf-8 bytes. OBS-URL: https://build.opensuse.org/request/show/1217078 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=23	2024-10-23 19:08:21 +00:00
dirkmueller	98a4dbb463	- update to 3.4.0: * Argument `--no-preemptive` in the CLI to prevent the detector to search for hints. * Support for Python 3.13 * Relax the TypeError exception thrown when trying to compare a CharsetMatch with anything else than a CharsetMatch. * Improved the general reliability of the detector based on user feedbacks. (#520) (#509) (#498) (#407) * Declared charset in content (preemptive detection) not changed when converting to utf-8 bytes. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=45	2024-10-22 16:00:24 +00:00
anag_factory	39f28f6f2d	Accepting request 1128743 from devel:languages:python - update to 3.3.2: * Unintentional memory usage regression when using large payload that match several encoding (#376) * Regression on some detection case showcased in the documentation (#371) * Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form * Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8 * Improved the general detection reliability based on reports from the community OBS-URL: https://build.opensuse.org/request/show/1128743 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=22	2023-11-27 21:42:20 +00:00
dirkmueller	4d62239b35	- update to 3.3.2: * Unintentional memory usage regression when using large payload that match several encoding (#376) * Regression on some detection case showcased in the documentation (#371) * Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form * Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8 * Improved the general detection reliability based on reports from the community OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=43	2023-11-25 14:12:46 +00:00
anag_factory	2c41d8af7f	Accepting request 1114778 from devel:languages:python - update to 3.3.0: * Allow to execute the CLI (e.g. normalizer) through `python -m charset_normalizer.cli` or `python -m charset_normalizer` * Support for 9 forgotten encoding that are supported by Python but unlisted in `encoding.aliases` as they have no alias * Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.7 * Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in `__lt__` (#350) - Update to 3.0.1 - Update to 3.0.0 * ASCII miss-detection on rare cases (PR #170) * Wrong logging level applied when setting kwarg `explain` to True - require lower-case name instead of breaking build OBS-URL: https://build.opensuse.org/request/show/1114778 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=21	2023-11-23 20:38:43 +00:00
dirkmueller	b5ba82d155	- update to 3.3.0: * Allow to execute the CLI (e.g. normalizer) through `python -m charset_normalizer.cli` or `python -m charset_normalizer` * Support for 9 forgotten encoding that are supported by Python but unlisted in `encoding.aliases` as they have no alias * Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.7 * Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in `__lt__` (#350) - Update to 3.0.1 - Update to 3.0.0 * ASCII miss-detection on rare cases (PR #170) * Wrong logging level applied when setting kwarg `explain` to True - require lower-case name instead of breaking build OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=41	2023-10-02 09:08:45 +00:00
anag_factory	69d02200cb	Accepting request 1098807 from devel:languages:python - update to 3.2.0: * Typehint for function `from_path` no longer enforce `PathLike` as its first argument * Minor improvement over the global detection reliability * Introduce function `is_binary` that relies on main capabilities, and optimized to detect binaries * Propagate `enable_fallback` argument throughout `from_bytes`, `from_path`, and `from_fp` that allow a deeper control over the detection (default True) * Edge case detection failure where a file would contain 'very-long' camel cased word (Issue #289) OBS-URL: https://build.opensuse.org/request/show/1098807 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=20	2023-07-17 17:22:47 +00:00
dirkmueller	2419fdf02e	- update to 3.2.0: * Typehint for function `from_path` no longer enforce `PathLike` as its first argument * Minor improvement over the global detection reliability * Introduce function `is_binary` that relies on main capabilities, and optimized to detect binaries * Propagate `enable_fallback` argument throughout `from_bytes`, `from_path`, and `from_fp` that allow a deeper control over the detection (default True) * Edge case detection failure where a file would contain 'very-long' camel cased word (Issue #289) OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=39	2023-07-11 13:24:00 +00:00
dimstar_suse	f910db3030	Accepting request 1084939 from devel:languages:python - add sle15_python_module_pythons (jsc#PED-68) OBS-URL: https://build.opensuse.org/request/show/1084939 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=19	2023-05-09 11:06:32 +00:00
dirkmueller	fbcae794f8	- add sle15_python_module_pythons (jsc#PED-68) OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=37	2023-05-05 06:41:30 +00:00
dimstar_suse	5fd26c1c73	Accepting request 1074517 from devel:languages:python - update to 3.1.0: * Argument `should_rename_legacy` for legacy function `detect` and disregard any new arguments without errors (PR #262) * Removed Support for Python 3.6 (PR #260) * Optional speedup provided by mypy/c 1.0.1 OBS-URL: https://build.opensuse.org/request/show/1074517 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=18	2023-03-29 21:26:15 +00:00
dirkmueller	41b8cd85f8	- update to 3.1.0: * Argument `should_rename_legacy` for legacy function `detect` and disregard any new arguments without errors (PR #262) * Removed Support for Python 3.6 (PR #260) * Optional speedup provided by mypy/c 1.0.1 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=35	2023-03-26 20:04:47 +00:00
dimstar_suse	2f54e56b98	Accepting request 1039740 from devel:languages:python OBS-URL: https://build.opensuse.org/request/show/1039740 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=17	2022-12-04 13:57:44 +00:00
dirkmueller	9815d6c60a	Accepting request 1039709 from home:yarunachalam:branches:devel:languages:python - Update to 3.0.1 * Fixed: Multi-bytes cutter/chunk generator did not always cut correctly (PR #233) * Changed: Speedup provided by mypy/c 0.990 on Python >= 3.7 OBS-URL: https://build.opensuse.org/request/show/1039709 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=33	2022-12-03 07:29:01 +00:00
dimstar_suse	f4493151fe	Accepting request 1032182 from devel:languages:python Forwarded request #1031656 from yarunachalam - Update to 3.0.0 * Added: Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results; Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES; Add parameter language_threshold in from_bytes, from_path and from_fp to adjust the minimum expected coherence ratio; normalizer --version now specify if current version provide extra speedup (meaning mypyc compilation whl) * Changed: Build with static metadata using 'build' frontend; Make the language detection stricter; Optional: Module md.py can be compiled using Mypyc to provide an extra speedup up to 4x faster than v2.1 * Fixed: CLI with opt --normalize fail when using full path for files; TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha character have been fed to it; Sphinx warnings when generating the documentation * Removed: Coherence detector no longer return 'Simple English' instead return 'English'; Coherence detector no longer return 'Classical Chinese' instead return 'Chinese'; Breaking: Method first() and best() from CharsetMatch; UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflict with ASCII); Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches; Breaking: Top-level function normalize; Breaking: Properties chaos_secondary_pass, coherence_non_latin and w_counter from CharsetMatch; Support for the backport unicodedata2 OBS-URL: https://build.opensuse.org/request/show/1032182 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=16	2022-11-04 16:31:30 +00:00
mcepl	bc69402598	Accepting request 1031656 from home:yarunachalam:branches:devel:languages:python - Update to 3.0.0 * Added: Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results; Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES; Add parameter language_threshold in from_bytes, from_path and from_fp to adjust the minimum expected coherence ratio; normalizer --version now specify if current version provide extra speedup (meaning mypyc compilation whl) * Changed: Build with static metadata using 'build' frontend; Make the language detection stricter; Optional: Module md.py can be compiled using Mypyc to provide an extra speedup up to 4x faster than v2.1 * Fixed: CLI with opt --normalize fail when using full path for files; TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha character have been fed to it; Sphinx warnings when generating the documentation * Removed: Coherence detector no longer return 'Simple English' instead return 'English'; Coherence detector no longer return 'Classical Chinese' instead return 'Chinese'; Breaking: Method first() and best() from CharsetMatch; UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflict with ASCII); Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches; Breaking: Top-level function normalize; Breaking: Properties chaos_secondary_pass, coherence_non_latin and w_counter from CharsetMatch; Support for the backport unicodedata2 OBS-URL: https://build.opensuse.org/request/show/1031656 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=31	2022-10-29 11:47:59 +00:00
dimstar_suse	17e5c833b9	Accepting request 1004361 from devel:languages:python - update to 2.1.1: * Function `normalize` scheduled for removal in 3.0 * Removed useless call to decode in fn is_unprintable (#206) OBS-URL: https://build.opensuse.org/request/show/1004361 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=15	2022-09-18 15:31:58 +00:00
dirkmueller	9a8edf9b19	- update to 2.1.1: * Function `normalize` scheduled for removal in 3.0 * Removed useless call to decode in fn is_unprintable (#206) OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=29	2022-09-17 15:50:18 +00:00
dimstar_suse	aa7d15bff1	Accepting request 998090 from devel:languages:python OBS-URL: https://build.opensuse.org/request/show/998090 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=14	2022-08-20 18:27:45 +00:00
dirkmueller	54ae0383fc	Accepting request 998013 from home:bnavigator:branches:devel:languages:python - Clean requirements: We don't need anything OBS-URL: https://build.opensuse.org/request/show/998013 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=27	2022-08-19 06:47:38 +00:00
RBrownFactory	2ea2fee032	Accepting request 991152 from devel:languages:python - update to 2.1.0: * Output the Unicode table version when running the CLI with `--version` * Re-use decoded buffer for single byte character sets * Fixing some performance bottlenecks * Workaround potential bug in cpython with Zero Width No-Break Space located in Arabic Presentation Forms-B, Unicode 1.1 not acknowledged as space * CLI default threshold aligned with the API threshold from * Support for Python 3.5 (PR #192) * Use of backport unicodedata from `unicodedata2` as Python is quickly catching up, scheduled for removal in 3.0 OBS-URL: https://build.opensuse.org/request/show/991152 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=13	2022-07-26 17:42:09 +00:00
dirkmueller	2e435dd04d	- update to 2.1.0: * Output the Unicode table version when running the CLI with `--version` * Re-use decoded buffer for single byte character sets * Fixing some performance bottlenecks * Workaround potential bug in cpython with Zero Width No-Break Space located in Arabic Presentation Forms-B, Unicode 1.1 not acknowledged as space * CLI default threshold aligned with the API threshold from * Support for Python 3.5 (PR #192) * Use of backport unicodedata from `unicodedata2` as Python is quickly catching up, scheduled for removal in 3.0 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=25	2022-07-19 11:40:33 +00:00
dimstar_suse	de8764bf5c	Accepting request 954654 from devel:languages:python - update to 2.0.12: * ASCII miss-detection on rare cases (PR #170) * Explicit support for Python 3.11 (PR #164) * The logging behavior have been completely reviewed, now using only TRACE and DEBUG levels OBS-URL: https://build.opensuse.org/request/show/954654 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=12	2022-02-16 23:29:57 +00:00
dirkmueller	18b77e47fa	- update to 2.0.12: * ASCII miss-detection on rare cases (PR #170) * Explicit support for Python 3.11 (PR #164) * The logging behavior have been completely reviewed, now using only TRACE and DEBUG levels OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=24	2022-02-15 08:43:43 +00:00
dimstar_suse	597427c321	Accepting request 945443 from devel:languages:python - update to 2.0.10: * Fallback match entries might lead to UnicodeDecodeError for large bytes sequence * Skipping the language-detection (CD) on ASCII OBS-URL: https://build.opensuse.org/request/show/945443 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=11	2022-01-11 20:17:16 +00:00
dirkmueller	43636cbea9	- update to 2.0.10: * Fallback match entries might lead to UnicodeDecodeError for large bytes sequence * Skipping the language-detection (CD) on ASCII OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=23	2022-01-10 23:04:22 +00:00
dimstar_suse	7d3f6e53ab	Accepting request 936118 from devel:languages:python - update to 2.0.9: * Moderating the logging impact (since 2.0.8) for specific environments * Wrong logging level applied when setting kwarg `explain` to True OBS-URL: https://build.opensuse.org/request/show/936118 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=10	2021-12-09 18:45:18 +00:00
dirkmueller	4e2a28ffe3	- update to 2.0.9: * Moderating the logging impact (since 2.0.8) for specific environments * Wrong logging level applied when setting kwarg `explain` to True OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=22	2021-12-06 20:09:48 +00:00
dimstar_suse	b2ca28eae6	Accepting request 934519 from devel:languages:python - update to 2.0.8: * Improvement over Vietnamese detection * MD improvement on trailing data and long foreign (non-pure latin) * Efficiency improvements in cd/alphabet_languages * call sum() without an intermediary list following PEP 289 recommendations * Code style as refactored by Sourcery-AI * Minor adjustment on the MD around european words * Remove and replace SRTs from assets / tests * Initialize the library logger with a `NullHandler` by default * Setting kwarg `explain` to True will add provisionally * Fix large (misleading) sequence giving UnicodeDecodeError * Avoid using too insignificant chunk * Add and expose function `set_logging_handler` to configure a specific StreamHandler - require lower-case name instead of breaking build - Use lower-case name of prettytable package OBS-URL: https://build.opensuse.org/request/show/934519 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=9	2021-12-01 19:46:49 +00:00
dirkmueller	905e9c7130	- update to 2.0.8: * Improvement over Vietnamese detection * MD improvement on trailing data and long foreign (non-pure latin) * Efficiency improvements in cd/alphabet_languages * call sum() without an intermediary list following PEP 289 recommendations * Code style as refactored by Sourcery-AI * Minor adjustment on the MD around european words * Remove and replace SRTs from assets / tests * Initialize the library logger with a `NullHandler` by default * Setting kwarg `explain` to True will add provisionally * Fix large (misleading) sequence giving UnicodeDecodeError * Avoid using too insignificant chunk * Add and expose function `set_logging_handler` to configure a specific StreamHandler OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=21	2021-11-29 11:18:31 +00:00
dirkmueller	8ecb13326b	- require lower-case name instead of breaking build OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=20	2021-11-26 11:35:38 +00:00
mcepl	e9dbb76923	- Use lower-case name of prettytable package OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=19	2021-11-25 22:27:00 +00:00
dimstar_suse	08e2786e7f	Accepting request 927599 from devel:languages:python - Update to version 2.0.7 * Addition: Add support for Kazakh (Cyrillic) language detection * Improvement: Further improve inferring the language from a given code page (single-byte). * Removed: Remove redundant logging entry about detected language(s). * Improvement: Refactoring for potential performance improvements in loops. * Improvement: Various detection improvement (MD+CD). * Bugfix: Fix a minor inconsistency between Python 3.5 and other versions regarding language detection. - Update to version 2.0.6 * Bugfix: Unforeseen regression with the loss of the backward-compatibility with some older minor of Python 3.5.x. * Bugfix: Fix CLI crash when using --minimal output in certain cases. * Improvement: Minor improvement to the detection efficiency (less than 1%). - Update to version 2.0.5 * Improvement: The BC-support with v1.x was improved, the old staticmethods are restored. * Remove: The project no longer raise warning on tiny content given for detection, will be simply logged as warning instead. * Improvement: The Unicode detection is slightly improved, see #93 * Bugfix: In some rare case, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection. OBS-URL: https://build.opensuse.org/request/show/927599 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-charset-normalizer?expand=0&rev=8	2021-10-27 20:21:05 +00:00
dirkmueller	6e97c41098	Accepting request 925848 from home:mnhauke - Update to version 2.0.7 * Addition: Add support for Kazakh (Cyrillic) language detection * Improvement: Further improve inferring the language from a given code page (single-byte). * Removed: Remove redundant logging entry about detected language(s). * Improvement: Refactoring for potential performance improvements in loops. * Improvement: Various detection improvement (MD+CD). * Bugfix: Fix a minor inconsistency between Python 3.5 and other versions regarding language detection. - Update to version 2.0.6 * Bugfix: Unforeseen regression with the loss of the backward-compatibility with some older minor of Python 3.5.x. * Bugfix: Fix CLI crash when using --minimal output in certain cases. * Improvement: Minor improvement to the detection efficiency (less than 1%). - Update to version 2.0.5 * Improvement: The BC-support with v1.x was improved, the old staticmethods are restored. * Remove: The project no longer raise warning on tiny content given for detection, will be simply logged as warning instead. * Improvement: The Unicode detection is slightly improved, see #93 * Bugfix: In some rare case, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection. OBS-URL: https://build.opensuse.org/request/show/925848 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=18	2021-10-26 20:41:42 +00:00

67 Commits
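
The 3.0.0 entry in the log notes that UTF-7 is no longer reported as detected without a recognized SIG/mark because it conflicts with ASCII, and the 3.4.7 entry mentions a SIG remnant in UTF-7 decoded strings. The sketch below illustrates both points using only Python's standard-library codecs, not charset-normalizer itself. The helper names `decodes_as` and `has_utf7_sig` and the constant `UTF7_SIGS` are hypothetical, introduced here for illustration; they are not part of the package's API.

```python
# Illustration only: standard-library codecs, not charset-normalizer internals.

# The four byte sequences commonly listed as the UTF-7 signature (encoded U+FEFF).
# Hypothetical constant; the stdlib `codecs` module does not define one for UTF-7.
UTF7_SIGS = (b"+/v8", b"+/v9", b"+/v+", b"+/v/")


def decodes_as(payload: bytes, encoding: str) -> bool:
    """Return True if payload decodes without error under the given encoding."""
    try:
        payload.decode(encoding)
    except UnicodeDecodeError:
        return False
    return True


def has_utf7_sig(payload: bytes) -> bool:
    """Return True if payload starts with one of the UTF-7 signature sequences."""
    return payload.startswith(UTF7_SIGS)


plain = b"Hello, world"
marked = b"+/v8-Hello"

# Plain ASCII text is also valid UTF-7, so a successful decode alone proves nothing.
print(decodes_as(plain, "ascii"), decodes_as(plain, "utf-7"))

# Only the payload carrying the signature gives real evidence of UTF-7.
print(has_utf7_sig(plain), has_utf7_sig(marked))

# Decoding keeps the signature as a leading U+FEFF character unless it is stripped.
decoded = marked.decode("utf-7")
print(repr(decoded), repr(decoded.lstrip("\ufeff")))
```

Because any ASCII payload passes the UTF-7 decode check, a detector that treated a clean decode as a match would report UTF-7 for ordinary English text. Requiring the signature avoids that, at the cost of missing unsigned UTF-7 input.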