python-charset-normalizer

pool/python-charset-normalizer

SHA256

Fork 1

223430a4ca Accepting request 1128743 from devel:languages:python factory Ana Guerrero 2023-11-27 21:42:20 +0000
9cd0e22679 - update to 3.3.2: * Unintentional memory usage regression when using large payload that match several encoding (#376) * Regression on some detection case showcased in the documentation (#371) * Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form * Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8 * Improved the general detection reliability based on reports from the community Dirk Mueller 2023-11-25 14:12:46 +0000
5a4b0b3e0c Accepting request 1114778 from devel:languages:python Ana Guerrero 2023-11-23 20:38:43 +0000
3e7e8a34ba - update to 3.3.0: * Allow to execute the CLI (e.g. normalizer) through python -m charset_normalizer.cli or python -m charset_normalizer * Support for 9 forgotten encoding that are supported by Python but unlisted in encoding.aliases as they have no alias * Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.7 * Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in \_\_lt\_\_ (#350) - Update to 3.0.1 - Update to 3.0.0 * ASCII miss-detection on rare cases (PR #170) * Wrong logging level applied when setting kwarg explain to True - require lower-case name instead of breaking build Dirk Mueller 2023-10-02 09:08:45 +0000
efcb074653 Accepting request 1098807 from devel:languages:python Ana Guerrero 2023-07-17 17:22:47 +0000
c103080fc4 - update to 3.2.0: * Typehint for function from_path no longer enforce PathLike as its first argument * Minor improvement over the global detection reliability * Introduce function is_binary that relies on main capabilities, and optimized to detect binaries * Propagate enable_fallback argument throughout from_bytes, from_path, and from_fp that allow a deeper control over the detection (default True) * Edge case detection failure where a file would contain 'very- long' camel cased word (Issue #289) Dirk Mueller 2023-07-11 13:24:00 +0000
9ded4692d5 Accepting request 1084939 from devel:languages:python Dominique Leuenberger 2023-05-09 11:06:32 +0000
110acf0118 - add sle15_python_module_pythons (jsc#PED-68) Dirk Mueller 2023-05-05 06:41:30 +0000
e5a29dc1ed Accepting request 1074517 from devel:languages:python Dominique Leuenberger 2023-03-29 21:26:15 +0000
5a2b102103 - update to 3.1.0: * Argument should_rename_legacy for legacy function detect and disregard any new arguments without errors (PR #262) * Removed Support for Python 3.6 (PR #260) * Optional speedup provided by mypy/c 1.0.1 Dirk Mueller 2023-03-26 20:04:47 +0000
cfd5a3a805 Accepting request 1039740 from devel:languages:python Dominique Leuenberger 2022-12-04 13:57:44 +0000
20637d8d7f Accepting request 1039709 from home:yarunachalam:branches:devel:languages:python Dirk Mueller 2022-12-03 07:29:01 +0000
d95cdeb12d Accepting request 1032182 from devel:languages:python Dominique Leuenberger 2022-11-04 16:31:30 +0000
fa191726b9 Accepting request 1031656 from home:yarunachalam:branches:devel:languages:python Matej Cepl 2022-10-29 11:47:59 +0000
85cdd603a9 Accepting request 1004361 from devel:languages:python Dominique Leuenberger 2022-09-18 15:31:58 +0000
12f704616b - update to 2.1.1: * Function normalize scheduled for removal in 3.0 * Removed useless call to decode in fn is_unprintable (#206) Dirk Mueller 2022-09-17 15:50:18 +0000
df0a5b7224 Accepting request 998090 from devel:languages:python Dominique Leuenberger 2022-08-20 18:27:45 +0000
eac72ae8c5 Accepting request 998013 from home:bnavigator:branches:devel:languages:python Dirk Mueller 2022-08-19 06:47:38 +0000
18b088feb8 Accepting request 991152 from devel:languages:python Richard Brown 2022-07-26 17:42:09 +0000
8dca1a6616 - update to 2.1.0: * Output the Unicode table version when running the CLI with --version * Re-use decoded buffer for single byte character sets * Fixing some performance bottlenecks * Workaround potential bug in cpython with Zero Width No-Break Space located * in Arabic Presentation Forms-B, Unicode 1.1 not acknowledged as space * CLI default threshold aligned with the API threshold from * Support for Python 3.5 (PR #192) * Use of backport unicodedata from unicodedata2 as Python is quickly catching up, scheduled for removal in 3.0 Dirk Mueller 2022-07-19 11:40:33 +0000
fbde6a8151 Accepting request 954654 from devel:languages:python Dominique Leuenberger 2022-02-16 23:29:57 +0000
259f5f1afe - update to 2.0.12: * ASCII miss-detection on rare cases (PR #170) * Explicit support for Python 3.11 (PR #164) * The logging behavior have been completely reviewed, now using only TRACE and DEBUG levels Dirk Mueller 2022-02-15 08:43:43 +0000
52bba14558 Accepting request 945443 from devel:languages:python Dominique Leuenberger 2022-01-11 20:17:16 +0000
c739862e1a - update to 2.0.10: * Fallback match entries might lead to UnicodeDecodeError for large bytes sequence * Skipping the language-detection (CD) on ASCII Dirk Mueller 2022-01-10 23:04:22 +0000
fecde8793e Accepting request 936118 from devel:languages:python Dominique Leuenberger 2021-12-09 18:45:18 +0000
53a1bfb655 - update to 2.0.9: * Moderating the logging impact (since 2.0.8) for specific environments * Wrong logging level applied when setting kwarg explain to True Dirk Mueller 2021-12-06 20:09:48 +0000
499b903d6c Accepting request 934519 from devel:languages:python Dominique Leuenberger 2021-12-01 19:46:49 +0000
4e6d945d9a - update to 2.0.8: * Improvement over Vietnamese detection * MD improvement on trailing data and long foreign (non-pure latin) * Efficiency improvements in cd/alphabet_languages * call sum() without an intermediary list following PEP 289 recommendations * Code style as refactored by Sourcery-AI * Minor adjustment on the MD around european words * Remove and replace SRTs from assets / tests * Initialize the library logger with a NullHandler by default * Setting kwarg explain to True will add provisionally * Fix large (misleading) sequence giving UnicodeDecodeError * Avoid using too insignificant chunk * Add and expose function set_logging_handler to configure a specific StreamHandler Dirk Mueller 2021-11-29 11:18:31 +0000
380896adbc - require lower-case name instead of breaking build Dirk Mueller 2021-11-26 11:35:38 +0000
515e72fd80 - Use lower-case name of prettytable package Matej Cepl 2021-11-25 22:27:00 +0000
80b5313625 Accepting request 927599 from devel:languages:python Dominique Leuenberger 2021-10-27 20:21:05 +0000
fd5f5dc1f2 Accepting request 925848 from home:mnhauke Dirk Mueller 2021-10-26 20:41:42 +0000
ef5560a5e2 Accepting request 894589 from devel:languages:python Dominique Leuenberger 2021-05-20 17:25:29 +0000
d5d2d1f9e5 Accepting request 894588 from home:pgajdos:python Markéta Machová 2021-05-20 09:54:40 +0000
0ab2750707 Accepting request 870785 from devel:languages:python Richard Brown 2021-03-30 19:03:00 +0000
4ec9d5c90b Accepting request 870710 from home:jayvdb:branches:devel:languages:python Dirk Mueller 2021-02-10 08:09:39 +0000
9de3503c99 Accepting request 808799 from devel:languages:python Yuchen Lin 2020-05-26 15:49:52 +0000
204ac0c668 Accepting request 808744 from home:pgajdos:python Tomáš Chvátal 2020-05-25 13:36:05 +0000
fb143fcd47 Accepting request 767923 from devel:languages:python Dominique Leuenberger 2020-01-28 09:57:38 +0000
611d9d38c7 Accepting request 767602 from home:mcalabkova:branches:devel:languages:python Tomáš Chvátal 2020-01-28 08:11:06 +0000
e226778961 Accepting request 734952 from devel:languages:python Dominique Leuenberger 2019-10-16 07:12:25 +0000
631de8d368 Accepting request 734946 from home:mcalabkova:branches:devel:languages:python Tomáš Chvátal 2019-10-04 09:50:53 +0000
5ab4dcf5cc Accepting request 733394 from devel:languages:python Dominique Leuenberger 2019-09-27 12:51:35 +0000
c385f2c788 - Update to 1.1.1: * from_bytes parameters steps and chunk_size were not adapted to sequence len if provided values were not fitted to content * Sequence having lenght bellow 10 chars was not checked * Legacy detect method inspired by chardet was not returning * Various more test updates Tomáš Chvátal 2019-09-26 10:38:40 +0000
5cb7342274 - Update to 0.3: * Improvement on detection * Performance loss to expect * Added --threshold option to CLI * Bugfix on UTF 7 support * Legacy detect(byte_str) method * BOM support (Unicode mostly) * Chaos prober improved on small text * Language detection has been reviewed to give better result * Bugfix on jp detection, every jp text was considered chaotic Tomáš Chvátal 2019-09-13 11:07:21 +0000
4357e3b4f3 Accepting request 727095 from devel:languages:python Dominique Leuenberger 2019-09-04 07:09:48 +0000
1b68d0fd4b - Fix the tarball to really be the one published by upstream Tomáš Chvátal 2019-08-30 00:46:43 +0000
4b06d6e2e5 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=2 Tomáš Chvátal 2019-08-30 00:46:24 +0000
2365d5732b Accepting request 726939 from home:jayvdb:py-new Tomáš Chvátal 2019-08-29 10:43:06 +0000

Commit Graph Select branches Hide Pull Requests factory Mono Color

Commit Graph

Select branches

Hide Pull Requests

factory