6 Commits

Author SHA256 Message Date
Tomáš Chvátal
611d9d38c7 Accepting request 767602 from home:mcalabkova:branches:devel:languages:python
- Update to 1.3.4
  * Improvement/Bugfix : False positive when searching for successive upper, lower char. (ProbeChaos)
  * Improvement : Noticeably better detection for jp
  * Bugfix : Passing zero-length bytes to from_bytes
  * Improvement : Expose version in package
  * Bugfix : Division by zero
  * Improvement : Prefers unicode (utf-8) when detected
  * Apparently dropped Python2 silently

OBS-URL: https://build.opensuse.org/request/show/767602
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=10
2020-01-28 08:11:06 +00:00
Tomáš Chvátal
631de8d368 Accepting request 734946 from home:mcalabkova:branches:devel:languages:python
- Update to 1.3.0
  * Backport unicodedata for v12 impl into python if available
  * Add aliases to CharsetNormalizerMatches class
  * Add feature preemptive behaviour, looking for encoding declaration
  * Add method to determine if specific encoding is multi byte
  * Add has_submatch property on a match
  * Add percent_chaos and percent_coherence
  * Coherence ratio based on mean instead of sum of best results
  * Using loguru for trace/debug <3
  * from_bytes method improved

OBS-URL: https://build.opensuse.org/request/show/734946
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=8
2019-10-04 09:50:53 +00:00
Tomáš Chvátal
c385f2c788 - Update to 1.1.1:
  * from_bytes parameters steps and chunk_size were not adapted to the sequence length if the provided values did not fit the content
  * Sequences with a length below 10 chars were not checked
  * Legacy detect method inspired by chardet was not returning
  * Various more test updates

OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=6
2019-09-26 10:38:40 +00:00
Tomáš Chvátal
5cb7342274 - Update to 0.3:
  * Improvement on detection
  * Performance loss to expect
  * Added --threshold option to CLI
  * Bugfix on UTF 7 support
  * Legacy detect(byte_str) method
  * BOM support (Unicode mostly)
  * Chaos prober improved on small text
  * Language detection has been reviewed to give better result
  * Bugfix on jp detection, every jp text was considered chaotic

OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=5
2019-09-13 11:07:21 +00:00
Tomáš Chvátal
4b06d6e2e5 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=2
2019-08-30 00:46:24 +00:00
Tomáš Chvátal
2365d5732b Accepting request 726939 from home:jayvdb:py-new
A very new & impressive (and the only) alternative to chardet which has stagnated lately

OBS-URL: https://build.opensuse.org/request/show/726939
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-charset-normalizer?expand=0&rev=1
2019-08-29 10:43:06 +00:00
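
The 0.3 entry above lists "BOM support (Unicode mostly)". As a rough illustration of what byte-order-mark handling involves, here is a minimal sketch that uses only the Python standard library. It is not the package's own implementation; the helper names sniff_bom and decode_with_bom are hypothetical and exist only for this example.

```python
import codecs

# Known BOM signatures, longest first: the UTF-32-LE BOM begins with the
# same two bytes as the UTF-16-LE BOM, so UTF-32 must be tested first.
# Caveat: UTF-16-LE text whose first character is U+0000 is ambiguous
# with UTF-32-LE under this simple check.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(payload: bytes):
    """Return (encoding, bom_length) if payload starts with a known BOM,
    otherwise (None, 0). Hypothetical helper, not part of any library."""
    for bom, name in _BOMS:
        if payload.startswith(bom):
            return name, len(bom)
    return None, 0


def decode_with_bom(payload: bytes, fallback: str = "utf-8") -> str:
    """Strip a leading BOM, if any, and decode with the encoding it
    announces; without a BOM, decode with the assumed fallback."""
    encoding, skip = sniff_bom(payload)
    return payload[skip:].decode(encoding or fallback)
```

A BOM is only a hint about Unicode encodings; content without one still needs statistical detection of the kind the changelog entries describe.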