From a9e704acd6ec862938c862f69cfbe8b8d1cf142b6cafc7e8947f7cf57d28f314 Mon Sep 17 00:00:00 2001 From: Dirk Mueller Date: Wed, 27 Dec 2023 16:08:06 +0000 Subject: [PATCH] - update to 3.5.2: * skip pandas ``pd.NA`` similar to ``None`` * add ``score_multiplier`` argument to ``process.cdist`` which allows multiplying the end result scores * with a constant factor. * drop support for Python 3.7 * improve performance of simd implementation for ``LCS`` / ``Indel`` / ``Jaro`` / ``JaroWinkler`` * improve performance of Jaro and Jaro Winkler for long sequences * implement ``process.extract`` with ``limit=1`` using ``process.extractOne`` which can be faster * the preprocessing function was always called through Python due to a broken C-API version check * fix wraparound issue in simd implementation of Jaro and Jaro Winkler * upgrade to ``Cython==3.0.3`` * add simd implementation for Jaro and Jaro Winkler * add missing tag for python 3.12 support * upgrade to ``Cython==3.0.2`` * implement the remaining missing features from the C++ implementation in the pure Python implementation * added support for Python 3.12 * build x86 with sse2/avx2 runtime detection * upgrade to ``taskflow==3.6`` * added keyword argument ``pad`` to Hamming distance. This controls whether sequences of different * length should be padded or lead to a ``ValueError`` * improve consistency of exception messages between the C++ and pure Python implementation OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-rapidfuzz?expand=0&rev=14 --- python-rapidfuzz.changes | 52 ++++++++++++++++++++++++++++++++++++++++ python-rapidfuzz.spec | 5 ++-- rapidfuzz-2.13.7.tar.gz | 3 --- rapidfuzz-3.5.2.tar.gz | 3 +++ 4 files changed, 58 insertions(+), 5 deletions(-) delete mode 100644 rapidfuzz-2.13.7.tar.gz create mode 100644 rapidfuzz-3.5.2.tar.gz diff --git a/python-rapidfuzz.changes b/python-rapidfuzz.changes index dc5a857..ee47fa2 100644 --- a/python-rapidfuzz.changes +++ b/python-rapidfuzz.changes @@ -1,3 +1,55 @@ +------------------------------------------------------------------- +Wed Dec 27 16:06:20 UTC 2023 - Dirk Müller + +- update to 3.5.2: + * skip pandas ``pd.NA`` similar to ``None`` + * add ``score_multiplier`` argument to ``process.cdist`` which + allows multiplying the end result scores + * with a constant factor. + * drop support for Python 3.7 + * improve performance of simd implementation for ``LCS`` / + ``Indel`` / ``Jaro`` / ``JaroWinkler`` + * improve performance of Jaro and Jaro Winkler for long + sequences + * implement ``process.extract`` with ``limit=1`` using + ``process.extractOne`` which can be faster + * the preprocessing function was always called through Python + due to a broken C-API version check + * fix wraparound issue in simd implementation of Jaro and Jaro + Winkler + * upgrade to ``Cython==3.0.3`` + * add simd implementation for Jaro and Jaro Winkler + * add missing tag for python 3.12 support + * upgrade to ``Cython==3.0.2`` + * implement the remaining missing features from the C++ + implementation in the pure Python implementation + * added support for Python 3.12 + * build x86 with sse2/avx2 runtime detection + * upgrade to ``taskflow==3.6`` + * added keyword argument ``pad`` to Hamming distance. This + controls whether sequences of different + * length should be padded or lead to a ``ValueError`` + * improve consistency of exception messages between the C++ and + pure Python implementation + * upgrade required Cython version to ``Cython==3.0.0b3`` + * allow the usage of ``Hamming`` for different string lengths. + Length differences are handled as insertions / deletions + * remove support for boolean preprocessor functions in + ``rapidfuzz.fuzz`` and ``rapidfuzz.process``. + * The processor argument is now always a callable or ``None``. + * update defaults of the processor argument to be ``None`` + everywhere. For affected functions this can change results, + since strings are no longer preprocessed. + * fix incorrect tag dispatching implementation leading to AVX2 + instructions in the SSE2 code path + * add wheels for windows arm64 + * allow the usage of finite generators as choices in + ``process.extract`` + * fix handling of non symmetric scorers in pure python version + of ``process.cdist`` + * fix default dtype handling when using ``process.cdist`` with + pure python scorers + ------------------------------------------------------------------- Fri Dec 1 17:57:05 UTC 2023 - Antonio Larrosa diff --git a/python-rapidfuzz.spec b/python-rapidfuzz.spec index 845e2d2..6f97471 100644 --- a/python-rapidfuzz.spec +++ b/python-rapidfuzz.spec @@ -18,7 +18,7 @@ %{?sle15_python_module_pythons} Name: python-rapidfuzz -Version: 2.13.7 +Version: 3.5.2 Release: 0 Summary: Rapid fuzzy string matching License: MIT @@ -45,7 +45,8 @@ BuildRequires: gcc-c++ %python_subpackages %description -RapidFuzz is a fast string matching library for Python and C++, which is using the string similarity calculations from FuzzyWuzzy. +RapidFuzz is a fast string matching library for Python and C++, which is using +the string similarity calculations from FuzzyWuzzy. %prep %autosetup -p1 -n rapidfuzz-%{version} diff --git a/rapidfuzz-2.13.7.tar.gz b/rapidfuzz-2.13.7.tar.gz deleted file mode 100644 index 10c1f9b..0000000 --- a/rapidfuzz-2.13.7.tar.gz +++ /dev/null @@ -1,3 +0,0 @@ -version https://git-lfs.github.com/spec/v1 -oid sha256:8d3e252d4127c79b4d7c2ae47271636cbaca905c8bb46d80c7930ab906cf4b5c -size 999972 diff --git a/rapidfuzz-3.5.2.tar.gz b/rapidfuzz-3.5.2.tar.gz new file mode 100644 index 0000000..39a5b95 --- /dev/null +++ b/rapidfuzz-3.5.2.tar.gz @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9e9b395743e12c36a3167a3a9fd1b4e11d92fb0aa21ec98017ee6df639ed385e +size 1524315