diff --git a/pyarrow-11.0.0.tar.gz b/pyarrow-11.0.0.tar.gz
deleted file mode 100644
index 652320e..0000000
--- a/pyarrow-11.0.0.tar.gz
+++ /dev/null
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5461c57dbdb211a632a48facb9b39bbeb8a7905ec95d768078525283caef5f6d
-size 1020639
diff --git a/pyarrow-12.0.0.tar.gz b/pyarrow-12.0.0.tar.gz
new file mode 100644
index 0000000..a9a069a
--- /dev/null
+++ b/pyarrow-12.0.0.tar.gz
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19c812d303610ab5d664b7b1de4051ae23565f9f94d04cbea9e50569746ae1ee
+size 1014741
diff --git a/python-pyarrow.changes b/python-pyarrow.changes
index 4bd45fd..cf17d90 100644
--- a/python-pyarrow.changes
+++ b/python-pyarrow.changes
@@ -1,3 +1,66 @@
+-------------------------------------------------------------------
+Thu May 18 07:28:28 UTC 2023 - Ben Greiner
+
+- Update to 12.0.0
+  ## Compatibility notes:
+  * Plasma has been removed in this release (GH-33243). In
+    addition, the deprecated serialization module in PyArrow was
+    also removed (GH-29705). IPC (Inter-Process Communication)
+    functionality of pyarrow or the standard library pickle should
+    be used instead.
+  * The deprecated use_async keyword has been removed from the
+    dataset module (GH-30774)
+  * Minimum Cython version to build PyArrow from source has been
+    raised to 0.29.31 (GH-34933). In addition, PyArrow can now be
+    compiled using Cython 3 (GH-34564).
+  ## New features:
+  * A new pyarrow.acero module with initial bindings for the Acero
+    execution engine has been added (GH-33976)
+  * A new canonical extension type for fixed shaped tensor data has
+    been defined. This is exposed in PyArrow as the
+    FixedShapeTensorType (GH-34882, GH-34956)
+  * Run-End Encoded arrays binding has been implemented (GH-34686,
+    GH-34568)
+  * Method is_nan has been added to Array, ChunkedArray and
+    Expression (GH-34154)
+  * Dataframe interchange protocol has been implemented for
+    RecordBatch (GH-33926)
+  ## Other improvements:
+  * Extension arrays can now be concatenated (GH-31868)
+  * get_partition_keys helper function is implemented in the
+    dataset module to access the partitioning field’s key/value
+    from the partition expression of a certain dataset fragment
+    (GH-33825)
+  * PyArrow Array objects can now be accepted by the pa.array()
+    constructor (GH-34411)
+  * The default row group size when writing parquet files has been
+    changed (GH-34280)
+  * RecordBatch has the select() method implemented (GH-34359)
+  * New method drop_column on the pyarrow.Table supports passing a
+    single column as a string (GH-33377)
+  * User-defined tabular functions, which are a user-functions
+    implemented in Python that return a stateful stream of tabular
+    data, are now also supported (GH-32916)
+  * Arrow Archery tool now includes linting of the Cython files
+    (GH-31905)
+  * Breaking Change: Reorder output fields of “group_by” node so
+    that keys/segment keys come before aggregates (GH-33616)
+  ## Relevant bug fixes:
+  * Acero can now detect and raise an error in case a join
+    operation needs too much bytes of key data (GH-34474)
+  * Fix for converting non-sequence object in pa.array() (GH-34944)
+  * Fix erroneous table conversion to pandas if table includes an
+    extension array that does not implement to_pandas_dtype
+    (GH-34906)
+  * Reading from a closed ArrayStreamBatchReader now returns
+    invalid status instead of segfaulting (GH-34165)
+  * array() now returns pyarrow.Array and not pyarrow.ChunkedArray
+    for columns with __arrow_array__ method and only one chunk so
+    that the conversion of pandas dataframe with categorical column
+    of dtype string[pyarrow] does not fail (GH-33727)
+  * Custom type mapper in to_pandas now converts index dtypes
+    together with column dtypes (GH-34283)
+
 -------------------------------------------------------------------
 Wed Mar 29 13:25:55 UTC 2023 - Ben Greiner
diff --git a/python-pyarrow.spec b/python-pyarrow.spec
index db56253..fdf825d 100644
--- a/python-pyarrow.spec
+++ b/python-pyarrow.spec
@@ -19,7 +19,7 @@
 %bcond_with xsimd
 %define plainpython python
 Name: python-pyarrow
-Version: 11.0.0
+Version: 12.0.0
 Release: 0
 Summary: Python library for Apache Arrow
 License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
@@ -30,13 +30,14 @@ Source: https://files.pythonhosted.org/packages/source/p/pyarrow/pyarrow
 Source10: LICENSE.txt
 Source11: NOTICE.txt
 Source99: python-pyarrow.rpmlintrc
-BuildRequires: %{python_module Cython >= 0.29}
+BuildRequires: %{python_module Cython >= 0.29.31}
 BuildRequires: %{python_module devel}
 BuildRequires: %{python_module numpy-devel >= 1.16.6}
 BuildRequires: %{python_module pip}
 BuildRequires: %{python_module setuptools_scm}
 BuildRequires: %{python_module setuptools}
 BuildRequires: %{python_module wheel}
+BuildRequires: apache-arrow-acero-devel-static = %{version}
 BuildRequires: apache-arrow-dataset-devel-static = %{version}
 BuildRequires: apache-arrow-devel = %{version}
 BuildRequires: apache-arrow-devel-static = %{version}
@@ -45,6 +46,7 @@ BuildRequires: apache-parquet-devel-static = %{version}
 BuildRequires: cmake
 BuildRequires: fdupes
 BuildRequires: gcc-c++
+BuildRequires: libzstd-devel-static
 BuildRequires: openssl-devel
 BuildRequires: pkgconfig
 BuildRequires: python-rpm-macros
@@ -53,14 +55,12 @@ BuildRequires: pkgconfig(bzip2) >= 1.0.8
 BuildRequires: pkgconfig(gmock) >= 1.10
 BuildRequires: pkgconfig(gtest) >= 1.10
 Requires: python-numpy >= 1.16.6
-Requires(post): update-alternatives
-Requires(postun): update-alternatives
 # SECTION test requirements
 BuildRequires: %{python_module hypothesis}
 BuildRequires: %{python_module pandas}
-BuildRequires: %{python_module pytest}
 BuildRequires: %{python_module pytest-lazy-fixture}
 BuildRequires: %{python_module pytest-xdist}
+BuildRequires: %{python_module pytest}
 # /SECTION
 %python_subpackages
@@ -80,11 +80,11 @@ analytics. Many popular projects use Arrow to ship columnar data
 efficiently or as the basis for analytic engines.
 %package devel
-Summary: Python library for Apache Arrow - header files
-Requires: python-Cython
-Requires: python-pyarrow = %{version}
-Requires: %plainpython(abi) = %python_version
-Supplements: (python-devel and python-pyarrow)
+Summary: Python library for Apache Arrow - header files
+Requires: python-Cython
+Requires: python-pyarrow = %{version}
+Requires: %plainpython(abi) = %python_version
+Supplements: (python-devel and python-pyarrow)
 %description devel
 Python library for Apache Arrow.
@@ -121,13 +121,14 @@ export PYARROW_CMAKE_OPTIONS=" \
 %install
 %pyproject_install
-%python_clone -a %{buildroot}%{_bindir}/plasma_store
 %python_expand %fdupes %{buildroot}%{$python_sitearch}
 %check
 pushd ..
 # Unexpected additional warning
 donttest="test_env_var"
+# flaky
+donttest="$donttest or test_total_bytes_allocated"
 %ifarch %{ix86} %{arm32}
 # tests conversion to 64bit datatypes
 donttest="$donttest or test_conversion"
@@ -141,18 +142,12 @@ donttest="$donttest or test_python_file_large_seeks"
 donttest="$donttest or test_schema_sizeof"
 %endif
 %pytest_arch --pyargs pyarrow -n auto -k "not ($donttest)"
+%pytest_arch --pyargs pyarrow -n auto -k "$donttest" || :
 popd
-%post
-%python_install_alternative plasma_store
-
-%postun
-%python_uninstall_alternative plasma_store
-
 %files %{python_files}
 %doc README.md
 %license LICENSE.txt NOTICE.txt
-%python_alternative %{_bindir}/plasma_store
 %{python_sitearch}/pyarrow
 %exclude %{python_sitearch}/pyarrow/include
 %exclude %{python_sitearch}/pyarrow/src
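
Note on the %check change: the spec now appears to run the test suite twice, once excluding everything accumulated in `donttest`, and once selecting only those tests, with `|| :` so that their failures do not fail the build. The following is a minimal shell sketch of how the two `-k` expressions are composed, outside of any RPM macro expansion. The variable names `skip_expr` and `only_expr` are illustrative and not part of the spec, and the architecture-specific additions inside `%ifarch` are left out.

```shell
#!/bin/sh
# Mirrors the accumulation pattern used in the spec's %check section.
donttest="test_env_var"
donttest="$donttest or test_total_bytes_allocated"

# First run: every test except the known-problematic ones.
skip_expr="not ($donttest)"
# Second run: only the known-problematic tests; the spec appends `|| :`
# to that command so its exit status is ignored.
only_expr="$donttest"

# Print the expressions as they would be passed to pytest's -k option.
echo "pytest -k \"$skip_expr\""
echo "pytest -k \"$only_expr\""
```

Keeping the skipped tests in a second, non-fatal run means their output still shows up in the build log, which may help when deciding whether a skip can later be dropped.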