Compare commits
25 Commits
| Author | SHA256 | Date | |
|---|---|---|---|
| 29b55664af | |||
| 1aba9e9712 | |||
| 8697b15a63 | |||
| 853a205aac | |||
| d35123d1c1 | |||
| a03ab640dd | |||
| 12b0bf8517 | |||
| 4d14f521d8 | |||
| 986ddd3f2e | |||
| c3e5d75605 | |||
| 1d57fa866b | |||
| 01acc18061 | |||
| 285eb6979a | |||
| ea30dc8735 | |||
| e02b9f7269 | |||
| 2c44dc303e | |||
| 1392e3167f | |||
| d9b8e0ac6e | |||
| 55775895c9 | |||
| ffdc9dadfc | |||
| be27bc1230 | |||
| 758d4c683d | |||
| 3f02fd3dcd | |||
| 1db8e83530 | |||
| 9bed06f66b |
13
apache-arrow-19.0.1-mimalloc-version.patch
Normal file
13
apache-arrow-19.0.1-mimalloc-version.patch
Normal file
@@ -0,0 +1,13 @@
|
||||
--- a/cpp/thirdparty/versions.txt 2025-02-11 23:16:06.000000000 +0100
|
||||
+++ b/cpp/thirdparty/versions.txt 2025-04-21 15:34:04.565829184 +0200
|
||||
@@ -82,8 +82,8 @@
|
||||
ARROW_JEMALLOC_BUILD_SHA256_CHECKSUM=2db82d1e7119df3e71b7640219b6dfe84789bc0537983c3b7ac4f7189aecfeaa
|
||||
ARROW_LZ4_BUILD_VERSION=v1.10.0
|
||||
ARROW_LZ4_BUILD_SHA256_CHECKSUM=537512904744b35e232912055ccf8ec66d768639ff3abe5788d90d792ec5f48b
|
||||
-ARROW_MIMALLOC_BUILD_VERSION=v2.0.6
|
||||
-ARROW_MIMALLOC_BUILD_SHA256_CHECKSUM=9f05c94cc2b017ed13698834ac2a3567b6339a8bde27640df5a1581d49d05ce5
|
||||
+ARROW_MIMALLOC_BUILD_VERSION=v2.0.9
|
||||
+ARROW_MIMALLOC_BUILD_SHA256_CHECKSUM=4a29edae32a914a706715e2ac8e7e4109e25353212edeed0888f4e3e15db5850
|
||||
ARROW_NLOHMANN_JSON_BUILD_VERSION=v3.10.5
|
||||
ARROW_NLOHMANN_JSON_BUILD_SHA256_CHECKSUM=5daca6ca216495edf89d167f808d1d03c4a4d929cef7da5e10f135ae1540c7e4
|
||||
ARROW_OPENTELEMETRY_BUILD_VERSION=v1.13.0
|
||||
3
apache-arrow-20.0.0.tar.gz
Normal file
3
apache-arrow-20.0.0.tar.gz
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:67e31a4f46528634b8c3cbb0dc60ac8f85859d906b400d83d0b6f732b0c5b0e3
|
||||
size 17592223
|
||||
@@ -1,3 +0,0 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:e92401790fdba33bfb4b8aa522626d800ea7fda4b6f036aaf39849927d2cf88d
|
||||
size 17241418
|
||||
@@ -1,328 +1,3 @@
|
||||
-------------------------------------------------------------------
|
||||
Fri Sep 26 16:52:42 UTC 2025 - Ben Greiner <code@bnavigator.de>
|
||||
|
||||
- Remove boost::system dependency for Tumbleweed
|
||||
* Add arrow-boost-system-1.89-boo1249599.patch
|
||||
* gh#boostorg/system#132
|
||||
* boo#1249599
|
||||
|
||||
-------------------------------------------------------------------
|
||||
Thu Sep 25 10:24:04 UTC 2025 - Ben Greiner <code@bnavigator.de>
|
||||
|
||||
- Update to 21.0.0
|
||||
## Bug Fixes
|
||||
* GH-32276 - [C++][FlightRPC] Add option to align RecordBatch
|
||||
buffers given to IPC reader (#44279)
|
||||
* GH-35166 - [C++][Compute] Increase precision of decimals in sum
|
||||
aggregates (#44184)
|
||||
* GH-40756 - [C++] Remove dead Boost urls (#46452)
|
||||
* GH-45532 - [C++] RunEndEncodedBuilder should clear dimensions
|
||||
after a Finish() call (#45533)
|
||||
* GH-45534 - [C++] Test: RunEndEncodeTableColumns should update
|
||||
REE columns' schema types (#45535)
|
||||
* GH-45608 - [C++][Flight] Fix compilation for clang (#46264)
|
||||
* GH-45735 - [C++] Broken tests for extract_regex compute funcion
|
||||
(#45900)
|
||||
* GH-45853 - [C++][Dev] Fix Meson compilation issues in Docker
|
||||
builds (#45858)
|
||||
* GH-46011 - [C++] Hide DCHECK family from public headers
|
||||
(#46015)
|
||||
* GH-46025 - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT
|
||||
for libarrow_cuda (#46030)
|
||||
* GH-46052 - [C++][Benchmarking] Don't build grouper benchmark
|
||||
without ARROW_COMPUTE=ON (#46053)
|
||||
* GH-46070 - [C++] Remove duplicate storage_type in JsonExtension
|
||||
(#46071)
|
||||
* GH-46084 - [C++] Always use ARROW_VCPKG to detect vcpkg mode
|
||||
(#46467)
|
||||
* GH-46090 - [C++] Set default IPC option to enabled in Meson
|
||||
(#46114)
|
||||
* GH-46094 - [C++][Docs] Add note to RleDecoder::Get's doc
|
||||
comment (#46874)
|
||||
* GH-46146 - [C++] Merge metadata in SchemaBuidler::AddMetadata
|
||||
(#46654)
|
||||
* GH-46149 - [C++] Opening dataset fails with sshfs-3.7.3 due to
|
||||
F_RDADVISE error (#46346)
|
||||
* GH-46157 - [C++] Move test utility RunEndEncodeTableColumns
|
||||
that uses REE to test_util_internal on acero instead of common
|
||||
gtest_util (#46161)
|
||||
* GH-46192 - [C++] Add substrait dep to third party download
|
||||
script (#46191)
|
||||
* GH-46197 - [C++] Tests use legacy timezones (#46201)
|
||||
* GH-46214 - [C++] Improve S3 client initialization (#46723)
|
||||
* GH-46224 - [C++][Acero] Fix the hang in asof join (#46300)
|
||||
* GH-46231 - [C++][CMake] Fix arrow_bundled_dependencies to be
|
||||
externally accessible by FetchContent (#46232)
|
||||
* GH-46233 - [C++] Fix missing nested braces in QueuedTask
|
||||
initialization (#46234)
|
||||
* GH-46268 - [C++] Improve ArrayData docstrings (#46271)
|
||||
* GH-46270 - [C++][Parquet] Clarify GeoStatistics docstring
|
||||
(#46649)
|
||||
* GH-46299 - [C++][Compute] Don't use static inline const for
|
||||
default options (#46303)
|
||||
* GH-46306 - [C++][Parquet] Should use LoadEnumSafe for geo enum
|
||||
(#46307)
|
||||
* GH-46314 - [C++][Parquet] Fix valgrind error when collecting
|
||||
parameterized tests for MakeWKBPoint (#46320)
|
||||
* GH-46326 - [C++][Parquet] Fix stack overflow in rapidjson value
|
||||
comparison to integer (#46327)
|
||||
* GH-46359 - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360)
|
||||
* GH-46394 - [C++][R] gcc-UBSAN errors on CRAN (#46397)
|
||||
* GH-46395 - [C++][Statistics] Use EqualOptions for min and max
|
||||
in arrow::ArrayStatistics::Equals() (#46422)
|
||||
* GH-46407 - [C++] Fix IPC serialization of sliced list arrays
|
||||
(#46408)
|
||||
* GH-46414 - [C++] Fix GCS filesystem getFileInfo method (#46416)
|
||||
* GH-46417 - [C++][Parquet] Fix UB in LoadEnumSafe for
|
||||
EdgeInterpolationAlgorithm (#46418)
|
||||
* GH-46419 - [C++] Remove duplicate declaration and sync arg
|
||||
names on acero test_util_internal functions (#45400)
|
||||
* GH-46420 - [C++][Dataset] Fix DatasetWriter deadlock on
|
||||
writting batch greater than max_rows_queued (#46139)
|
||||
* GH-46424 - [C++][Parquet] Fix erroneous unit test skip (#46425)
|
||||
* GH-46435 - [Parquet][C++] Fix uninitialized value in writer
|
||||
test (#46533)
|
||||
* GH-46478 - [C++] Implement recent JSON changes into Meson
|
||||
configuration (#46479)
|
||||
* GH-46481 - [C++][Python] Allow nullable schema in FlightInfo
|
||||
(#46489)
|
||||
* GH-46512 - [CI][C++] Install the llvm package explicitly on
|
||||
MSYS2 (#46525)
|
||||
* GH-46564 - [C++] Export ARROW_VCPKG in ArrowConfig.cmake
|
||||
(#46565)
|
||||
* GH-46576 - [C++] Suppress codecvt_utf8 deprecation warning
|
||||
(#46622)
|
||||
* GH-46589 - [C++] Fix utf8_is_digit to support full Unicode
|
||||
digit range (#46590)
|
||||
* GH-46599 - [C++][Doc][Parquet] Update supported types
|
||||
documentation (#46620)
|
||||
* GH-46611 - [Python][C++] Allow building float16 arrays without
|
||||
numpy (#46618)
|
||||
* GH-46623 - [C++][Compute] Fix the failure of large memory test
|
||||
in arrow-compute-row-test (#46635)
|
||||
* GH-46659 - [C++] Fix export of extension arrays with binary
|
||||
view/string view storage (#46660)
|
||||
* GH-46674 - [C++] Construct Array from ExtensionType Scalar
|
||||
(#46675)
|
||||
* GH-46684 - [C++] Fix Meson configuration issue on Windows
|
||||
(#46685)
|
||||
* GH-46704 - [C++] Fix OSS-Fuzz build failure (#46706)
|
||||
* GH-46708 - [C++][Gandiva] Added zero return values for
|
||||
castDECIMAL_utf8 (#46709)
|
||||
* GH-46710 - [C++] Fix ownership and lifetime issues in Dataset
|
||||
Writer (#46711)
|
||||
* GH-46724 - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in
|
||||
handling statistics (#46725)
|
||||
* GH-46761 - [C++] Add executable detection on FreeBSD (#46759)
|
||||
* GH-46764 - [C++][Gandiva] Fix wrong .bc depends (#46765)
|
||||
* GH-46777 - [C++] Use SimplifyIsIn only when the value_set of
|
||||
the expression is lower than a threshold (#46859)
|
||||
* GH-46811 - [C++][Python] Fix crash on
|
||||
FileReaderImpl::GetRecordBatchReader (#46931)
|
||||
* GH-46827 - [C++] Update Meson Configuration for compute shared
|
||||
lib (#46839)
|
||||
* GH-46831 - [C++][R] Remove some pending references to CMake <
|
||||
3.25 (docs + minor CMake references) (#46834)
|
||||
* GH-46841 - [C++][Gandiva] Fix date trunc edge case (#46842)
|
||||
* GH-46863 - [CI][C++] Suppress a false positive UBSAN error in
|
||||
AWS SDK for C++ (#46870)
|
||||
* GH-46871 - [C++][Parquet] Restore implementation of 3
|
||||
arrow::FileReader::GetRecordBatchReader() functions (#46868)
|
||||
* GH-46888 - [C++] Remove override of default buildtype in Meson
|
||||
config (#46919)
|
||||
* GH-46915 - [C++][Compute] Initialize Compute kernels on
|
||||
benchmarks that require extra kernels (#46922)
|
||||
* GH-46934 - [C++][Parquet] Trying to fix ub in AttachStatistics
|
||||
(#46940)
|
||||
* GH-46986 - [CI][C++] Fix a build error with C++20 (#46987)
|
||||
* GH-46988 - [C++][Parquet] Fix FLBA DecodeArrow multiply
|
||||
overflow (#46991)
|
||||
* GH-46995 - [CI][R][C++] Use system memory allocator in
|
||||
sanitizer jobs (#47007)
|
||||
* GH-46998 - [C++] Fix mockfs.cc compiling error with C++23
|
||||
(#46999)
|
||||
* GH-47015 - [CI][C++] Use mold on conda-cpp to work around
|
||||
issues with GNU ld (#47028)
|
||||
* GH-47033 - [C++][Compute] Never use custom gtest main with MSVC
|
||||
(#47049)
|
||||
* GH-47037 - [CI][C++] Fix Fedora 39 CI jobs (#47038)
|
||||
## New Features and Improvements
|
||||
* GH-25025 - [C++] Move non core compute kernels into separate
|
||||
shared library (#46261)
|
||||
* GH-26818 - [C++][Python] Preserve order when writing dataset
|
||||
multi-threaded (#44470)
|
||||
* GH-36753 - [C++] Properly pretty-print and diff HalfFloatArrays
|
||||
(#46857)
|
||||
* GH-37027 - [C++] Add float16 kernels to if-else and
|
||||
vector-replace functions (#46446)
|
||||
* GH-37677 - [C++][FlightRPC] Allow FlightInfo.schema to be
|
||||
nullable
|
||||
* GH-37891 - [C++][Parquet] Refine several classes in Parquet
|
||||
encryption (#46202)
|
||||
* GH-37891 - [C++] Followup Buffer change to use sptr move
|
||||
(#46027)
|
||||
* GH-39294 - [C++][Python] DLPack on Tensor class (#42118)
|
||||
* GH-40278 - [C++] Support casting string to duration in CSV
|
||||
converter (#46035)
|
||||
* GH-40343 - [C++] Move S3FileSystem to the registry (#41559)
|
||||
* GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as
|
||||
Large/View types directly (#46532)
|
||||
* GH-43807 - [C++][Python] Add UUID extension type conversion
|
||||
support to/from Parquet (#45866)
|
||||
* GH-43891 - [C++][Parquet] Faster reading of
|
||||
FIXED_LEN_BYTE_ARRAY data (#46886)
|
||||
* GH-45028 - [C++][Compute] Allow cast to reorder struct fields
|
||||
(#45246)
|
||||
* GH-45083 - [C++] Add HalfFloat kernels for is_nan, is_inf,
|
||||
is_finite, negate, negate_checked, sign (#46866)
|
||||
* GH-45195 - [C++] Update bundled AWS SDK for C++ to 1.11.587
|
||||
(#45306)
|
||||
* GH-45522 - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY
|
||||
logical type implementations (#45459)
|
||||
* GH-45664 - [C++] Allow
|
||||
LargeString,LargeBinary,FixedSizeBinary,StringView and
|
||||
BinaryView for RecordBatch::MakeStatisticsArray() (#46031)
|
||||
* GH-45750 - [C++][Python][Parquet] Implement Content-Defined
|
||||
Chunking for the Parquet writer (#45360)
|
||||
* GH-45794 - [C++] Add array directory to Meson configuration
|
||||
(#45795)
|
||||
* GH-45796 - [C++] Add integration directory to Meson
|
||||
configuration (#45797)
|
||||
* GH-45798 - [C++] Add extension directory to Meson (#45799)
|
||||
* GH-45800 - [C++] Implement util configuration in Meson (#45824)
|
||||
* GH-45829 - [C++] Add compute directory to Meson configuration
|
||||
(#45830)
|
||||
* GH-45833 - [C++] Add JSON directory to Meson configuration
|
||||
(#45834)
|
||||
* GH-45865 - [C++] Create dedicated benchmark dependency in Meson
|
||||
(#45909)
|
||||
* GH-45908 - [C++][Docs] Rename and expose basic
|
||||
{Array,...}FromJSON helpers as public APIs (#46180)
|
||||
* GH-45957 - [C++][Python] Expose allow_delayed_open on
|
||||
S3FileSystem (#46078)
|
||||
* GH-45978 - [C++] Bump bundled mimalloc version (#45979)
|
||||
* GH-45991 - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112)
|
||||
* GH-45992 - [C++] Bump bundled utf8proc version to 2.10.0
|
||||
(#46032)
|
||||
* GH-46091 - [C++] Use feature options in Meson configuration
|
||||
(#46204)
|
||||
* GH-46092 - [C++] Add filesystem related options to Meson
|
||||
(#46101)
|
||||
* GH-46104 - GH-45937: [C++][Parquet] Logical type definition for
|
||||
variant
|
||||
* GH-46115 - [C++] Implement compression libraries in Meson
|
||||
(#46358)
|
||||
* GH-46116 - [C++] Implement IPC directory in Meson (#46117)
|
||||
* GH-46118 - [C++] Add tensor directory to Meson (#46119)
|
||||
* GH-46132 - [C++][Parquet] Remove deprecated parquet APIs from
|
||||
19.0.0 (#46133)
|
||||
* GH-46141 - [C++] Add flight directory to Meson configuration
|
||||
(#46142)
|
||||
* GH-46153 - [C++] Implement acero directory in Meson (#46154)
|
||||
* GH-46155 - [C++] Implement Tensorflow directory in Meson
|
||||
(#46156)
|
||||
* GH-46163 - [C++] Add vendored directory to Meson (#46164)
|
||||
* GH-46196 - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and
|
||||
related logic (#46200)
|
||||
* GH-46207 - [C++] Rename arrow::util::StringBuilder and move to
|
||||
internal namespace (#46813)
|
||||
* GH-46209 - [Documentation][C++][Compute] Add cpp developer
|
||||
documentation for row table (#46210)
|
||||
* GH-46215 - [C++][Docs] Add README for Meson subprojects
|
||||
directory (#46216)
|
||||
* GH-46217 - [C++][Parquet] Update the timestamp of
|
||||
parquet::encryption::TwoLevelCacheWithExpiration correctly
|
||||
(#46283)
|
||||
* GH-46219 - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY
|
||||
option (#46274)
|
||||
* GH-46285 - [C++] Add support for Decimal32/64 and HalfFloat to
|
||||
run_end_encode/run_end_decode (#46286)
|
||||
* GH-46318 - [Docs][C++] Add Extension Array/Type documents
|
||||
(#46319)
|
||||
* GH-46321 - [C++][Doc] Better explain ArrayData IsValid and
|
||||
GetNullCount (#46332)
|
||||
* GH-46338 - [C++] Add compile step for Meson in cpp_build.sh
|
||||
(#46339)
|
||||
* GH-46367 - [C++] Prevent Meson from using git info if built as
|
||||
subproject (#46368)
|
||||
* GH-46386 - [C++] Ensure using our CMake packages not
|
||||
Find*.cmake (#46387)
|
||||
* GH-46388 - [C++] Check Snappy::snappy{,-static} in
|
||||
FindSnappyAlt.cmake (#46389)
|
||||
* GH-46396 - [C++][Documentation][Statistics] Revise the
|
||||
documentation to clarify that arrow::ArrayStatistics is ignored
|
||||
during arrow::Array comparisons (#46470)
|
||||
* GH-46403 - [C++] Add support for limiting element size when
|
||||
printing data (#46536)
|
||||
* GH-46439 - [C++] Use result pattern for all FromJSONString
|
||||
Helpers (#46696)
|
||||
* GH-46439 - [C++] Rename internal Converter class in
|
||||
from_string.cc (#46697)
|
||||
* GH-46439 - [C++] Remove unneeded namespace prefix in
|
||||
test_util_internal.h (#46695)
|
||||
* GH-46444 - [Documentation][C++][Acero] Move internal Swiss
|
||||
table doc into public C++ developer doc (#46445)
|
||||
* GH-46459 - [C++] Make some arrow/util headers internal (#46721)
|
||||
* GH-46462 - [C++][Parquet] Expose currently thrown
|
||||
EncodedStatistics when checking is_stats_set (#46463)
|
||||
* GH-46473 - [C++][Docs] Fix typos in decimal comments (#46474)
|
||||
* GH-46475 - [Documentation][C++][Compute] Consolidate Acero
|
||||
developer docs (#46476)
|
||||
* GH-46477 - [C++] Use vendored flatbuffers in Meson
|
||||
configuration (#46484)
|
||||
* GH-46487 - [C++] Refactor lz4 from ExternalProject to
|
||||
FetchContent (#46390)
|
||||
* GH-46499 - [CI][Crossbow][C++] Use apache/arrow for Meson
|
||||
(#46501)
|
||||
* GH-46508 - [C++] Upgrade OpenTelemetry cpp to avoid build error
|
||||
on recent Clang (#46509)
|
||||
* GH-46522 - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver
|
||||
(#40939)
|
||||
* GH-46529 - [C++] Convert static inline type trait functions to
|
||||
constexpr (#46559)
|
||||
* GH-46537 - [Docs][C++] Add RunEndEncodedArray, FlatArray, and
|
||||
PrimitiveArray API Docs (#46540)
|
||||
* GH-46551 - [C++] Use std::string_view for type schema API
|
||||
(#46553)
|
||||
* GH-46633 - [Docs][C++][Python] Update CombineChunks
|
||||
documentation to specify that binary columns can be combined
|
||||
into multiple chunks (#46638)
|
||||
* GH-46665 - [CI][Crossbow][C++] Use apache/arrow for Alpine
|
||||
Linux (#46666)
|
||||
* GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST
|
||||
data as LargeList directly (#46678)
|
||||
* GH-46679 - [C++][Meson] Use WrapDB entry for gflags instead of
|
||||
CMake wrapper (#46680)
|
||||
* GH-46683 - [C++][Python] Add utf8_zero_fill compute function
|
||||
for sign-aware zero padding (#46815)
|
||||
* GH-46714 - [C++] Use hidden symbol visibility in Meson
|
||||
configuration (#46715)
|
||||
* GH-46740 - [C++] Update bundled Thrift
|
||||
* GH-46745 - [C++] Update bundled Boost to 1.88.0 and Apache
|
||||
Thrift to 0.22.0 (#46912)
|
||||
* GH-46746 - [C++] Assume AWS SDK >= 1.11.0 (#46742)
|
||||
* GH-46748 - [C++] Initial port on AIX (#46749)
|
||||
* GH-46767 - [C++] Enable EqualOptions::use_atol_ for
|
||||
arrow::Array, arrow::Scalar, arrow::RecordBatch, and
|
||||
arrow::ChuckedArray (#46779)
|
||||
* GH-46771 - [Python][C++] Implement pa.arange function to
|
||||
generate array sequences (#46778)
|
||||
* GH-46785 - [CI][Dev][C++] Suppress needless outputs of cpplint
|
||||
with pre-commit (#46786)
|
||||
* GH-46788 - [C++][Parquet] Enable SIMD for byte stream split
|
||||
with 2 streams (#46789)
|
||||
* GH-46791 - [C++] Add Status::OrElse, IntoStatus<T> and ToStatus
|
||||
(#46792)
|
||||
* GH-46843 - [C++] Don't use unity build for bundled AWS SDK for
|
||||
C++ (#46845)
|
||||
* GH-46864 - [C++] Add half-float test for ArrayFromJSONString
|
||||
(#46865)
|
||||
* GH-46869 - [C++][Parquet] Deprecate arrow::Status
|
||||
parquet::arrow::FileReadeder::GetRecordBatchReader() (#46932)
|
||||
* GH-47025 - [C++][Docs] Increase minimum gcc for building from
|
||||
7.1 to 9 (#47026)
|
||||
- Drop apache-arrow-19.0.1-mimalloc-version.patch
|
||||
|
||||
-------------------------------------------------------------------
|
||||
Fri Jun 13 18:22:55 UTC 2025 - Ben Greiner <code@bnavigator.de>
|
||||
|
||||
|
||||
@@ -1,7 +1,7 @@
|
||||
#
|
||||
# spec file for package apache-arrow
|
||||
#
|
||||
# Copyright (c) 2025 SUSE LLC and contributors
|
||||
# Copyright (c) 2025 SUSE LLC
|
||||
#
|
||||
# All modifications and additions to the file contributed by third parties
|
||||
# remain the property of their copyright owners, unless otherwise agreed
|
||||
@@ -29,15 +29,16 @@
|
||||
%define gccver 13
|
||||
%endif
|
||||
|
||||
%define sonum 2100
|
||||
%define sonum 2000
|
||||
# See git submodule /testing pointing to the correct revision
|
||||
%define arrow_testing_commit fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06
|
||||
%define arrow_testing_commit d2a13712303498963395318a4eb42872e66aead7
|
||||
# See git submodule /cpp/submodules/parquet-testing pointing to the correct revision
|
||||
%define parquet_testing_commit 18d17540097fca7c40be3d42c167e6bfad90763c
|
||||
# See cpp/thirdparty/versions.txt, replace by BuildRequires: pkgconfig(mimalloc) as soon as gh#apache/arrow#42211 is resolved
|
||||
%define arrow_mimalloc_build_version v2.2.4
|
||||
# mimalloc version bumped, see Patch100
|
||||
%define arrow_mimalloc_build_version v2.0.9
|
||||
Name: apache-arrow
|
||||
Version: 21.0.0
|
||||
Version: 20.0.0
|
||||
Release: 0
|
||||
Summary: A development platform for in-memory data
|
||||
License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
|
||||
@@ -48,8 +49,7 @@ Source0: https://github.com/apache/arrow/archive/apache-arrow-%{version}.
|
||||
Source1: https://github.com/apache/arrow-testing/archive/%{arrow_testing_commit}.tar.gz#/arrow-testing-%{version}.tar.gz
|
||||
Source2: https://github.com/apache/parquet-testing/archive/%{parquet_testing_commit}.tar.gz#/parquet-testing-%{version}.tar.gz
|
||||
Source3: https://github.com/microsoft/mimalloc/archive/%{arrow_mimalloc_build_version}.tar.gz#/mimalloc-%{arrow_mimalloc_build_version}.tar.gz
|
||||
# PATCH-FIX-OPENSUSE arrow-boost-system-1.89-boo1249599.patch gh#boostorg/system#132, boo#1249599
|
||||
Patch1: arrow-boost-system-1.89-boo1249599.patch
|
||||
Patch100: apache-arrow-19.0.1-mimalloc-version.patch
|
||||
BuildRequires: bison
|
||||
BuildRequires: cmake >= 3.25
|
||||
BuildRequires: fdupes
|
||||
@@ -58,11 +58,8 @@ BuildRequires: gcc%{?gccver}-c++
|
||||
BuildRequires: libboost_context-devel
|
||||
BuildRequires: libboost_date_time-devel
|
||||
BuildRequires: libboost_filesystem-devel
|
||||
BuildRequires: libboost_headers-devel
|
||||
BuildRequires: libboost_process-devel
|
||||
%if 0%{?suse_version} < 1699
|
||||
BuildRequires: libboost_system-devel
|
||||
%endif
|
||||
BuildRequires: libboost_system-devel >= 1.64.0
|
||||
%if %{with static}
|
||||
BuildRequires: libzstd-devel-static
|
||||
%endif
|
||||
@@ -70,27 +67,27 @@ BuildRequires: pkgconfig
|
||||
BuildRequires: python-rpm-macros
|
||||
BuildRequires: python3-base
|
||||
BuildRequires: (cmake(lz4) >= 1.10 or (pkgconfig(liblz4) >= 1.8.3 with pkgconfig(liblz4) < 1.10))
|
||||
BuildRequires: cmake(Snappy) >= 1.2.2
|
||||
BuildRequires: cmake(Snappy) >= 1.1.7
|
||||
BuildRequires: cmake(absl)
|
||||
BuildRequires: cmake(double-conversion) >= 3.1.5
|
||||
BuildRequires: cmake(re2)
|
||||
BuildRequires: pkgconfig(RapidJSON)
|
||||
BuildRequires: pkgconfig(bzip2) >= 1.0.8
|
||||
BuildRequires: pkgconfig(gflags) >= 2.2.2
|
||||
BuildRequires: pkgconfig(grpc++) >= 1.46.3
|
||||
BuildRequires: pkgconfig(libbrotlicommon) >= 1.0.9
|
||||
BuildRequires: pkgconfig(libbrotlidec) >= 1.0.9
|
||||
BuildRequires: pkgconfig(libbrotlienc) >= 1.0.9
|
||||
BuildRequires: pkgconfig(libcares) >= 1.17.2
|
||||
BuildRequires: pkgconfig(libglog) >= 0.5.0
|
||||
BuildRequires: pkgconfig(gflags) >= 2.2.0
|
||||
BuildRequires: pkgconfig(grpc++) >= 1.20.0
|
||||
BuildRequires: pkgconfig(libbrotlicommon) >= 1.0.7
|
||||
BuildRequires: pkgconfig(libbrotlidec) >= 1.0.7
|
||||
BuildRequires: pkgconfig(libbrotlienc) >= 1.0.7
|
||||
BuildRequires: pkgconfig(libcares) >= 1.15.0
|
||||
BuildRequires: pkgconfig(libglog) >= 0.3.5
|
||||
BuildRequires: pkgconfig(libopenssl)
|
||||
BuildRequires: pkgconfig(liburiparser) >= 0.9.3
|
||||
BuildRequires: pkgconfig(libutf8proc) >= 2.10.0
|
||||
BuildRequires: pkgconfig(libzstd) >= 1.5.7
|
||||
BuildRequires: pkgconfig(protobuf) >= 21.3
|
||||
BuildRequires: pkgconfig(sqlite3)
|
||||
BuildRequires: pkgconfig(thrift) >= 0.22.0
|
||||
BuildRequires: pkgconfig(zlib) >= 1.3.1
|
||||
BuildRequires: pkgconfig(libutf8proc)
|
||||
BuildRequires: pkgconfig(libzstd) >= 1.4.3
|
||||
BuildRequires: pkgconfig(protobuf) >= 3.7.1
|
||||
BuildRequires: pkgconfig(sqlite3) >= 3.45.2
|
||||
BuildRequires: pkgconfig(thrift) >= 0.11.0
|
||||
BuildRequires: pkgconfig(zlib) >= 1.2.11
|
||||
%if %{with tests}
|
||||
BuildRequires: timezone
|
||||
BuildRequires: pkgconfig(gmock) >= 1.10
|
||||
@@ -133,20 +130,6 @@ communication.
|
||||
|
||||
This package provides the shared library for the Acero streaming execution engine
|
||||
|
||||
%package -n libarrow_compute%{sonum}
|
||||
Summary: Development platform for in-memory data - shared library
|
||||
Group: System/Libraries
|
||||
|
||||
%description -n libarrow_compute%{sonum}
|
||||
Apache Arrow is a cross-language development platform for in-memory
|
||||
data. It specifies a standardized language-independent columnar memory
|
||||
format for flat and hierarchical data, organized for efficient
|
||||
analytic operations on modern hardware. It also provides computational
|
||||
libraries and zero-copy streaming messaging and interprocess
|
||||
communication.
|
||||
|
||||
This package provides the shared library for the C++ Compute module
|
||||
|
||||
%package -n libarrow_flight%{sonum}
|
||||
Summary: Development platform for in-memory data - shared library
|
||||
Group: System/Libraries
|
||||
@@ -208,7 +191,6 @@ Summary: Development platform for in-memory data - development files
|
||||
Group: Development/Libraries/C and C++
|
||||
Requires: libarrow%{sonum} = %{version}
|
||||
Requires: libarrow_acero%{sonum} = %{version}
|
||||
Requires: libarrow_compute%{sonum} = %{version}
|
||||
Requires: libarrow_dataset%{sonum} = %{version}
|
||||
%if %{with flight}
|
||||
Requires: libarrow_flight%{sonum} = %{version}
|
||||
@@ -217,7 +199,6 @@ Requires: libarrow_flight_sql%{sonum} = %{version}
|
||||
%if %{with static}
|
||||
Suggests: %{name}-devel-static = %{version}
|
||||
Suggests: %{name}-acero-devel-static = %{version}
|
||||
Suggests: %{name}-compute-devel-static = %{version}
|
||||
Suggests: %{name}-dataset-devel-static = %{version}
|
||||
%if %{with flight}
|
||||
Suggests: %{name}-flight-devel-static = %{version}
|
||||
@@ -267,21 +248,6 @@ communication.
|
||||
|
||||
This package provides the static library for the Acero streaming execution engine
|
||||
|
||||
%package compute-devel-static
|
||||
Summary: Development platform for in-memory data - development files
|
||||
Group: Development/Libraries/C and C++
|
||||
Requires: %{name}-devel = %{version}
|
||||
|
||||
%description compute-devel-static
|
||||
Apache Arrow is a cross-language development platform for in-memory
|
||||
data. It specifies a standardized language-independent columnar memory
|
||||
format for flat and hierarchical data, organized for efficient
|
||||
analytic operations on modern hardware. It also provides computational
|
||||
libraries and zero-copy streaming messaging and interprocess
|
||||
communication.
|
||||
|
||||
This package provides the static library for the C++ Compute module
|
||||
|
||||
%package flight-devel-static
|
||||
Summary: Development platform for in-memory data - development files
|
||||
Group: Development/Libraries/C and C++
|
||||
@@ -377,9 +343,7 @@ This package provides utilities for working with the Parquet format.
|
||||
|
||||
%prep
|
||||
%setup -q -n arrow-apache-arrow-%{version} -a1 -a2
|
||||
%if 0%{?suse_version} >= 1699
|
||||
%patch -P1 -p1
|
||||
%endif
|
||||
%autopatch -p1
|
||||
# https://github.com/protocolbuffers/protobuf/issues/12292
|
||||
sed -i 's/find_package(Protobuf/find_package(Protobuf CONFIG/' cpp/cmake_modules/FindProtobufAlt.cmake
|
||||
|
||||
@@ -494,8 +458,6 @@ popd
|
||||
%postun -n libarrow%{sonum} -p /sbin/ldconfig
|
||||
%post -n libarrow_acero%{sonum} -p /sbin/ldconfig
|
||||
%postun -n libarrow_acero%{sonum} -p /sbin/ldconfig
|
||||
%post -n libarrow_compute%{sonum} -p /sbin/ldconfig
|
||||
%postun -n libarrow_compute%{sonum} -p /sbin/ldconfig
|
||||
%if %{with flight}
|
||||
%post -n libarrow_flight%{sonum} -p /sbin/ldconfig
|
||||
%postun -n libarrow_flight%{sonum} -p /sbin/ldconfig
|
||||
@@ -520,10 +482,6 @@ popd
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
%{_libdir}/libarrow_acero.so.*
|
||||
|
||||
%files -n libarrow_compute%{sonum}
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
%{_libdir}/libarrow_compute.so.*
|
||||
|
||||
%if %{with flight}
|
||||
%files -n libarrow_flight%{sonum}
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
@@ -549,7 +507,6 @@ popd
|
||||
%{_libdir}/cmake/Arrow*
|
||||
%{_libdir}/libarrow.so
|
||||
%{_libdir}/libarrow_acero.so
|
||||
%{_libdir}/libarrow_compute.so
|
||||
%{_libdir}/libarrow_dataset.so
|
||||
%if %{with flight}
|
||||
%{_libdir}/libarrow_flight.so
|
||||
@@ -573,10 +530,6 @@ popd
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
%{_libdir}/libarrow_acero.a
|
||||
|
||||
%files compute-devel-static
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
%{_libdir}/libarrow_compute.a
|
||||
|
||||
%files dataset-devel-static
|
||||
%license LICENSE.txt NOTICE.txt
|
||||
%{_libdir}/libarrow_dataset.a
|
||||
|
||||
@@ -1,27 +0,0 @@
|
||||
We have to tell cmake that the openSUSE packager removed the boost::system target.
|
||||
The headers subpackage provides the necessary Boost:system header file.
|
||||
|
||||
diff -ur --no-dereference arrow-apache-arrow-21.0.0.orig/cpp/cmake_modules/ThirdpartyToolchain.cmake arrow-apache-arrow-21.0.0/cpp/cmake_modules/ThirdpartyToolchain.cmake
|
||||
--- arrow-apache-arrow-21.0.0.orig/cpp/cmake_modules/ThirdpartyToolchain.cmake 2025-07-11 09:44:45.000000000 +0200
|
||||
+++ arrow-apache-arrow-21.0.0/cpp/cmake_modules/ThirdpartyToolchain.cmake 2025-09-26 20:53:58.409119646 +0200
|
||||
@@ -1259,7 +1259,7 @@
|
||||
set(Boost_USE_STATIC_LIBS ON)
|
||||
endif()
|
||||
if(ARROW_BOOST_REQUIRE_LIBRARY)
|
||||
- set(ARROW_BOOST_COMPONENTS filesystem system)
|
||||
+ set(ARROW_BOOST_COMPONENTS filesystem)
|
||||
if(ARROW_FLIGHT_SQL_ODBC AND MSVC)
|
||||
list(APPEND ARROW_BOOST_COMPONENTS locale)
|
||||
endif()
|
||||
diff -ur --no-dereference arrow-apache-arrow-21.0.0.orig/cpp/src/arrow/io/CMakeLists.txt arrow-apache-arrow-21.0.0/cpp/src/arrow/io/CMakeLists.txt
|
||||
--- arrow-apache-arrow-21.0.0.orig/cpp/src/arrow/io/CMakeLists.txt 2025-07-11 09:44:45.000000000 +0200
|
||||
+++ arrow-apache-arrow-21.0.0/cpp/src/arrow/io/CMakeLists.txt 2025-09-26 20:53:51.229519926 +0200
|
||||
@@ -30,7 +30,7 @@
|
||||
EXTRA_LINK_LIBS
|
||||
arrow::hadoop
|
||||
Boost::filesystem
|
||||
- Boost::system)
|
||||
+ Boost::headers)
|
||||
endif()
|
||||
|
||||
add_arrow_test(memory_test PREFIX "arrow-io")
|
||||
3
arrow-testing-20.0.0.tar.gz
Normal file
3
arrow-testing-20.0.0.tar.gz
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:9cca062005e329a6a60a30e28f509f5f4bd12384035b64fcaab19a5a46343cc1
|
||||
size 3572581
|
||||
@@ -1,3 +0,0 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3fa7b34468636ff1642c5c3fdf67d8f86ae4bff283c5185a6a986d623bab1d19
|
||||
size 3588150
|
||||
3
mimalloc-v2.0.9.tar.gz
Normal file
3
mimalloc-v2.0.9.tar.gz
Normal file
@@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4a29edae32a914a706715e2ac8e7e4109e25353212edeed0888f4e3e15db5850
|
||||
size 1143452
|
||||
@@ -1,3 +0,0 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:754a98de5e2912fddbeaf24830f982b4540992f1bab4a0a8796ee118e0752bda
|
||||
size 1295861
|
||||
@@ -1,115 +1,3 @@
|
||||
-------------------------------------------------------------------
|
||||
Thu Sep 25 10:25:07 UTC 2025 - Ben Greiner <code@bnavigator.de>
|
||||
|
||||
- Update to 21.0.0
|
||||
## Bug Fixes
|
||||
* GH-44366 - [Python][Acero] RecordBatch.filter on expression
|
||||
raises error if result set is empty (#46057)
|
||||
* GH-45292 - [Python] test_dtypes hypotesis test fails
|
||||
sporadically (#46029)
|
||||
* GH-46080 - [Python][Docs] Provide guidance for tzdata related
|
||||
issues if installing with pip (#46591)
|
||||
* GH-46121 - [Python] Add missing column_index argument to
|
||||
ArrowReaderProperties::read_dictionary's Cython binding
|
||||
(#46122)
|
||||
* GH-46174 - [Python] Failing tests in python minimal builds
|
||||
(#46175)
|
||||
* GH-46238 - [Release][Python] Use array to avoid empty argument
|
||||
in dev/release/post-11-python.sh (#46239)
|
||||
* GH-46343 - [CI][Python] Remove workaround for gdb packaging
|
||||
issue (#46848)
|
||||
* GH-46344 - [CI][Python] Skip doctest for s3.get_file_info to
|
||||
avoid bucket restrictions (#46345)
|
||||
* GH-46355 - [Python] Fix table.to_struct_array with an empty
|
||||
table (#46357)
|
||||
* GH-46481 - [C++][Python] Allow nullable schema in FlightInfo
|
||||
(#46489)
|
||||
* GH-46516 - [CI][Python] Force Cython>3.1.1 for docs builds
|
||||
(#46770)
|
||||
* GH-46606 - [Python] Do not require numpy when normalizing slice
|
||||
(#46732)
|
||||
* GH-46611 - [Python][C++] Allow building float16 arrays without
|
||||
numpy (#46618)
|
||||
* GH-46729 - [Python] Allow constructing InMemoryDataset from
|
||||
RecordBatchReader (#46731)
|
||||
* GH-46811 - [C++][Python] Fix crash on
|
||||
FileReaderImpl::GetRecordBatchReader (#46931)
|
||||
## New Features and Improvements
|
||||
* GH-26818 - [C++][Python] Preserve order when writing dataset
|
||||
multi-threaded (#44470)
|
||||
* GH-38914 - [Python] Add
|
||||
EncryptionConfiguration.uniform_encryption (#46347)
|
||||
* GH-39294 - [C++][Python] DLPack on Tensor class (#42118)
|
||||
* GH-40754 - [Python] Expose tls_ca_file_path to S3FileSystem
|
||||
(#45881)
|
||||
* GH-41496 - [Python][Azure][Docs] Turn on azure on debian-docs
|
||||
(#46892)
|
||||
* GH-41672 - [Python][Doc] Clarify docstring of
|
||||
FixedSizeListArray.values that it ignores the offset (#46144)
|
||||
* GH-42012 - [Python] Add Schema with_field or set_field method
|
||||
(#46348)
|
||||
* GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as
|
||||
Large/View types directly (#46532)
|
||||
* GH-43807 - [C++][Python] Add UUID extension type conversion
|
||||
support to/from Parquet (#45866)
|
||||
* GH-44500 - [Python][Parquet] Map Parquet logical types to Arrow
|
||||
extension types by default (#46772)
|
||||
* GH-44900 - [Python] Support explicit fsspec+{protocol} and
|
||||
hf:// filesystem URIs (#45089)
|
||||
* GH-45229 - [Python] Migrate from scipy.spmatrix to
|
||||
scipy.sparray (#46423)
|
||||
* GH-45229 - [Python] skip scipy.sparse roundtrip tests for
|
||||
float16 (#46413)
|
||||
* GH-45531 - [Python] Add the dim_names argument to
|
||||
from_numpy_ndarray (#46170)
|
||||
* GH-45619 - [Python] Use f-string instead of string.format
|
||||
(#45629)
|
||||
* GH-45653 - [Python] Scalar subclasses should implement Python
|
||||
protocols (#45818)
|
||||
* GH-45750 - [C++][Python][Parquet] Implement Content-Defined
|
||||
Chunking for the Parquet writer (#45360)
|
||||
* GH-45957 - [C++][Python] Expose allow_delayed_open on
|
||||
S3FileSystem (#46078)
|
||||
* GH-46019 - [Python] Raise TypeError on feather read_table if
|
||||
columns is not a Sequence (#46038)
|
||||
* GH-46054 - [Python][Packaging] Re-enable pandas on Windows
|
||||
free-threaded wheel (#46109)
|
||||
* GH-46058 - [Python] Run Python in AppVeyor outside of source
|
||||
directory (#46059)
|
||||
* GH-46130 - [Python] Remove use_legacy_format in favour of
|
||||
setting IpcWriteOptions (#46131)
|
||||
* GH-46198 - [Python] Remove deprecated PyExtensionType (#46199)
|
||||
* GH-46222 - [Python] Allow to specify footer metadata when
|
||||
opening IPC file for writing (#46354)
|
||||
* GH-46349 - [Python] Move parquet definitions to
|
||||
pyarrow/includes/libparquet.pxd (#46437)
|
||||
* GH-46373 - [Python] Exercise fallback case on tests for
|
||||
parquet.read_table in case dataset is not available (#46550)
|
||||
* GH-46544 - [CI][Dev][Python] Use pre-commit for autopep8
|
||||
(#46552)
|
||||
* GH-46545 - [CI][Dev][Python] Update pre-commit for cython-lint
|
||||
(#46580)
|
||||
* GH-46546 - [CI][Dev][Python] Use pre-commit for numpydoc
|
||||
(#46595)
|
||||
* GH-46572 - [Python] expose filter option to python for join
|
||||
(#46566)
|
||||
* GH-46633 - [Docs][C++][Python] Update CombineChunks
|
||||
documentation to specify that binary columns can be combined
|
||||
into multiple chunks (#46638)
|
||||
* GH-46652 - [Python][Docs] Update language for row_group_size
|
||||
parameter (#46653)
|
||||
* GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST
|
||||
data as LargeList directly (#46678)
|
||||
* GH-46683 - [C++][Python] Add utf8_zero_fill compute function
|
||||
for sign-aware zero padding (#46815)
|
||||
* GH-46771 - [Python][C++] Implement pa.arange function to
|
||||
generate array sequences (#46778)
|
||||
* GH-46833 - [Python] Expose ConfigureManagedIdentityCredential
|
||||
and ConfigureClientSecretCredential to AzureFileSystem on
|
||||
PyArrow (#46837)
|
||||
* GH-46959 - [Python][Packaging] Drop support for manylinux2014
|
||||
(#46965)
|
||||
|
||||
-------------------------------------------------------------------
|
||||
Fri Jun 13 18:22:38 UTC 2025 - Ben Greiner <code@bnavigator.de>
|
||||
|
||||
|
||||
@@ -1,7 +1,7 @@
|
||||
#
|
||||
# spec file for package python-pyarrow
|
||||
#
|
||||
# Copyright (c) 2025 SUSE LLC and contributors
|
||||
# Copyright (c) 2025 SUSE LLC
|
||||
#
|
||||
# All modifications and additions to the file contributed by third parties
|
||||
# remain the property of their copyright owners, unless otherwise agreed
|
||||
@@ -20,7 +20,7 @@
|
||||
%bcond_with xsimd
|
||||
%define plainpython python
|
||||
# See git submodule /testing pointing to the correct revision
|
||||
%define arrow_testing_commit fbf6b703dc93d17d75fa3664c5aa2c7873ebaf06
|
||||
%define arrow_testing_commit d2a13712303498963395318a4eb42872e66aead7
|
||||
# See git submodule /cpp/submodules/parquet-testing pointing to the correct revision
|
||||
%define parquet_testing_commit 18d17540097fca7c40be3d42c167e6bfad90763c
|
||||
%if %{suse_version} <= 1500
|
||||
@@ -28,7 +28,7 @@
|
||||
%define gccver 13
|
||||
%endif
|
||||
Name: python-pyarrow
|
||||
Version: 21.0.0
|
||||
Version: 20.0.0
|
||||
Release: 0
|
||||
Summary: Python library for Apache Arrow
|
||||
License: Apache-2.0 AND BSD-3-Clause AND BSD-2-Clause AND MIT
|
||||
@@ -39,7 +39,7 @@ Source1: arrow-testing-%{version}.tar.gz
|
||||
Source2: parquet-testing-%{version}.tar.gz
|
||||
Source99: python-pyarrow.rpmlintrc
|
||||
BuildRequires: %{python_module Cython >= 3}
|
||||
BuildRequires: %{python_module devel >= 3.9}
|
||||
BuildRequires: %{python_module devel >= 3.8}
|
||||
BuildRequires: %{python_module numpy-devel >= 1.25}
|
||||
BuildRequires: %{python_module pip}
|
||||
BuildRequires: %{python_module setuptools_scm}
|
||||
|
||||
Reference in New Issue
Block a user