Commit Graph

52 Commits

Author SHA256 Message Date
Dominique Leuenberger
e55c331cb3 Accepting request 1083474 from devel:languages:python:numeric
- update to 2023.4.0:
  * allow loading categoricals even if not so in the pandas metadata,
    when a column is dict-encoded and we only have one row-group (#863)
  * apply dtype to the columns names series, even when selecting no
    columns (#861, 859)
  * don't make strings while estimating byte column size (#858)
 * handle upstream depr (#857, 856)

OBS-URL: https://build.opensuse.org/request/show/1083474
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=27
2023-04-28 14:24:14 +00:00
f1d61b0abc - update to 2023.4.0:
  * allow loading categoricals even if not so in the pandas metadata,
    when a column is dict-encoded and we only have one row-group (#863)
  * apply dtype to the columns names series, even when selecting no
    columns (#861, 859)
  * don't make strings while estimating byte column size (#858)
 * handle upstream depr (#857, 856)

OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=48
2023-04-28 08:11:58 +00:00
Dominique Leuenberger
208da3606a Accepting request 1064743 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/1064743
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=26
2023-02-13 15:40:51 +00:00
42fa3a1c16 Accepting request 1064736 from home:apersaud:branches:devel:languages:python:numeric
update to latest version

OBS-URL: https://build.opensuse.org/request/show/1064736
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=46
2023-02-12 22:54:14 +00:00
Dominique Leuenberger
56cbfb5c18 Accepting request 1046424 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/1046424
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=25
2023-01-03 14:05:41 +00:00
8b4bcd5004 Accepting request 1046313 from home:bnavigator:branches:devel:languages:python:numeric
- Update to 2022.12.0
  * check all int32 values before passing to thrift writer
  * fix type of num_rows to i64 for big single file
- Release 2022.11.0
  * Switch to calver
  * Speed up loading of nullable types
  * Allow schema evolution by addition of columns
  * Allow specifying dtypes of output
  * update to scm versioning
  * fixes to row filter, statistics and tests
  * support pathlib.Paths
  * JSON encoder options
- Drop fastparquet-pr813-updatefixes.patch
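The int32 check listed under 2022.12.0 can be pictured as a range validation performed before serialization. The following is a minimal sketch only, assuming a hypothetical helper `check_int32` and hypothetical field names; it is not fastparquet's code. It also illustrates why a row count for a large single file needs a 64-bit field.

```python
# Bounds of a signed 32-bit integer field.
INT32_MIN = -(2 ** 31)
INT32_MAX = 2 ** 31 - 1


def check_int32(name, value):
    """Return `value` if it fits a signed 32-bit field, else raise (hypothetical helper)."""
    if not INT32_MIN <= value <= INT32_MAX:
        raise OverflowError(f"{name}={value} does not fit in int32")
    return value


# A small value passes through unchanged.
ok = check_int32("page_size", 1_000_000)

# Three billion rows exceed the int32 range, so the check rejects it.
try:
    check_int32("num_rows", 3_000_000_000)
    overflowed = False
except OverflowError:
    overflowed = True
```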

OBS-URL: https://build.opensuse.org/request/show/1046313
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=44
2023-01-03 07:37:17 +00:00
Dominique Leuenberger
0c11a78ae2 Accepting request 1045047 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/1045047
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=24
2022-12-24 13:52:02 +00:00
68c0cc4bdd Accepting request 1044387 from home:Guillaume_G:fastparquet
- Add patch to fix the test test_delta_from_def_2 on
  aarch64, armv7 and ppc64le:
  * fastparquet-pr835.patch

OBS-URL: https://build.opensuse.org/request/show/1044387
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=42
2022-12-23 16:12:14 +00:00
Dominique Leuenberger
83c6b584ec Accepting request 1032429 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/1032429
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=23
2022-10-31 12:04:24 +00:00
4d8484644b Accepting request 1032127 from home:bnavigator:branches:devel:languages:python:numeric
- Update to 0.8.3
  * improved key/value handling and rejection of bad types
  * fix regression in consolidate_cats (caught in dask tests)
- Release 0.8.2
  * datetime indexes initialised to 0 to prevent overflow from
    random memory
  * case from csv_to_parquet where stats exists but has no nulls
    entry
  * define len and bool for ParquetFile
  * maintain int types of optional data that came from pandas
  * fix for delta encoding
- Add fastparquet-pr813-updatefixes.patch gh#dask/fastparquet#813

OBS-URL: https://build.opensuse.org/request/show/1032127
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=40
2022-10-31 09:54:02 +00:00
Dominique Leuenberger
94c8acf34e Accepting request 972913 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/972913
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=22
2022-04-26 18:16:10 +00:00
6a03a42003 Accepting request 972857 from home:bnavigator:branches:devel:languages:python:numeric
- Update to 0.8.1
  * fix critical buffer overflow crash for large number of columns
    and long column names
  * metadata handling
  * thrift int32 for list
  * avoid error storing NaNs in column stats

OBS-URL: https://build.opensuse.org/request/show/972857
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=38
2022-04-26 14:21:38 +00:00
Dominique Leuenberger
c52b2a2386 Accepting request 950305 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/950305
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=21
2022-02-02 21:40:06 +00:00
59c9a3b022 Accepting request 950136 from home:bnavigator:branches:devel:languages:python:numeric
- Update to 0.8.0
  * our own cythonic thrift implementation (drop thrift dependency)
  * more in-place dataset editing and reordering
  * python 3.10 support
  * fixes for multi-index and pandas types
- Clean test skips

OBS-URL: https://build.opensuse.org/request/show/950136
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=37
2022-01-31 20:28:31 +00:00
Dominique Leuenberger
f5764d3de9 Accepting request 946883 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/946883
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=20
2022-01-17 21:34:09 +00:00
c9d6b4d9cf Accepting request 946801 from home:bnavigator:branches:devel:languages:python:numeric
- Clean specfile from unused python36 conditionals
- Require thrift 0.15.0 (+patch) for Python 3.10 compatibility
  * gh#dask/fastparquet#514

OBS-URL: https://build.opensuse.org/request/show/946801
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=36
2022-01-17 06:31:02 +00:00
Dominique Leuenberger
da6ce543e6 Accepting request 934419 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/934419
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=19
2021-11-28 20:30:22 +00:00
dc8c5ab00b Accepting request 934308 from home:apersaud:branches:devel:languages:python:numeric
- still some failed builds, but they are also in the current package (and I don't know how to fix them)

- update to version 0.7.2:
  * Ability to remove row-groups in-place for multifile datasets
  * Accept pandas nullable Float type
  * allow empty strings and fix min/max when there is no data
  * make writing statistics optional
  * row selection in to_pandas()

OBS-URL: https://build.opensuse.org/request/show/934308
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=35
2021-11-28 19:15:14 +00:00
Richard Brown
58f5dcc434 Accepting request 911011 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/911011
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=18
2021-08-12 07:01:23 +00:00
4cca0bf52a Accepting request 910725 from home:bnavigator:branches:devel:languages:python:numeric
- Update to version 0.7.1
  * Back compile for older versions of numpy
  * Make pandas nullable types opt-out. The old behaviour (casting
    to float) is still available with ParquetFile(...,
    pandas_nulls=False).
  * Fix time field regression: IsAdjustedToUTC will be False when
    there is no timezone
  * Micro improvements to the speed of ParquetFile creation by
    using simple string ops instead of regex and
    regularising filenames once at the start. Affects datasets with
    many files.
- Release 0.7.0
  * This version institutes major, breaking changes, listed here,
    and incremental fixes and additions.
  * Reading a directory without a _metadata summary file now works
    by providing only the directory, instead of a list of
    constituent files. This change also makes direct use of
    fsspec filesystems, if given, to load the footer
    metadata areas of the files concurrently, if the storage
    backend supports it, without directly instantiating
    intermediate ParquetFile instances.
  * row-level filtering of the data. Whereas previously, only full 
    row-groups could be excluded on the basis of their parquet 
    metadata statistics (if present), filtering can now be done 
    within row-groups too. The syntax is the same as before, 
    allowing for multiple column expressions to be combined with 
    AND|OR, depending on the list structure. This mechanism 
    requires two passes: one to load the columns needed to create 
    the boolean mask, and another to load the columns actually 
    needed in the output. This will not be faster, and may be 
    slower, but in some cases can save significant memory 
    footprint, if a small fraction of rows are considered good and 
    the columns for the filter expression are not in the output. 
    Not currently supported for reading with DataPageV2.
  * DELTA integer encoding (read-only): experimentally working, 
    but we only have one test file to verify against, since it is 
    not trivial to persuade Spark to produce files encoded this 
    way. DELTA can be an extremely compact representation for
    slowly varying and/or monotonically increasing integers.
  * nanosecond resolution times: the new extended "logical" types 
    system supports nanoseconds alongside the previous millis and 
    micros. We now emit these for the default pandas time type, 
    and produce full parquet schema including both "converted" and 
    "logical" type information. Note that all output has 
    isAdjustedToUTC=True, i.e., these are timestamps rather than 
    local time. The time-zone is stored in the metadata, as 
    before, and will be successfully recreated only in fastparquet 
    and (py)arrow. Otherwise, the times will appear to be UTC. For 
    compatibility with Spark, you may still want to use 
    times="int96" when writing.
  * DataPageV2 writing: now we support both reading and writing. 
    For writing, can be enabled with the environment variable 
    FASTPARQUET_DATAPAGE_V2, or module global
    fastparquet.writer.DATAPAGE_VERSION, and is off by default. It will become on by
    default in the future. In many cases, V2 will result in better 
    read performance, because the data and page headers are 
    encoded separately, so data can be directly read into the 
    output without additional allocation/copies. This feature is
    considered experimental, but we believe it is working well for
    most use cases (i.e., our test suite) and should be readable 
    by all modern parquet frameworks including arrow and spark.
  * pandas nullable types: pandas supports "masked" extension 
    arrays for types that previously could not support NULL at 
    all: ints and bools. Fastparquet used to cast such columns to 
    float, so that we could represent NULLs as NaN; now we use the 
    new(er) masked types by default. This means faster reading of 
    such columns, as there is no conversion. If the metadata 
    guarantees that there are no nulls, we still use the 
    non-nullable variant unless the data was written with 
    fastparquet/pyarrow, and the metadata indicates that the 
    original datatype was nullable. We already handled writing of 
    nullable columns.
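
The row-level filtering described in the 0.7.0 notes can be sketched in plain Python. This illustrates only the two-pass idea and the AND|OR list structure (inner lists are AND-ed, the outer list is OR-ed); it is not fastparquet's implementation. The helper names `row_mask` and `filter_rows`, the operator subset, and the dict-of-lists table are all hypothetical.

```python
import operator

# Comparison operators accepted in this sketch (hypothetical subset).
OPS = {
    "==": operator.eq,
    "!=": operator.ne,
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "in": lambda value, choices: value in choices,
    "not in": lambda value, choices: value not in choices,
}


def row_mask(columns, filters, num_rows):
    """Pass 1: build a boolean mask using only the filter columns.

    `filters` is a list of lists of (column, op, value) tuples:
    conditions in an inner list are AND-ed, inner lists are OR-ed.
    """
    return [
        any(
            all(OPS[op](columns[col][i], val) for col, op, val in group)
            for group in filters
        )
        for i in range(num_rows)
    ]


def filter_rows(table, filters, output_columns):
    """Pass 2: read only the requested output columns, keeping masked rows."""
    num_rows = len(next(iter(table.values())))
    needed = {col for group in filters for col, _, _ in group}
    mask = row_mask({c: table[c] for c in needed}, filters, num_rows)
    return {
        c: [v for v, keep in zip(table[c], mask) if keep]
        for c in output_columns
    }


table = {
    "id": [1, 2, 3, 4],
    "score": [0.1, 0.9, 0.5, 0.7],
    "label": ["a", "b", "a", "c"],
}
# (score > 0.6 AND label != "c") OR (id == 1)
result = filter_rows(
    table,
    [[("score", ">", 0.6), ("label", "!=", "c")], [("id", "==", 1)]],
    output_columns=["id", "label"],
)
```

In this sketch the `score` column is read for the mask but never materialised in the output, which is the case where the notes say the two-pass approach can save memory.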

OBS-URL: https://build.opensuse.org/request/show/910725
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=34
2021-08-09 13:21:06 +00:00
Dominique Leuenberger
2ff07b46b5 Accepting request 894287 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/894287
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=17
2021-05-19 15:49:29 +00:00
9dfe59d4e3 Accepting request 894265 from home:bnavigator:branches:devel:languages:python:numeric
- Update to version 0.6.3
  * no release notes
  * new requirement: cramjam instead of separate compression libs
    and their bindings
  * switch from numba to Cython

OBS-URL: https://build.opensuse.org/request/show/894265
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=32
2021-05-19 10:12:11 +00:00
Dominique Leuenberger
77fe2cc0d1 Accepting request 871464 from devel:languages:python:numeric
- skip python 36 build

OBS-URL: https://build.opensuse.org/request/show/871464
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=16
2021-02-15 22:17:01 +00:00
c13c6c472e - skip python 36 build
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=30
2021-02-12 14:50:26 +00:00
Dominique Leuenberger
0c358a6973 Accepting request 870700 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/870700
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=15
2021-02-10 20:30:34 +00:00
6eec839c0e Accepting request 869540 from home:jengelh:branches:devel:languages:python:numeric
- Use of "+=" in %check warrants bash as buildshell.

OBS-URL: https://build.opensuse.org/request/show/869540
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=28
2021-02-09 21:28:22 +00:00
Dominique Leuenberger
96551dd71b Accepting request 869528 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/869528
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=14
2021-02-04 19:24:45 +00:00
3b3a9ec57f Accepting request 869041 from home:bnavigator:branches:devel:languages:python:numeric
- Skip the import without warning test gh#dask/fastparquet#558
- Apply the Cepl-Strangelove-Parameter to pytest
  (--import-mode append)

OBS-URL: https://build.opensuse.org/request/show/869041
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=26
2021-02-04 16:34:25 +00:00
Dominique Leuenberger
95959e899b Accepting request 859938 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/859938
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=13
2021-01-03 20:26:55 +00:00
3fc19bda15 Accepting request 859934 from home:bnavigator:branches:devel:languages:python:numeric
- update to version 0.5
  * no changelog
- update test suite setup -- install the .test module

OBS-URL: https://build.opensuse.org/request/show/859934
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=24
2021-01-03 10:17:10 +00:00
Dominique Leuenberger
4e6677eb79 Accepting request 821679 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/821679
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=12
2020-07-20 19:00:37 +00:00
Todd R
e8e5e639c7 Accepting request 821674 from home:apersaud:branches:devel:languages:python:numeric
update to latest version

OBS-URL: https://build.opensuse.org/request/show/821674
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=22
2020-07-18 19:17:22 +00:00
Dominique Leuenberger
e76154c4d3 Accepting request 819826 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/819826
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=11
2020-07-10 12:13:26 +00:00
Tomáš Chvátal
4c289424b1 Accepting request 819735 from home:mcalabkova:branches:devel:languages:python:numeric
- Update to 0.4.0
  * Changed RangeIndex private methods to public ones
  * Use the python executable used to run the code
  * Add support for Python 3.8
  * support for numba > 0.48
- drop upstreamed patch use-python-exec.patch

OBS-URL: https://build.opensuse.org/request/show/819735
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=20
2020-07-09 21:09:42 +00:00
Dominique Leuenberger
a14482ff11 Accepting request 791651 from devel:languages:python:numeric
- Add patch to use sys.executable and not call py2 binary directly:
  * use-python-exec.patch

- Update to 0.3.3:
  * no upstream changelog
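
The patch itself is not shown in this log. As a general illustration of the technique it names, code that spawns a child interpreter can go through `sys.executable` instead of a hard-coded binary name. A minimal sketch, in which the helper name `run_in_same_interpreter` is hypothetical:

```python
import subprocess
import sys


def run_in_same_interpreter(code):
    """Run `code` in a child process using the interpreter running this script."""
    completed = subprocess.run(
        [sys.executable, "-c", code],  # no hard-coded "python" or "python2"
        capture_output=True,
        text=True,
        check=True,
    )
    return completed.stdout.strip()


# The child reports the same major version as the parent interpreter.
child_major = run_in_same_interpreter("import sys; print(sys.version_info[0])")
```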

OBS-URL: https://build.opensuse.org/request/show/791651
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=10
2020-04-07 08:30:03 +00:00
Tomáš Chvátal
1e90d85788 - Add patch to use sys.executable and not call py2 binary directly:
  * use-python-exec.patch

- Update to 0.3.3:
  * no upstream changelog

OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=18
2020-04-06 07:07:54 +00:00
Dominique Leuenberger
8199ac000c Accepting request 742993 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/742993
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=9
2019-10-27 12:41:13 +00:00
Todd R
b6df563592 Accepting request 742992 from home:TheBlackCat:branches:devel:languages:python:numeric
- Drop broken python 2 support.
- Testing fixes

OBS-URL: https://build.opensuse.org/request/show/742992
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=16
2019-10-25 17:52:59 +00:00
Dominique Leuenberger
feeee33acd Accepting request 720881 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/720881
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=8
2019-08-05 08:42:14 +00:00
Todd R
62d9529bea Accepting request 720816 from home:apersaud:branches:devel:languages:python:numeric
update to latest version

OBS-URL: https://build.opensuse.org/request/show/720816
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=14
2019-08-04 14:05:57 +00:00
Todd R
c46e61d62b Accepting request 719841 from home:TheBlackCat:branches:devel:languages:python:numeric
- Fix spurious test failure

OBS-URL: https://build.opensuse.org/request/show/719841
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=13
2019-07-30 14:23:51 +00:00
Dominique Leuenberger
1bc9fdb782 Accepting request 704258 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/704258
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=7
2019-05-21 08:40:17 +00:00
Tomáš Chvátal
6f19f6c0ef Accepting request 704253 from home:mcepl:branches:devel:languages:python:numeric
- Clean up SPEC file.

OBS-URL: https://build.opensuse.org/request/show/704253
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=11
2019-05-20 13:24:28 +00:00
Dominique Leuenberger
e9972a7dc4 Accepting request 699763 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/699763
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=6
2019-05-09 12:28:11 +00:00
Todd R
bc29c40d35 Accepting request 699761 from home:TheBlackCat:branches:devel:languages:python:numeric
- update to 0.3.1
  * Add schema == (__eq__) and != (__ne__) methods and tests. 
  * Fix item iteration for decimals
  * List missing columns in error message
  * Fix tz being None case
- Update to 0.3.0
  * Squash some warnings and import failures
  * Improvements to in and not in operators
  * Fixes because pandas released

OBS-URL: https://build.opensuse.org/request/show/699761
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=9
2019-04-30 20:10:30 +00:00
Dominique Leuenberger
47ad4c40f3 Accepting request 668842 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/668842
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=5
2019-01-28 19:48:35 +00:00
Tomáš Chvátal
4186890208 Accepting request 668821 from home:apersaud:branches:devel:languages:python:numeric
update to latest version

OBS-URL: https://build.opensuse.org/request/show/668821
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=7
2019-01-26 21:38:54 +00:00
Dominique Leuenberger
3111757c77 Accepting request 651329 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/651329
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=4
2018-11-26 09:29:27 +00:00
Tomáš Chvátal
dfe04380f9 Accepting request 651237 from home:apersaud:branches:devel:languages:python:numeric
update to latest version

OBS-URL: https://build.opensuse.org/request/show/651237
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-fastparquet?expand=0&rev=5
2018-11-23 08:26:16 +00:00
Dominique Leuenberger
c8e155b151 Accepting request 639321 from devel:languages:python:numeric
OBS-URL: https://build.opensuse.org/request/show/639321
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-fastparquet?expand=0&rev=3
2018-10-02 17:47:24 +00:00