- Update to 2.0.2
## Fixed regressions
* Fixed performance regression in GroupBy.apply() (GH53195)
* Fixed regression in merge() on Windows when dtype is np.intc
(GH52451)
* Fixed regression in read_sql() dropping columns with duplicated
column names (GH53117)
* Fixed regression in DataFrame.loc() losing MultiIndex name when
enlarging object (GH53053)
* Fixed regression in DataFrame.to_string() printing a backslash
at the end of the first row of data, instead of headers, when
the DataFrame doesn’t fit the line width (GH53054)
* Fixed regression in MultiIndex.join() returning levels in wrong
order (GH53093)
## Bug fixes
* Bug in arrays.ArrowExtensionArray incorrectly assigning dict
instead of list for .type with pyarrow.map_ and raising a
NotImplementedError with pyarrow.struct (GH53328)
* Bug in api.interchange.from_dataframe() was raising IndexError
on empty categorical data (GH53077)
* Bug in api.interchange.from_dataframe() was returning
DataFrame’s of incorrect sizes when called on slices (GH52824)
* Bug in api.interchange.from_dataframe() was unnecessarily
raising on bitmasks (GH49888)
* Bug in merge() when merging on datetime columns on different
resolutions (GH53200)
* Bug in read_csv() raising OverflowError for engine="pyarrow"
and parse_dates set (GH53295)
* Bug in to_datetime() was inferring format to contain "%H"
instead of "%I" if date contained “AM” / “PM” tokens (GH53147)
OBS-URL: https://build.opensuse.org/request/show/1090040
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=84
- Update to version 1.5.2
## Fixed regressions
* Fixed regression in MultiIndex.join() for extension array
dtypes (GH49277)
* Fixed regression in Series.replace() raising RecursionError
with numeric dtype and when specifying value=None (GH45725)
* Fixed regression in arithmetic operations for DataFrame with
MultiIndex columns with different dtypes (GH49769)
* Fixed regression in DataFrame.plot() preventing Colormap
instance from being passed using the colormap argument if
Matplotlib 3.6+ is used (GH49374)
* Fixed regression in date_range() returning an invalid set of
periods for CustomBusinessDay frequency and start date with
timezone (GH49441)
* Fixed performance regression in groupby operations (GH49676)
* Fixed regression in Timedelta constructor returning object of
wrong type when subclassing Timedelta (GH49579)
## Bug fixes
* Bug in the Copy-on-Write implementation losing track of views
in certain chained indexing cases (GH48996)
* Fixed memory leak in Styler.to_excel() (GH49751)
## Other
* Reverted color as an alias for c and size as an alias for s in
function DataFrame.plot.scatter() (GH49732)
- Add pandas-pr49886-fix-numpy-deprecations.patch
* gh#pandas-dev/pandas#49887
- Move to PEP518 build
OBS-URL: https://build.opensuse.org/request/show/1045082
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=78
- Update to version 1.4.0
* https://pandas.pydata.org/docs/whatsnew/v1.4.0.html
* Enhancements
- Improved warning messages
- Index can hold arbitrary ExtensionArrays
- Enhancements in Styler
- Multi-threaded CSV reading with a new CSV Engine based on
pyarrow
- Rank function for rolling and expanding windows
- Groupby positional indexing
- DataFrame.from_dict and DataFrame.to_dict have new 'tight'
option
* Notable bug fixes
- Inconsistent date string parsing
- Ignoring dtypes in concat with empty or all-NA columns
- Null-values are no longer coerced to NaN-value in
value_counts and mode
- mangle_dupe_cols in read_csv no longer renames unique columns
conflicting with target names
- unstack and pivot_table no longer raises ValueError for
result that would exceed int32 limit
- groupby.apply consistent transform detection
* API changes
- Index.get_indexer_for() no longer accepts keyword arguments
(other than target); in the past these would be silently
ignored if the index was not unique (GH42310)
- Change in the position of the min_rows argument in
DataFrame.to_string() due to change in the docstring
(GH44304)
- Reduction operations for DataFrame or Series now raising a
ValueError when None is passed for skipna (GH44178)
- read_csv() and read_html() no longer raising an error when
one of the header rows consists only of Unnamed: columns
(GH13054)
- Changed the name attribute of several holidays in
USFederalHolidayCalendar to match official federal holiday
names.
* Deprecations
- Deprecated Int64Index, UInt64Index & Float64Index
- Deprecated Frame.append and Series.append
- Split out test runs into separate flavors, optimize memory usage
in pytest-xdist runs
OBS-URL: https://build.opensuse.org/request/show/948450
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=67
- Update to version 1.3.5
* Fixed regression in Series.equals() when comparing floats with
dtype object to None (GH44190)
* Fixed regression in merge_asof() raising error when array was
supplied as join key (GH42844)
* Fixed regression when resampling DataFrame with DateTimeIndex
with empty groups and uint8, uint16 or uint32 columns
incorrectly raising RuntimeError (GH43329)
* Fixed regression in creating a DataFrame from a timezone-aware
Timestamp scalar near a Daylight Savings Time transition
(GH42505)
* Fixed performance regression in read_csv() (GH44106)
* Fixed regression in Series.duplicated() and
Series.drop_duplicates() when Series has Categorical dtype with
boolean categories (GH44351)
* Fixed regression in GroupBy.sum() with timedelta64[ns] dtype
containing NaT failing to treat that value as NA (GH42659)
* Fixed regression in RollingGroupby.cov() and
RollingGroupby.corr() when other had the same shape as each
group would incorrectly return superfluous groups in the result
(GH42915)
OBS-URL: https://build.opensuse.org/request/show/943876
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=66
- Update to version 1.3.4
* Fixed regression in DataFrame.convert_dtypes() incorrectly
converts byte strings to strings (GH43183)
* Fixed regression in GroupBy.agg() where it was failing
silently with mixed data types along axis=1 and MultiIndex (GH43209)
* Fixed regression in merge() with integer and NaN keys
failing with outer merge (GH43550)
* Fixed regression in DataFrame.corr() raising ValueError with
method="spearman" on 32-bit platforms (GH43588)
* Fixed performance regression in MultiIndex.equals() (GH43549)
* Fixed performance regression in GroupBy.first() and GroupBy.last()
with StringDtype (GH41596)
* Fixed regression in Series.cat.reorder_categories() failing to
update the categories on the Series (GH43232)
* Fixed regression in Series.cat.categories() setter failing to
update the categories on the Series (GH43334)
* Fixed regression in read_csv() raising UnicodeDecodeError exception
when memory_map=True (GH43540)
* Fixed regression in DataFrame.explode() raising AssertionError
when column is any scalar which is not a string (GH43314)
* Fixed regression in Series.aggregate() attempting to pass args
and kwargs multiple times to the user supplied func in certain cases (GH43357)
* Fixed regression when iterating over a DataFrame.groupby.rolling
object causing the resulting DataFrames to have an incorrect index if the input groupings were not sorted (GH43386)
* Fixed regression in DataFrame.groupby.rolling.cov() and
DataFrame.groupby.rolling.corr() computing incorrect results if the
input groupings were not sorted (GH43386)
* Fixed bug in pandas.DataFrame.groupby.rolling() and
pandas.api.indexers.FixedForwardWindowIndexer leading to
segfaults and window endpoints being mixed across groups (GH43267)
* Fixed bug in GroupBy.mean() with datetimelike values
including NaT values returning incorrect results (GH43132)
* Fixed bug in Series.aggregate() not passing the first args
to the user supplied func in certain cases (GH43357)
* Fixed memory leaks in Series.rolling.quantile() and
Series.rolling.median() (GH43339)
OBS-URL: https://build.opensuse.org/request/show/926551
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=65
- Update to version 1.3.3
* Fixed regression in DataFrame constructor failing to broadcast
for defined Index and len one list of Timestamp (GH42810)
* Fixed regression in GroupBy.agg() incorrectly raising in some
cases (GH42390)
* Fixed regression in GroupBy.apply() where nan values were
dropped even with dropna=False (GH43205)
* Fixed regression in GroupBy.quantile() which was failing with
pandas.NA (GH42849)
* Fixed regression in merge() where on columns with
ExtensionDtype or bool data types were cast to object in right
and outer merge (GH40073)
* Fixed regression in RangeIndex.where() and RangeIndex.putmask()
raising AssertionError when result did not represent a
RangeIndex (GH43240)
* Fixed regression in read_parquet() where the fastparquet engine
would not work properly with fastparquet 0.7.0 (GH43075)
* Fixed regression in DataFrame.loc.__setitem__() raising
ValueError when setting array as cell value (GH43422)
* Fixed regression in is_list_like() where objects with __iter__
set to None would be identified as iterable (GH43373)
* Fixed regression in DataFrame.__getitem__() raising error for
slice of DatetimeIndex when index is non monotonic (GH43223)
* Fixed regression in Resampler.aggregate() when used after
column selection would raise if func is a list of aggregation
functions (GH42905)
* Fixed regression in DataFrame.corr() where Kendall correlation
would produce incorrect results for columns with repeated
values (GH43401)
* Fixed regression in DataFrame.groupby() where aggregation on
columns with object types dropped results on those columns
(GH42395, GH43108)
* Fixed regression in Series.fillna() raising TypeError when
filling float Series with list-like fill value having a dtype
which couldn’t cast lostlessly (like float32 filled with
float64) (GH43424)
* Fixed regression in read_csv() raising AttributeError when the
file handle is an tempfile.SpooledTemporaryFile object
(GH43439)
* Fixed performance regression in core.window.ewm.
ExponentialMovingWindow.mean() (GH42333)
* Performance improvement for DataFrame.__setitem__() when the
key or value is not a DataFrame, or key is not list-like
(GH43274)
* Fixed bug in DataFrameGroupBy.agg() and DataFrameGroupBy.
transform() with engine="numba" where index data was not being
correctly passed into func (GH43133)
- Release 1.3.2
* Performance regression in DataFrame.isin() and Series.isin()
for nullable data types (GH42714)
* Regression in updating values of Series using boolean index,
created by using DataFrame.pop() (GH42530)
* Regression in DataFrame.from_records() with empty records
(GH42456)
* Fixed regression in DataFrame.shift() where TypeError occurred
when shifting DataFrame created by concatenation of slices and
fills with values (GH42719)
* Regression in DataFrame.agg() when the func argument returned
lists and axis=1 (GH42727)
* Regression in DataFrame.drop() does nothing if MultiIndex has
duplicates and indexer is a tuple or list of tuples (GH42771)
* Fixed regression where read_csv() raised a ValueError when
parameters names and prefix were both set to None (GH42387)
* Fixed regression in comparisons between Timestamp object and
datetime64 objects outside the implementation bounds for
nanosecond datetime64 (GH42794)
* Fixed regression in Styler.highlight_min() and Styler.
highlight_max() where pandas.NA was not successfully ignored
(GH42650)
* Fixed regression in concat() where copy=False was not honored
in axis=1 Series concatenation (GH42501)
* Regression in Series.nlargest() and Series.nsmallest() with
nullable integer or float dtype (GH42816)
* Fixed regression in Series.quantile() with Int64Dtype (GH42626)
* Fixed regression in Series.groupby() and DataFrame.groupby()
where supplying the by argument with a Series named with a
tuple would incorrectly raise (GH42731)
* Bug in read_excel() modifies the dtypes dictionary when reading
a file with duplicate columns (GH42462)
* 1D slices over extension types turn into N-dimensional slices
over ExtensionArrays (GH42430)
* Fixed bug in Series.rolling() and DataFrame.rolling() not
calculating window bounds correctly for the first row when
center=True and window is an offset that covers all the rows
(GH42753)
* Styler.hide_columns() now hides the index name header row as
well as column headers (GH42101)
* Styler.set_sticky() has amended CSS to control the column/index
names and ensure the correct sticky positions (GH42537)
* Bug in de-serializing datetime indexes in PYTHONOPTIMIZED mode
(GH42866)
OBS-URL: https://build.opensuse.org/request/show/920383
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=64
- Update to version 1.3.1
Fixed regressions
* Pandas could not be built on PyPy (GH42355)
* DataFrame constructed with an older version of pandas could not
be unpickled (GH42345)
* Performance regression in constructing a DataFrame from a
dictionary of dictionaries (GH42248)
* Fixed regression in DataFrame.agg() dropping values when the
DataFrame had an Extension Array dtype, a duplicate index, and
axis=1 (GH42380)
* Fixed regression in DataFrame.astype() changing the order of
noncontiguous data (GH42396)
* Performance regression in DataFrame in reduction operations
requiring casting such as DataFrame.mean() on integer data
(GH38592)
* Performance regression in DataFrame.to_dict() and Series.to_dict
() when orient argument one of “records”, “dict”, or “split”
(GH42352)
* Fixed regression in indexing with a list subclass incorrectly
raising TypeError (GH42433, GH42461)
* Fixed regression in DataFrame.isin() and Series.isin() raising
TypeError with nullable data containing at least one missing
value (GH42405)
* Regression in concat() between objects with bool dtype and
integer dtype casting to object instead of to integer (GH42092)
* Bug in Series constructor not accepting a dask.Array (GH38645)
* Fixed regression for SettingWithCopyWarning displaying
incorrect stacklevel (GH42570)
* Fixed regression for merge_asof() raising KeyError when one of
the by columns is in the index (GH34488)
OBS-URL: https://build.opensuse.org/request/show/911851
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=62
- Update to version 1.2.2
* https://pandas.pydata.org/docs/whatsnew/v1.2.2.html
* fixed regressions and bugfixes
- Update to version 1.2.1
* https://pandas.pydata.org/docs/whatsnew/v1.2.1.html
* fixed regressions and bugfixes
* Calling NumPy ufuncs on non-aligned DataFrames
* The deprecated attributes _AXIS_NAMES and _AXIS_NUMBERS of
DataFrame and Series will no longer show up in dir or inspect.
getmembers calls (GH38740)
* Bumped minimum fastparquet version to 0.4.0 to avoid
AttributeError from numba (GH38344)
* Bumped minimum pymysql version to 0.8.1 to avoid test failures
(GH38344)
* Added reference to backwards incompatible check_freq arg of
testing.assert_frame_equal() and testing.assert_series_equal()
in pandas 1.1.0 whats new (GH34050)
- Update to version 1.2.0
* https://pandas.pydata.org/docs/whatsnew/v1.2.0.html
* WARNING:
The xlwt package for writing old-style .xls excel files is
no longer maintained. The xlrd package is now only for reading
old-style .xls files.
Previously, the default argument engine=None to read_excel()
would result in using the xlrd engine in many cases, including
new Excel 2007+ (.xlsx) files. If openpyxl is installed, many
of these cases will now default to using the openpyxl engine.
See the read_excel() documentation for more details.
Thus, it is strongly encouraged to install openpyxl to read
Excel 2007+ (.xlsx) files. Please do not report issues when
using ``xlrd`` to read ``.xlsx`` files. This is no longer
supported, switch to using openpyxl instead.
Attempting to use the xlwt engine will raise a FutureWarning
unless the option io.excel.xls.writer is set to "xlwt". While
this option is now deprecated and will also raise a
FutureWarning, it can be globally set and the warning
suppressed. Users are recommended to write .xlsx files using
the openpyxl engine instead.
Enhancements
* Optionally disallow duplicate labels
* Passing arguments to fsspec backends
* Support for binary file handles in to_csv
* Support for short caption and table position in to_latex
* Change in default floating precision for read_csv and
read_table
* Experimental nullable data types for float data
* Index/column name preservation when aggregating
* GroupBy supports EWM operations directly
Deprecations
* https://pandas.pydata.org/docs/whatsnew/v1.2.0.html#deprecations
- Skip python36 build: New minimum supported Python is 3.7.1
- Only Suggest instead of Recommend optional dependencies. Nobody
wants to pull in all of those packages by default.
- Remove pandas-pytest.ini
- Rework test deselection
- Limit to 4 pytest-xdist workers, as collection consumes a lot of
memory
OBS-URL: https://build.opensuse.org/request/show/872216
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=56
- specfile:
* updated versions of some requirements, require numpy during build
* removed pandas-pr34991-npconstructor.patch, included upstream
* removed sed commands that are not needed anymore
* skip test to see if pandas is installed
- update to version 1.1.1:
* Fixed regressions
+ Fixed regression in CategoricalIndex.format() where, when
stringified scalars had different lengths, the shorter string
would be right-filled with spaces, so it had the same length as
the longest string (GH35439)
+ Fixed regression in Series.truncate() when trying to truncate a
single-element series (GH35544)
+ Fixed regression where DataFrame.to_numpy() would raise a
RuntimeError for mixed dtypes when converting to str (GH35455)
+ Fixed regression where read_csv() would raise a ValueError when
pandas.options.mode.use_inf_as_na was set to True (GH35493)
+ Fixed regression where pandas.testing.assert_series_equal()
would raise an error when non-numeric dtypes were passed with
check_exact=True (GH35446)
+ Fixed regression in .groupby(..).rolling(..) where column
selection was ignored (GH35486)
+ Fixed regression where DataFrame.interpolate() would raise a
TypeError when the DataFrame was empty (GH35598)
+ Fixed regression in DataFrame.shift() with axis=1 and
heterogeneous dtypes (GH35488)
+ Fixed regression in DataFrame.diff() with read-only data
(GH35559)
+ Fixed regression in .groupby(..).rolling(..) where a segfault
would occur with center=True and an odd number of values
OBS-URL: https://build.opensuse.org/request/show/832629
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-pandas?expand=0&rev=29
- Skip test_raw_roundtrip on i586
- Update to version 1.0.5
* Fixed regressions
+ Fix regression in read_parquet() when reading from file-like objects (GH34467).
+ Fix regression in reading from public S3 buckets (GH34626).
Note this disables the ability to read Parquet files from
directories on S3 again (GH26388, GH34632), which was added
in the 1.0.4 release, but is now targeted for pandas 1.1.0.
+ Fixed regression in replace() raising an AssertionError when replacing values in an extension dtype with values of a different dtype (GH34530)
* Bug fixes
+ Fixed building from source with Python 3.8 fetching the wrong version of NumPy
OBS-URL: https://build.opensuse.org/request/show/817948
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-pandas?expand=0&rev=27
- Update to version 1.0.5
* Fixed regressions
+ Fix regression in read_parquet() when reading from file-like objects (GH34467).
+ Fix regression in reading from public S3 buckets (GH34626).
Note this disables the ability to read Parquet files from
directories on S3 again (GH26388, GH34632), which was added
in the 1.0.4 release, but is now targeted for pandas 1.1.0.
+ Fixed regression in replace() raising an AssertionError when replacing values in an extension dtype with values of a different dtype (GH34530)
* Bug fixes
+ Fixed building from source with Python 3.8 fetching the wrong version of NumPy
OBS-URL: https://build.opensuse.org/request/show/816736
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python:numeric/python-pandas?expand=0&rev=43