SHA256
1
0
forked from pool/openmpi4
openmpi4/openmpi4.changes

154 lines
7.7 KiB
Plaintext

-------------------------------------------------------------------
Wed Apr 28 11:08:54 UTC 2021 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- openmpi4 is now the default openmpi for releases > 15.3
-------------------------------------------------------------------
Wed Apr 28 09:24:33 UTC 2021 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Update to version 4.1.1
- Fix a number of datatype issues, including an issue with
improper handling of partial datatypes that could lead to
an unexpected application failure.
- Change UCX PML to not warn about MPI_Request leaks during
MPI_FINALIZE by default. The old behavior can be restored with
the mca_pml_ucx_request_leak_check MCA parameter.
- Reverted temporary solution that worked around launch issues in
SLURM v20.11.{0,1,2}. SchedMD encourages users to avoid these
versions and to upgrade to v20.11.3 or newer.
- Updated PMIx to v3.2.2.
- Disabled gcc built-in atomics by default on aarch64 platforms.
- Disabled UCX PML when UCX v1.8.0 is detected. UCX version 1.8.0 has a bug that
may cause data corruption when its TCP transport is used in conjunction with
the shared memory transport. UCX versions prior to v1.8.0 are not affected by
this issue. Thanks to @ksiazekm for reporting the issue.
- Fixed detection of available UCX transports/devices to better inform PML
prioritization.
- Fixed SLURM support to mark ORTE daemons as non-MPI tasks.
- Improved AVX detection to more accurately detect supported
platforms. Also improved the generated AVX code, and switched to
using word-based MCA params for the op/avx component (vs. numeric
big flags).
- Improved OFI compatibility support and fixed memory leaks in error
handling paths.
- Improved HAN collectives with support for Barrier and Scatter. Thanks
to @EmmanuelBRELLE for these changes and the relevant bug fixes.
- Fixed MPI debugger support (i.e., the MPIR_Breakpoint() symbol).
Thanks to @louisespellacy-arm for reporting the issue.
- Fixed ORTE bug that prevented debuggers from reading MPIR_Proctable.
- Removed PML uniformity check from the UCX PML to address performance
regression.
- Fixed MPI_Init_thread(3) statement about C++ binding and update
references about MPI_THREAD_MULTIPLE. Thanks to Andreas Lösel for
bringing the outdated docs to our attention.
- Added fence_nb to Flux PMIx support to address segmentation faults.
- Ensured progress of AIO requests in the POSIX FBTL component to
prevent exceeding maximum number of pending requests on MacOS.
- Used OPAL's mutli-thread support in the orted to leverage atomic
operations for object refcounting.
- Fixed segv when launching with static TCP ports.
- Fixed --debug-daemons mpirun CLI option.
- Fixed bug where mpirun did not honor --host in a managed job
allocation.
- Made a managed allocation filter a hostfile/hostlist.
- Fixed bug to marked a generalized request as pending once initiated.
- Fixed external PMIx v4.x check.
- Fixed OSHMEM build with `--enable-mem-debug`.
- Fixed a performance regression observed with older versions of GCC when
__ATOMIC_SEQ_CST is used. Thanks to @BiplabRaut for reporting the issue.
- Fixed buffer allocation bug in the binomial tree scatter algorithm when
non-contiguous datatypes are used. Thanks to @sadcat11 for reporting the issue.
- Fixed bugs related to the accumulate and atomics functionality in the
osc/rdma component.
- Fixed race condition in MPI group operations observed with
MPI_THREAD_MULTIPLE threading level.
- Fixed a deadlock in the TCP BTL's connection matching logic.
- Fixed pml/ob1 compilation error when CUDA support is enabled.
- Fixed a build issue with Lustre caused by unnecessary header includes.
- Fixed a build issue with IMB LSF workload manager.
- Fixed linker error with UCX SPML.
-------------------------------------------------------------------
Wed Mar 24 08:01:35 UTC 2021 - Egbert Eich <eich@suse.com>
- Update to version 4.1.0
* collectives: Add HAN and ADAPT adaptive collectives components.
Both components are off by default and can be enabled by specifying
"mpirun --mca coll_adapt_priority 100 --mca coll_han_priority 100 ...".
We intend to enable both by default in Open MPI 5.0.
* OMPIO is now the default for MPI-IO on all filesystems, including
Lustre (prior to this, ROMIO was the default for Lustre). Many
thanks to Mark Dixon for identifying MPI I/O issues and providing
access to Lustre systems for testing.
* Minor MPI one-sided RDMA performance improvements.
* Fix hcoll MPI_SCATTERV with MPI_IN_PLACE.
* Add AVX support for MPI collectives.
* Updates to mpirun(1) about "slots" and PE=x values.
* Fix buffer allocation for large environment variables. Thanks to
@zrss for reporting the issue.
* Upgrade the embedded OpenPMIx to v3.2.2.
* Fix issue with extra-long values in MCA files. Thanks to GitHub
user @zrss for bringing the issue to our attention.
* UCX: Fix zero-sized datatype transfers.
* Fix --cpu-list for non-uniform modes.
* Fix issue in PMIx callback caused by missing memory barrier on Arm platforms.
* OFI MTL: Various bug fixes.
* Fixed issue where MPI_TYPE_CREATE_RESIZED would create a datatype
with unexpected extent on oddly-aligned datatypes.
* collectives: Adjust default tuning thresholds for many collective
algorithms
* runtime: fix situation where rank-by argument does not work
* Portals4: Clean up error handling corner cases
* runtime: Remove --enable-install-libpmix option, which has not
worked since it was added
* UCX: Allow UCX 1.8 to be used with the btl uct
* UCX: Replace usage of the deprecated NB API of UCX with NBX
* OMPIO: Add support for the IME file system
* OFI/libfabric: Added support for multiple NICs
* OFI/libfabric: Added support for Scalable Endpoints
* OFI/libfabric: Added btl for one-sided support
* OFI/libfabric: Multiple small bugfixes
* libnbc: Adding numerous performance-improving algorithms
- Removed: reproducible.patch - replaced by spec file settings.
-------------------------------------------------------------------
Tue Sep 8 18:35:17 UTC 2020 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Update to version 4.0.5
- See NEWS for the detailled changelog
-------------------------------------------------------------------
Thu Jun 11 08:42:56 UTC 2020 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Update to version 4.0.4
- See NEWS for the detailled changelog
-------------------------------------------------------------------
Tue Jun 9 08:40:06 UTC 2020 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Update to version 4.0.3
- See NEWS for the detailled changelog
- Fixes compilation with UCX 1.8
- Drop memory-patcher-fix-compiler-warning.patch which was merged upstream
-------------------------------------------------------------------
Thu Mar 19 08:28:04 UTC 2020 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Drop different package string between SLES and Leap
-------------------------------------------------------------------
Wed Jan 15 09:05:14 UTC 2020 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Add memory-patcher-fix-compiler-warning.patch to fix 64bit portability issues
-------------------------------------------------------------------
Thu Oct 31 17:05:14 UTC 2019 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Link against libnuma (bsc#1155120)
-------------------------------------------------------------------
Thu Oct 24 14:40:07 UTC 2019 - Nicolas Morey-Chaisemartin <nmoreychaisemartin@suse.com>
- Initial version (4.0.2)
- Add reproducible.patch for reproducible builds.