SHA256
1
0
forked from pool/openucx
Commit Graph

8 Commits

Author SHA256 Message Date
Nicolas Morey-Chaisemartin
cc6c36d10f Accepting request 874909 from home:NMoreyChaisemartin:branches:science:HPC
- Update openucx-s390x-support.patch to fix mmap syscall on s390x (bsc#1182691)

OBS-URL: https://build.opensuse.org/request/show/874909
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=44
2021-02-24 17:24:21 +00:00
Nicolas Morey-Chaisemartin
f10927b874 Accepting request 840386 from home:NMoreyChaisemartin:branches:science:HPC
- Update to v1.9.0 (jsc#SLE-15163)
  - Features:
    - Added a new class of communication APIs '*_nbx' that enable API extendability while
    - preserving ABI backward compatibility
    - Added asynchronous event support to UCT/IB/DEVX
    - Added support for latest CUDA library version
    - Added NAK-based reliability protocol for UCT/IB/UD to optimize resends
    - Added new tests for ROCm
    - Added new configuration parameters for protocol selection
    - Added performance optimization for Fujitsu A64FX with InfiniBand
    - Added performance optimization for clear cache code aarch64
    - Added support for relaxed-order PCIe access in IB RDMA transports
    - Added new TCP connection manager
    - Added support for UCT/IB PKey with partial membership in IB transports
    - Added support for RoCE LAG
    - Added support for ROCm 3.7 and above
    - Added flow control for RDMA read operations
    - Improved endpoint flush implementation for UCT/IB
    - Improved UD timer to avoid interrupting the main thread when not in use
    - Improved latency estimation for network path with CUDA
    - Improved error reporting messages
    - Improved performance in active message flow (removed malloc call)
    - Improved performance in ptr_array flow
    - Improved performance in UCT/SM progress engine flow
    - Improved I/O demo code
    - Improved rendezvous protocol for CUDA
    - Updated examples code
  - Bugfixes:
    - Fixes for most resent versions of GCC, CLANG, ARMCLANG, PGI
    - Fixes in UCT/IB for strict order keys

OBS-URL: https://build.opensuse.org/request/show/840386
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=42
2020-10-09 06:50:44 +00:00
9033dd246f Accepting request 811684 from home:NMoreyChaisemartin:branches:science:HPC
- Update to v1.8.0
  - Features:
    - Improved detection for DEVX support
    - Improved TCP scalability
    - Added support for ROCM to perftest
    - Added support for different source and target memory types to perftest
    - Added optimized memcpy for ROCM devices
    - Added hardware tag-matching for CUDA buffers
    - Added support for CUDA and ROCM managed memories
    - Added support for client/server disconnect protocol over rdma connection manager
    - Added support for striding receive queue for hardware tag-matching
    - Added XPMEM-based rendezvous protocol for shared memory
    - Added support shared memory communication between containers on same machine
    - Added support for multi-threaded RDMA memory registration for large regions
    - Added new test cases to Azure CI
    - Added support for multiple listening transports
    - Added UCT socket-based connection manager transport
    - Updated API for UCT component management
    - Added API to retrieve the listening port
    - Added UCP active message API
    - Removed deprecated API for querying UCT memory domains
    - Refactored server/client examples
    - Added support for dlopen interception in UCM
    - Added support for PCIe atomics
    - Updated Java API: added support for most of UCP layer operations
    - Updated support for Mellanox DevX API
    - Added multiple UCT/TCP transport performance optimizations
    - Optimized memcpy() for Intel platforms
    - Added protection from non-UCX socket based app connections
    - Improved search time for PKEY object
    - Enabled gtest over IPv6 interfaces
    - Updated Mellanox and Bull device IDs
    - Added support for CUDA_VISIBLE_DEVICES
    - Increased limits for CUDA IPC registration
  - Bugfixes:
    - Multiple fixes in JUCX
    - Fixes in UCP thread safety
    - Fixes for most recent versions GCC, PGI, and ICC
    - Fixes for CPU affinity on Azure instances
    - Fixes in XPMEM support on PPC64
    - Performance fixes in CUDA IPC
    - Fixes in RDMA CM flows
    - Multiple fixes in TCP transport
    - Multiple fixes in documentation
    - Fixes in transport lane selection logic
    - Fixes in Java jar build
    - Fixes in socket connection manager for Nvidia DGX-2 platform
    - Multiple fixes in UCP, UCT, UCM libraries
    - Multiple fixes for BSD and Mac OS systems
    - Fixes for Clang compiler
    - Fix CPU optimization configuration options
    - Fix JUCX build on GPU nodes
    - Fix in Azure release pipeline flow
    - Fix in CUDA memory hooks management
    - Fix in GPU memory peer direct gtest
    - Fix in TCP connection establishment flow
    - Fix in GPU IPC check
    - Fix in CUDA Jenkins test flow
    - Multiple fixes in CUDA IPC flow
    - Fix adding missing header files
    - Fix to prevent failures in presence of VPN enabled Ethernet interfaces
- Refresh openucx-s390x-support.patch against new version

OBS-URL: https://build.opensuse.org/request/show/811684
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=37
2020-06-05 08:02:58 +00:00
Nicolas Morey-Chaisemartin
de6138b03e Accepting request 733589 from home:NMoreyChaisemartin:branches:science:HPC
- Update to v1.6.0
  - Features:
    - Modular architecture for UCT transports
    - ROCm transport re-design: support for managed memory, direct copy, ROCm GDR
    - Random scheduling policy for DC transport
    - Optimized out-of-box settings for multi-rail
    - Added support for OmniPath (using Verbs)
    - Support for PCI atomics with IB transports
    - Reduced UCP address size for homogeneous environments
  - Bugfixes:
    - Multiple stability and performance improvements in TCP transport
    - Multiple stability fixed in Verbs and MLX5 transports
    - Multiple stability fixes in UCM memory hooks
    - Multiple stability fixes in UGNI transport
    - RPM Spec file cleanup
    - Fixing compilation issues with most recent clang and gcc compilers
    - Fixing the wrong name of aliases
    - Fix data race in UCP wireup
    - Fix segfault when libuct.so is reloaded - issue #3558
    - Include Java sources in distribution
    - Handle EADDRNOTAVAIL in rdma_cm connection manager
    - Disable ibcm on RHEL7+ by default
    - Fix data race in UCP proxy endpoint
    - Static checker fixes
    - Fallback to ibv_create_cq() if ibv_create_cq_ex() returns ENOSYS
    - Fix malloc hooks test
    - Fix checking return status in ucp_client_server example
    - Fix gdrcopy libdir config value
    - Fix printing atomic capabilities in ucx_info
    - Fix perftest warmup iterations to be non-zero

OBS-URL: https://build.opensuse.org/request/show/733589
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=33
2019-09-27 08:19:55 +00:00
Nicolas Morey-Chaisemartin
fd1e5380fe Accepting request 678966 from home:NMoreyChaisemartin:branches:science:HPC
- Update openucx-s390x-support.patch to fix support of 1.5.0 on s390x
- Add baselibs.conf for ppc

- Update to v1.5.0 (bsc#1121267)
  * Features:
  * New emulation mode enabling full UCX functionality (Atomic, Put, Get)
  * over TCP and RDMA-CORE interconnects which don't implement full RDMA semantics
  * Non-blocking API for all one-sided operations. All blocking communication APIs marked
  * as deprecated
  * New client/server connection establishment API, which allows connected handover between workers
  * Support for rdma-core direct-verbs (DEVX) and DC with mlx5 transports
  * GPU - Support for stream API and receive side pipelining
  * Malloc hooks using binary instrumentation instead of symbol override
  * Statistics for UCT tag API
  * GPU-to-Infiniband HCA affinity support based on locality/distance (PCIe)
  * Bugfixes:
  * Fix overflow in RC/DC flush operations
  * Update description in SPEC file and README
  * Fix RoCE source port for dc_mlx5 flow control
  * Improve ucx_info help message
  * Fix segfault in UCP, due to int truncation in count_one_bits()
  * Multiple other bugfixes (full list on github)
  * Tested configurations:
  * InfiniBand: MLNX_OFED 4.4-4.5, distribution inbox drivers, rdma-core
  * CUDA: gdrcopy 1.2, cuda 9.1.85
  * XPMEM: 2.6.2
  * KNEM: 1.1.2

OBS-URL: https://build.opensuse.org/request/show/678966
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=27
2019-02-25 16:53:29 +00:00
Nicolas Morey-Chaisemartin
6c87d0bee6 Accepting request 628372 from home:NMoreyChaisemartin:branches:science:HPC
- Update to v1.3.0 (bsc#1104159)
  - Added stream-based communication API to UCP
  - Added support for GPU platforms: Nvidia CUDA and AMD ROCM software stacks
  - Added API for client/server based connection establishment
  - Added support for TCP transport
  - Support for InfiniBand tag-matching offload for DC and accelerated transports
  - Multi-rail support for eager and rendezvous protocols
  - Added support for tag-matching communications with CUDA buffers
  - Added ucp_rkey_ptr() to obtain pointer for shared memory region
  - Avoid progress overhead on unused transports
  - Improved scalability of software tag-matching by using a hash table
  - Added transparent huge-pages allocator
  - Added non-blocking flush and disconnect for UCP
  - Support fixed-address memory allocation via ucp_mem_map()
  - Added ucp_tag_send_nbr() API to avoid send request allocation
  - Support global addressing in all IB transports
  - Add support for external epoll fd and edge-triggered events
  - Added registration cache for knem
  - Initial support for Java bindings
  - Multiple bugfixes (full list on github)
- Drop UCT-UD-fixed-compilation-by-gcc8.patch as it was fixed upstream
- Refresh openucx-s390x-support.patch against latest sources

OBS-URL: https://build.opensuse.org/request/show/628372
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=21
2018-08-09 10:25:09 +00:00
2b8d3bdf06 Switch to "proper" 1.2.1 tarball.
Rediff openucx-s390x-support.patch as p1 to be in line with potential git-generated patches.

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=9
2017-09-19 13:53:14 +00:00
Dominique Leuenberger
42718c30e4 Accepting request 507873 from science:HPC
- Disable avx at configure level

- Add openucx-s390x-support.patch to fix compilation on s390x
- Compile openucx on s390x

- Fix compilation on ppc

- Update to snapshot 1.3+git44
  * No changelog was found
- Add -Wno-error and disable AVX/SSE as it is not guaranteed
  to exist.

OBS-URL: https://build.opensuse.org/request/show/507873
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=3
2017-07-12 17:33:54 +00:00