Commit Graph

99 Commits

Author SHA256 Message Date
Nicolas Morey
a9159c654a Merge remote-tracking branch 'devel/main' into slfo-main (jsc#PED-15325)
* devel/main:
  Update to v1.19.1
  Add patches to fix a badly initialized value in settings
  Fix a badly initialized value in settings
  Minor fixes to openucx-s390x-support.patch
  Add Gitea build results
  - Update to ucx 1.19.0   - UCP     - Enabled multi-GPU support within a single process     - Added dynamic selection between strong and weak fences in RMA flush operations     - Improved endpoint reconfiguration capabilities     - Added All2All lane selection for multi-NIC-GPU systems     - Improved rkey debug info when config cache limit is reached     - Improved UCP protocol selection based on available memory types     - Removed dummy memory key from irrelevant transports (TCP, CMA and CUDA)     - Improved RNDV performance with device-local staging buffers     - Enabled error handling for RMA get_offload protocols     - Made UCX_TLS=^ib disable all transports including auxiliary     - Fixed send request status handling     - Fixed performance degradation in RNDV by optimizing md cache updates     - Fixed protocol selection when first lane is filtered out by fragment size     - Fixed rkey selection by using memory registration flag   - UCT     - Defined uct_rkey_unpack_v2 API to support passing sys-dev   - RDMA CORE (IB, ROCE, etc.)     - Added SRD transport support in EFA with reordering, AM, and control operations     - Removed XGVMI BF2 support (umem)     - Removed device memory indirect key     - Fixed VFS objects for DCIs and pools     - Added routing table cache to the reachability check     - Fixed strict order usage in IB auxiliary rkeys     - Improved various init logging messages     - Improved reliability of DC transport by adding DCI validation and separating connection logic     - Fixed segfault in DC fence operation   - UCS     - Removed compilation warnings
  - Update to ucx 1.18.1   - CUDA     - Added config keys to update cuda_copy bandwidth for coherent platforms     - Improved cache invalidation of memory allocated using CUDA memory pool   - AZP     - Added Ubuntu 24.04 to build and release pipeline   - UCP     - Fixed assertion failure when maximum lane fragment is smaller than AM header     - Fixed potential active message user header use after free with protocol reconfiguration   - CUDA     - Fixed registration of CUDA Fabric memory allocated by UCT     - Fixed VA recycling check of memory allocated using VMM and CUDA memory pool   - RDMA CORE (IB, ROCE, etc.)     - Do not use ConnectX-8 SMI subdevices for communication     - Fixed remote access error by disabling ODP when the device supports DDP     - Fixed configuration logic by disabling DDP when AR is disabled   - UCM     - Fixed crash with bistro hooks for CUDA 12.9 on amd64
  add patches to fix gcc-15 compile errors (boo#1241939)
  - Add UCT-IB-UD-Use-GRH-to-detect-address-family-on-non-Mellanox-hardware.patch   to fix an UD init issue on non-Mellanox RDMA HW (bsc#1240204).
  Accepting request 1247273 from home:NMorey:branches:science:HPC
  Accepting request 1247161 from home:NMorey:branches:science:HPC
  Accepting request 1199375 from home:NMorey:branches:science:HPC
  Accepting request 1184022 from openSUSE:Factory:RISCV
  Accepting request 1183477 from home:NMorey:branches:science:HPC

Signed-off-by: Nicolas Morey <nmorey@suse.com>
2026-01-15 08:41:20 +01:00
Nicolas Morey
3ed2204149 Update to v1.19.1
- Features
    - UCP
      - Do not require transport memory support if rendezvous protocol is not used
    - Build
      - Added CUDA 13 support to the release pipeline
      - Added Rocky OS support to the release pipeline
  - Bugfixes
    - UCS
      - Fixed Netlink fetch mechanism

Signed-off-by: Nicolas Morey <nmorey@suse.com>
2026-01-02 14:34:18 +01:00
Nicolas Morey
ed9e44370b Add patches to fix a badly initialized value in settings
Signed-off-by: Nicolas Morey <nmorey@suse.com>
2025-11-07 17:54:15 +01:00
Nicolas Morey
7690a30a01 Fix a badly initialized value in settings
Signed-off-by: Nicolas Morey <nmorey@suse.com>
2025-11-07 17:22:29 +01:00
Nicolas Morey
a1035f1e89 Minor fixes to openucx-s390x-support.patch
Signed-off-by: Nicolas Morey <nmorey@suse.com>
2025-11-05 17:49:32 +01:00
2e169061f4 Add Gitea build results 2025-10-27 17:35:42 +01:00
0e3357c05c Accepting request 1298351 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1298351
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=37
2025-08-09 17:58:51 +00:00
5f25c6c29c - Update to ucx 1.19.0
- UCP
    - Enabled multi-GPU support within a single process
    - Added dynamic selection between strong and weak fences in RMA flush operations
    - Improved endpoint reconfiguration capabilities
    - Added All2All lane selection for multi-NIC-GPU systems
    - Improved rkey debug info when config cache limit is reached
    - Improved UCP protocol selection based on available memory types
    - Removed dummy memory key from irrelevant transports (TCP, CMA and CUDA)
    - Improved RNDV performance with device-local staging buffers
    - Enabled error handling for RMA get_offload protocols
    - Made UCX_TLS=^ib disable all transports including auxiliary
    - Fixed send request status handling
    - Fixed performance degradation in RNDV by optimizing md cache updates
    - Fixed protocol selection when first lane is filtered out by fragment size
    - Fixed rkey selection by using memory registration flag
  - UCT
    - Defined uct_rkey_unpack_v2 API to support passing sys-dev
  - RDMA CORE (IB, ROCE, etc.)
    - Added SRD transport support in EFA with reordering, AM, and control operations
    - Removed XGVMI BF2 support (umem)
    - Removed device memory indirect key
    - Fixed VFS objects for DCIs and pools
    - Added routing table cache to the reachability check
    - Fixed strict order usage in IB auxiliary rkeys
    - Improved various init logging messages
    - Improved reliability of DC transport by adding DCI validation and separating connection logic
    - Fixed segfault in DC fence operation
  - UCS
    - Removed compilation warnings

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=82
2025-08-08 08:15:59 +00:00
f8b8d435cc Accepting request 1285180 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1285180
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=36
2025-06-13 16:42:51 +00:00
e6035d1f52 - Update to ucx 1.18.1
- CUDA
    - Added config keys to update cuda_copy bandwidth for coherent platforms
    - Improved cache invalidation of memory allocated using CUDA memory pool
  - AZP
    - Added Ubuntu 24.04 to build and release pipeline
  - UCP
    - Fixed assertion failure when maximum lane fragment is smaller than AM header
    - Fixed potential active message user header use after free with protocol reconfiguration
  - CUDA
    - Fixed registration of CUDA Fabric memory allocated by UCT
    - Fixed VA recycling check of memory allocated using VMM and CUDA memory pool
  - RDMA CORE (IB, ROCE, etc.)
    - Do not use ConnectX-8 SMI subdevices for communication
    - Fixed remote access error by disabling ODP when the device supports DDP
    - Fixed configuration logic by disabling DDP when AR is disabled
  - UCM
    - Fixed crash with bistro hooks for CUDA 12.9 on amd64

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=80
2025-06-12 14:32:39 +00:00
f22c7e86d8 Accepting request 1277496 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1277496
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=35
2025-05-23 12:29:12 +00:00
d8d8c7c955 add patches to fix gcc-15 compile errors (boo#1241939)
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=78
2025-05-14 20:27:27 +00:00
aa486005dd Accepting request 1266178 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1266178
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=34
2025-04-02 15:09:07 +00:00
77c5e72d38 Accepting request 1266178 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1266178
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=34
2025-04-02 15:09:07 +00:00
28afc5599d - Add UCT-IB-UD-Use-GRH-to-detect-address-family-on-non-Mellanox-hardware.patch
to fix an UD init issue on non-Mellanox RDMA HW (bsc#1240204).

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=76
2025-04-01 13:23:59 +00:00
46d315ac9e - Add UCT-IB-UD-Use-GRH-to-detect-address-family-on-non-Mellanox-hardware.patch
to fix an UD init issue on non-Mellanox RDMA HW (bsc#1240204).

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=76
2025-04-01 13:23:59 +00:00
7905fb8b39 Accepting request 1247274 from science:HPC
- Update to ucx 1.18.0
  - UCP
    - Enabled using CUDA staging buffers for pipeline protocols by default
    - Added endpoint reconfiguration support for non-reused p2p scenarios
    - Enabled non-cacheable memory domains, activated for gdr_copy
    - Added user_data parameter to ucp_ep_query
    - Added support for host memory pipeline through CUDA buffers for rendezvous protocol
    - Added global VA infrastructure and memory region in absence of error handling
    - Made protocol performance node names more informative
    - Enforced always running on the same thread in single thread mode
    - Multiple improvements in protocols selection infrastructure
    - Added UCP_MEM_MAP_LOCK API flag to enforce locked memory mapping
    - Allowed up-to 64 endpoint lanes for systems with many transports or devices
    - Added usage tracker to worker
    - Improved various logging messages
    - Fixed stack overflow in exported rkey unpack
    - Removed extra remote-cpu overhead from protocol estimation for zcopy
    - Fixed performance estimation for rndv pipeline protocols
    - Fixed ATP sending by picking the correct lane
    - Fixed missing reg_id on memh creation
    - Fixed repeated invalidations by retaining existing access flags
    - Fixed abort reason propagation for rendezvous RTR mtype
    - Do not check transport availability if it is disabled by UCX_TLS environment variable
    - Fixed wrong flag being used for checking BCOPY capability
    - Fixed sending too many ATPs for small messages
    - Enforced 16 bits size for Active Messages identifiers
    - Fixed unnecessary status check for emulated AMO
    - Fixed more than one fragment sending in rendezvous pipeline
    - Fixed crash by using biggest max frag across all lanes
    - Fixed missing memory handle flags by copying from parent to child

OBS-URL: https://build.opensuse.org/request/show/1247274
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=33
2025-02-20 15:28:03 +00:00
ad2b6e1eb3 Accepting request 1247274 from science:HPC
- Update to ucx 1.18.0
  - UCP
    - Enabled using CUDA staging buffers for pipeline protocols by default
    - Added endpoint reconfiguration support for non-reused p2p scenarios
    - Enabled non-cacheable memory domains, activated for gdr_copy
    - Added user_data parameter to ucp_ep_query
    - Added support for host memory pipeline through CUDA buffers for rendezvous protocol
    - Added global VA infrastructure and memory region in absence of error handling
    - Made protocol performance node names more informative
    - Enforced always running on the same thread in single thread mode
    - Multiple improvements in protocols selection infrastructure
    - Added UCP_MEM_MAP_LOCK API flag to enforce locked memory mapping
    - Allowed up-to 64 endpoint lanes for systems with many transports or devices
    - Added usage tracker to worker
    - Improved various logging messages
    - Fixed stack overflow in exported rkey unpack
    - Removed extra remote-cpu overhead from protocol estimation for zcopy
    - Fixed performance estimation for rndv pipeline protocols
    - Fixed ATP sending by picking the correct lane
    - Fixed missing reg_id on memh creation
    - Fixed repeated invalidations by retaining existing access flags
    - Fixed abort reason propagation for rendezvous RTR mtype
    - Do not check transport availability if it is disabled by UCX_TLS environment variable
    - Fixed wrong flag being used for checking BCOPY capability
    - Fixed sending too many ATPs for small messages
    - Enforced 16 bits size for Active Messages identifiers
    - Fixed unnecessary status check for emulated AMO
    - Fixed more than one fragment sending in rendezvous pipeline
    - Fixed crash by using biggest max frag across all lanes
    - Fixed missing memory handle flags by copying from parent to child

OBS-URL: https://build.opensuse.org/request/show/1247274
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=33
2025-02-20 15:28:03 +00:00
47635a7117 Accepting request 1247273 from home:NMorey:branches:science:HPC
- Refresh openucx-s390x-support.patch due to API changes

OBS-URL: https://build.opensuse.org/request/show/1247273
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=74
2025-02-20 06:38:45 +00:00
145da08ae6 - Refresh openucx-s390x-support.patch due to API changes
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=74
2025-02-20 06:38:45 +00:00
9a474b25ce Accepting request 1247161 from home:NMorey:branches:science:HPC
- Update to ucx 1.18.0
  - UCP
    - Enabled using CUDA staging buffers for pipeline protocols by default
    - Added endpoint reconfiguration support for non-reused p2p scenarios
    - Enabled non-cacheable memory domains, activated for gdr_copy
    - Added user_data parameter to ucp_ep_query
    - Added support for host memory pipeline through CUDA buffers for rendezvous protocol
    - Added global VA infrastructure and memory region in absence of error handling
    - Made protocol performance node names more informative
    - Enforced always running on the same thread in single thread mode
    - Multiple improvements in protocols selection infrastructure
    - Added UCP_MEM_MAP_LOCK API flag to enforce locked memory mapping
    - Allowed up-to 64 endpoint lanes for systems with many transports or devices
    - Added usage tracker to worker
    - Improved various logging messages
    - Fixed stack overflow in exported rkey unpack
    - Removed extra remote-cpu overhead from protocol estimation for zcopy
    - Fixed performance estimation for rndv pipeline protocols
    - Fixed ATP sending by picking the correct lane
    - Fixed missing reg_id on memh creation
    - Fixed repeated invalidations by retaining existing access flags
    - Fixed abort reason propagation for rendezvous RTR mtype
    - Do not check transport availability if it is disabled by UCX_TLS environment variable
    - Fixed wrong flag being used for checking BCOPY capability
    - Fixed sending too many ATPs for small messages
    - Enforced 16 bits size for Active Messages identifiers
    - Fixed unnecessary status check for emulated AMO
    - Fixed more than one fragment sending in rendezvous pipeline
    - Fixed crash by using biggest max frag across all lanes
    - Fixed missing memory handle flags by copying from parent to child

OBS-URL: https://build.opensuse.org/request/show/1247161
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=73
2025-02-19 20:35:36 +00:00
222004fc02 - Update to ucx 1.18.0
- UCP
    - Enabled using CUDA staging buffers for pipeline protocols by default
    - Added endpoint reconfiguration support for non-reused p2p scenarios
    - Enabled non-cacheable memory domains, activated for gdr_copy
    - Added user_data parameter to ucp_ep_query
    - Added support for host memory pipeline through CUDA buffers for rendezvous protocol
    - Added global VA infrastructure and memory region in absence of error handling
    - Made protocol performance node names more informative
    - Enforced always running on the same thread in single thread mode
    - Multiple improvements in protocols selection infrastructure
    - Added UCP_MEM_MAP_LOCK API flag to enforce locked memory mapping
    - Allowed up-to 64 endpoint lanes for systems with many transports or devices
    - Added usage tracker to worker
    - Improved various logging messages
    - Fixed stack overflow in exported rkey unpack
    - Removed extra remote-cpu overhead from protocol estimation for zcopy
    - Fixed performance estimation for rndv pipeline protocols
    - Fixed ATP sending by picking the correct lane
    - Fixed missing reg_id on memh creation
    - Fixed repeated invalidations by retaining existing access flags
    - Fixed abort reason propagation for rendezvous RTR mtype
    - Do not check transport availability if it is disabled by UCX_TLS environment variable
    - Fixed wrong flag being used for checking BCOPY capability
    - Fixed sending too many ATPs for small messages
    - Enforced 16 bits size for Active Messages identifiers
    - Fixed unnecessary status check for emulated AMO
    - Fixed more than one fragment sending in rendezvous pipeline
    - Fixed crash by using biggest max frag across all lanes
    - Fixed missing memory handle flags by copying from parent to child

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=73
2025-02-19 20:35:36 +00:00
50926fe318 Accepting request 1199376 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1199376
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=32
2024-09-09 12:43:20 +00:00
83523eaad4 Accepting request 1199376 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1199376
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=32
2024-09-09 12:43:20 +00:00
68685ed0da Accepting request 1199375 from home:NMorey:branches:science:HPC
- Refresh openucx-s390x-support.patch to fix compilation on s390x

OBS-URL: https://build.opensuse.org/request/show/1199375
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=71
2024-09-07 14:26:13 +00:00
9f2cde7a87 - Refresh openucx-s390x-support.patch to fix compilation on s390x
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=71
2024-09-07 14:26:13 +00:00
1aaa6114cd Accepting request 1184228 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1184228
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=31
2024-07-03 18:26:35 +00:00
a5f1adbb12 Accepting request 1184228 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1184228
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=31
2024-07-03 18:26:35 +00:00
de09e2a891 Accepting request 1184022 from openSUSE:Factory:RISCV
- Enable build on riscv64

OBS-URL: https://build.opensuse.org/request/show/1184022
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=69
2024-07-01 08:27:55 +00:00
8094d4b34d - Enable build on riscv64
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=69
2024-07-01 08:27:55 +00:00
2b4398a74a Accepting request 1183479 from science:HPC
- Update to 1.17.0
  - See NEWS for the complete CHANGELOG
- Refresh openucx-s390x-support.patch against the latest sources
- Add upstream fix UCS-TIME-Add-math.h-to-provide-INFINITY.patch
  to fix compilation on ppc64

OBS-URL: https://build.opensuse.org/request/show/1183479
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=30
2024-06-29 13:16:13 +00:00
49c5ede7c9 Accepting request 1183479 from science:HPC
- Update to 1.17.0
  - See NEWS for the complete CHANGELOG
- Refresh openucx-s390x-support.patch against the latest sources
- Add upstream fix UCS-TIME-Add-math.h-to-provide-INFINITY.patch
  to fix compilation on ppc64

OBS-URL: https://build.opensuse.org/request/show/1183479
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=30
2024-06-29 13:16:13 +00:00
b79725a512 Accepting request 1183477 from home:NMorey:branches:science:HPC
- Update to 1.17.0
  - See NEWS for the complete CHANGELOG
- Refresh openucx-s390x-support.patch against the latest sources
- Add upstream fix UCS-TIME-Add-math.h-to-provide-INFINITY.patch
  to fix compilation on ppc64

OBS-URL: https://build.opensuse.org/request/show/1183477
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=67
2024-06-26 17:49:24 +00:00
cfaa4352a9 - Update to 1.17.0
- See NEWS for the complete CHANGELOG
- Refresh openucx-s390x-support.patch against the latest sources
- Add upstream fix UCS-TIME-Add-math.h-to-provide-INFINITY.patch
  to fix compilation on ppc64

OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=67
2024-06-26 17:49:24 +00:00
d7ff57612d Accepting request 1151438 from science:HPC
Prepare for RPM 4.20 (forwarded request 1151423 from dimstar)

OBS-URL: https://build.opensuse.org/request/show/1151438
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=29
2024-02-27 21:44:20 +00:00
0835e04bcc Accepting request 1151423 from home:dimstar:rpm4.20:o
Prepare for RPM 4.20

OBS-URL: https://build.opensuse.org/request/show/1151423
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=65
2024-02-26 12:52:59 +00:00
0a42199aad Accepting request 1116008 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1116008
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=28
2023-10-08 10:17:06 +00:00
2a1a111b03 Accepting request 1115979 from home:NMorey:branches:science:HPC
- Update to 1.15.0
  - UCP
    - Added 2-stage pipeline protocol in the new protocol infrastructure
    - Added reset and abort functionality of rendezvous protocols in the
       new infrastructure
    - Added zero-copy rendezvous data send protocol in the new infrastructure
    - Added support for user memory handle in the new protocol infrastructure
    - Added option to force ODP registration for certain memory types
    - Enabled lock free memory region deregistration
    - Updated allow/deny transport list feature to control auxiliary transport selection
    - Multiple performance improvements of the new protocol infrastructure
    - Multiple improvements in error and debug messages
    - Fixed assertion when sending from non-contiguous GPU buffer to managed buffer
    - Fixed the race condition on endpoint configurations
    - Fixed endpoint reconfiguration issues due to asymmetrical selection
    - Fixed endpoint reconfiguration error due to wrong locality detection
    - Fixed crash during connection manager cleanup
    - Fixed rkey index calculation for rendezvous protocol
    - Fixed rcache dump function
    - Removed logging from rkey unpack in release mode
    - Fixed dobule free of rkey in rendezvous protocol
    - Fixed rendezvous pipeline protocol error flow
    - Fixed error handling in rendezvous get zcopy protocol
    - Replay pending requests of wireup EP CM during connection establishment
      to prevent potential ordering issues and wrong configuration
    - Pass user-provided memory type to the function that checks whether the buffer
      can be sent inline or not
    - Avoid memory registration during UCP context initialization
    - Fixed CPU/device atomics selection in the new protocol infrastructure
    - Multiple fixes in the new protocol infrastructure information output

OBS-URL: https://build.opensuse.org/request/show/1115979
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=63
2023-10-06 09:59:22 +00:00
ba3eec4113 Accepting request 1100646 from science:HPC
OBS-URL: https://build.opensuse.org/request/show/1100646
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=27
2023-07-26 11:22:10 +00:00
7d6841ca26 Accepting request 1100640 from home:NMorey:branches:science:HPC
- Update to v1.14.1
  - Fixed ROCm to prevent the locking of host pinned memory
  - Added CUDA 12 based UCX builds to the release flow
  - Increased the maximal number of endpoint configurations
  - Fixed filter for a slow-lanes in selection logic
  - Fixed TCP transport bandwidth calculation
  - Fixed device detection for ROCM
  - Fixed compatibility with CUDA 12
  - Fixed rendezvous threshold for multi-path configurations
  - Fixed error message in case of static link
  - Fixed BlueField-3 detection
  - Multiple fixes for Azure CI pipeline

OBS-URL: https://build.opensuse.org/request/show/1100640
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=61
2023-07-25 13:54:29 +00:00
8a8941ab4f Accepting request 1075600 from science:HPC
- Update to v1.14.0
  - UCP
    - Added API for querying transport and device names on endpoint
    - Added API for querying datatype object
    - Added API for exporting and importing memory keys (no implementation yet)
    - Added support for non-persistent active message header
    - Added infrastructure to print protocols v2 performance
    - Multiple performance improvements for protocols v2
    - Added support for non-contiguous datatypes for rendezvous protocols v2
    - Added support for reset and abort request in protocols v2
    - Added support for user memory handles in RMA API
    - Added multi-rail support for RMA API in protocols v2
    - Added support for up to 16 different lanes per endpoint
    - Added support for dmabuf memory registration in protocols v2
    - Added strong fence mode for ucp_worker_fence() API
  - UCT
    - Added new uct_md_mem_attach() API to support exported memory handles
    - Added remote completion mode for endpoint flush (via new flag)
    - Added support for dmabuf registration
    - Added new uct_ep_connect_to_ep_v2() API
    - Added new uct_mem_reg_v2() API
    - Added new uct_md_query_v2() API
    - Added support for IPv6 loopback address in TCP transport
  - RDMA CORE (IB, ROCE, etc.)
    - Added ECE (enhanced connection establishment) support for RC and DC transports
    - Added support for hardware DCS in DC transport
    - Added UD interface and endpoint resource information to VFS
    - Added CQ creation via DEVX API
    - Removed support for accelerated IB transports over legacy experimental verbs
  - UCS
    - Added support for auto-correction of user environment variables
  - UCM
    - Implemented CUDA bistro hooks for aarch64 (to enable memory cache on this platform)
    - Added support for CUDA virtual/stream-ordered memory with cudaMallocAsync
  - Documentation
    - Added FAQ for using pkg-config tool to build applications with UCX
  - Tools
    - Added runtime library version to the 'ucx_info -v' output
    - Added support for memory types in ucx_info
  - Many bugfixes. See NEWS.
- Drop patch merged upstream:
  - UCS-DEBUG-replace-PTR-with-void.patch
  - gcc13-fix.patch
- Refresh openucx-s390x-support.patch

OBS-URL: https://build.opensuse.org/request/show/1075600
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=26
2023-04-01 21:26:51 +00:00
a42d04ee36 Remove remaining gcc13 patch
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=59
2023-03-30 16:49:05 +00:00
b714ee86f4 - Add gcc13-fix.patch for GCC13 support
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=58
2023-03-29 10:50:52 +00:00
6a412379a9 Accepting request 1075167 from home:NMorey:branches:science:HPC
- Update to v1.14.0
  - UCP
    - Added API for querying transport and device names on endpoint
    - Added API for querying datatype object
    - Added API for exporting and importing memory keys (no implementation yet)
    - Added support for non-persistent active message header
    - Added infrastructure to print protocols v2 performance
    - Multiple performance improvements for protocols v2
    - Added support for non-contiguous datatypes for rendezvous protocols v2
    - Added support for reset and abort request in protocols v2
    - Added support for user memory handles in RMA API
    - Added multi-rail support for RMA API in protocols v2
    - Added support for up to 16 different lanes per endpoint
    - Added support for dmabuf memory registration in protocols v2
    - Added strong fence mode for ucp_worker_fence() API
  - UCT
    - Added new uct_md_mem_attach() API to support exported memory handles
    - Added remote completion mode for endpoint flush (via new flag)
    - Added support for dmabuf registration
    - Added new uct_ep_connect_to_ep_v2() API
    - Added new uct_mem_reg_v2() API
    - Added new uct_md_query_v2() API
    - Added support for IPv6 loopback address in TCP transport
  - RDMA CORE (IB, ROCE, etc.)
    - Added ECE (enhanced connection establishment) support for RC and DC transports
    - Added support for hardware DCS in DC transport
    - Added UD interface and endpoint resource information to VFS
    - Added CQ creation via DEVX API
    - Removed support for accelerated IB transports over legacy experimental verbs
  - UCS
    - Added support for auto-correction of user environment variables
  - UCM
    - Implemented CUDA bistro hooks for aarch64 (to enable memory cache on this platform)
    - Added support for CUDA virtual/stream-ordered memory with cudaMallocAsync
  - Documentation
    - Added FAQ for using pkg-config tool to build applications with UCX
  - Tools
    - Added runtime library version to the 'ucx_info -v' output
    - Added support for memory types in ucx_info
  - Many bugfixes. See NEWS.
- Drop patch merged upstream:
  - UCS-DEBUG-replace-PTR-with-void.patch
  - gcc13-fix.patch
- Refresh openucx-s390x-support.patch

OBS-URL: https://build.opensuse.org/request/show/1075167
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=57
2023-03-29 08:50:48 +00:00
61b71445ce Accepting request 1069629 from science:HPC
- Add upstream gcc13-fix.patch fix. (forwarded request 1069627 from marxin)

OBS-URL: https://build.opensuse.org/request/show/1069629
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=25
2023-03-07 15:48:49 +00:00
1c9eb00a8e Accepting request 1069627 from home:marxin:branches:science:HPC
- Add upstream gcc13-fix.patch fix.

OBS-URL: https://build.opensuse.org/request/show/1069627
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=55
2023-03-06 12:24:21 +00:00
1c024f5f2d Accepting request 1058681 from science:HPC
- openucx-s390x-support.patch: fix use of clz builtin for 64-bit value (forwarded request 1058654 from Andreas_Schwab)

OBS-URL: https://build.opensuse.org/request/show/1058681
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=24
2023-01-17 16:34:47 +00:00
dfc3070ec1 Accepting request 1058654 from home:Andreas_Schwab:Factory
- openucx-s390x-support.patch: fix use of clz builtin for 64-bit value

OBS-URL: https://build.opensuse.org/request/show/1058654
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=53
2023-01-16 11:22:10 +00:00
ec8c3382db Accepting request 1008219 from science:HPC
- Update openucx-s390x-support.patch to add missing ucs_ffs32 on s390x
- Drop baselibs.conf as openucx only works on 64b systems

OBS-URL: https://build.opensuse.org/request/show/1008219
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/openucx?expand=0&rev=23
2022-10-10 16:44:15 +00:00
Nicolas Morey-Chaisemartin
8322be19fe Accepting request 1008118 from home:NMoreyChaisemartin:branches:science:HPC
- Drop baselibs.conf as openucx only works on 64b systems

OBS-URL: https://build.opensuse.org/request/show/1008118
OBS-URL: https://build.opensuse.org/package/show/science:HPC/openucx?expand=0&rev=51
2022-10-05 07:28:51 +00:00