* Use full URL for sources. * Drop -ffat-lto-objects in cflags as we do not build static libs. * Add gmock and gtest BuildRequires to allow using system gtest for tests rather than re-building the bundled libs every time. * Do not force 4 parallel threads when running ctest. OBS-URL: https://build.opensuse.org/package/show/science/kokkos?expand=0&rev=32
644 lines
30 KiB
Plaintext
644 lines
30 KiB
Plaintext
-------------------------------------------------------------------
|
|
Mon May 5 20:24:59 UTC 2025 - Atri Bhattacharya <badshah400@gmail.com>
|
|
|
|
- Update to version 4.6.01:
|
|
* Fix symbol visibility: make sure that AUTO has external
|
|
linkage (gh#kokkos/kokkos#7898).
|
|
* Building with generated makefiles: add missing HIP XNACK
|
|
source file (gh#kokkos/kokkos#8030).
|
|
* Fix building benchmarks when Kokkos exists as a
|
|
subproject/subdirectory (gh#kokkos/kokkos#8045).
|
|
- Changes from version 4.6.00:
|
|
* Too many to list, see
|
|
<https://github.com/kokkos/kokkos/blob/4.6.00/CHANGELOG.md>.
|
|
- Use full URL for sources.
|
|
- Drop -ffat-lto-objects in cflags as we do not build static libs.
|
|
- Add gmock and gtest BuildRequires to allow using system gtest
|
|
for tests rather than re-building the bundled libs every time.
|
|
- Do not force 4 parallel threads when running ctest.
|
|
|
|
-------------------------------------------------------------------
|
|
Tue May 7 03:00:34 UTC 2024 - ming li <mli@suse.com> - 4.3.00
|
|
|
|
- Update to version 4.3.00:
|
|
* Features:
|
|
Add Experimental::sort_by_key(exec, keys, values) algorithm #6801
|
|
* Backend and Architecture Enhancements:
|
|
CUDA:
|
|
Experimental multi-GPU support (from the same process) #6782
|
|
Link against CUDA libraries even with
|
|
KOKKOS_ENABLE_COMPILE_AS_CMAKE_LANGUAGE #6701
|
|
Don't use the compiler launcher script if the CMake compile
|
|
language is CUDA. #6704
|
|
nvcc(wrapper): adding "long" and "short" versions for all flags #6615
|
|
HIP:
|
|
Fix compilation when using amdclang (with ROCm >= 5.7) and RDC #6857
|
|
Use rocthrust for sorting, when available #6793
|
|
SYCL:
|
|
We only support OneAPI SYCL implementation: add check during
|
|
initialization
|
|
Error out on initialization if the backend is different from
|
|
ext_oneapi_* #6784
|
|
Filter GPU devices for ext_onapi_* GPU devices #6758
|
|
Performance Improvements
|
|
Avoid unnecessary zero-memset of the scratch flags in SYCL #6739
|
|
Use host-pinned memory to copy reduction/scan result #6500
|
|
Address deprecations after oneAPI 2023.2.0 #6577
|
|
Make sure to call find_dependency for oneDPL if necessary #6870
|
|
OpenMPTarget:
|
|
Use LLVM extensions for dynamic shared memory #6380
|
|
Guard scratch memory usage in ParallelReduce #6585
|
|
Update linker flags for Intel GPUs update #6735
|
|
Improve handling of printf on Intel GPUs #6652
|
|
OpenACC:
|
|
Add atomics support #6446
|
|
Make the OpenACC backend asynchronous #6772
|
|
Threads:
|
|
Add missing broadcast to TeamThreadRange parallel_scan #6601
|
|
OpenMP:
|
|
Improve performance of view initializations and filling with zeros #6573
|
|
* General Enhancements
|
|
Improve performance of random number generation when using a normal
|
|
distribution on GPUs #6556
|
|
Allocate temporary view with the user-provided execution space instance
|
|
and do not initialize in unique algorithm #6598
|
|
Add deduction guide for Kokkos::Array #6373
|
|
Provide new public headers <Kokkos_Clamp.hpp> and <Kokkos_MinMax.hpp> #6687
|
|
Fix/improvement to remove_if parallel algorithm: use the provided execution
|
|
space instance for temporary allocations and drop unnecessaryinitialization
|
|
+ avoid evaluating twice the predicate during final pass #6747
|
|
Add runtime function to query the number of devices and make device ID
|
|
consistent with KOKKOS_VISIBLE_DEVICES #6713
|
|
simd: support vector_aligned_tag #6243
|
|
Avoid unnecessary allocation when default constructing Bitset #6524
|
|
Fix constness for views in std algorithms #6813
|
|
Improve error message on unsafe implicit conversion in MDRangePolicy #6855
|
|
CTAD (deduction guides) for RangePolicy #6850
|
|
CTAD (deduction guides) for MDRangePolicy #5516
|
|
* Build System Changes
|
|
Require Kokkos_ENABLE_ATOMICS_BYPASS option to bypass atomic operation for
|
|
Serial backend only builds #6692
|
|
Add support for RISCV and the Milk-V's Pioneer #6773
|
|
Add C++26 standard to CMake setup #6733
|
|
Fix Makefile when using gnu_generate_makefile.sh and make >= 4.3 #6606
|
|
Cuda: Fix configuring with CMake >= 3.28.4 - temporary fallback to internal
|
|
CudaToolkit.cmake #6898
|
|
* Incompatibilities (i.e. breaking changes)
|
|
Remove all DEPRECATED_CODE_3 option and all code that was guarded by it #6523
|
|
Drop guards to accommodate external code defining KOKKOS_ASSERT #6665
|
|
Profiling::ProfilingSection(std::string) constructor marked explicit
|
|
and nodiscard #6690
|
|
Add bound check preconditions for RangePolicy and MDRangePolicy #6617 #6726
|
|
Add checks for unsafe implicit conversions in RangePolicy #6754
|
|
Remove Kokkos::[b]half_t volatile overloads #6579
|
|
Remove KOKKOS_IMPL_DO_NOT_USE_PRINTF #6593
|
|
Check matching static extents in View constructor #5190
|
|
Tools(profiling): fix typo Kokkos_Tools_Optim[i]zationGoal #6642
|
|
Remove variadic range policy constructor (disallow passing multiple trailing
|
|
chunk size arguments) #6845
|
|
Improve message on view out of bounds access and always abort #6861
|
|
Drop KOKKOS_ENABLE_INTEL_MM_ALLOC macro #6797
|
|
Remove Kokkos::Experimental::LogicalMemorySpace (without going through
|
|
deprecation) #6557
|
|
Remove Experimental::HBWSpace and support for linking against memkind #6791
|
|
Drop librt TPL and associated KOKKOS_ENABLE_LIBRT macro #6798
|
|
Drop support for old CPU architectures (ARCH_BGQ, ARCH_POWER7, ARCH_WSM and
|
|
associated ARCH_SSE4 macro) #6806
|
|
Drop support for deprecated command-line arguments and environment variables #6744
|
|
* Deprecations
|
|
Provide kokkos_swap as part of Core and deprecate Experimental::swap in
|
|
Algorithms #6697
|
|
Deprecate {Cuda,HIP}::detect_device_count() and Cuda::[detect_]device_arch() #6710
|
|
Deprecate ExecutionSpace::in_parallel() #6582
|
|
* Bug Fixes
|
|
Fix team-level MDRange reductions: #6511
|
|
Fix CUDA and SYCL small value type (16-bit) team reductions #5334
|
|
Enable {transform_}exclusive_scan in place #6667
|
|
fill_random overload that do not take an execution space instance argument
|
|
should fence #6658
|
|
HIP,Cuda,OpenMPTarget: Fixup use provided execution space when copying host
|
|
inaccessible reduction result #6777
|
|
Fix typo in cuda_func_set_attribute[s]_wrapper preventing proper setting of
|
|
desired occupancy #6786
|
|
Avoid undefined behavior due to conversion between signed and unsigned integers in
|
|
shift_{right, left}_team_impl #6821
|
|
Fix a bug in Makefile.kokkos when using AMD GPU architectures as AMD_GFXYYY #6892
|
|
|
|
-------------------------------------------------------------------
|
|
Sun Mar 10 14:55:26 UTC 2024 - Sarah Kriesch <sarah.kriesch@opensuse.org>
|
|
|
|
- Remove constraints file because of improved memory usage
|
|
- Edit shebang lines for nvcc_wrapper and kokkos_launch_compiler because of rpm errors
|
|
|
|
-------------------------------------------------------------------
|
|
Wed Dec 6 16:28:45 UTC 2023 - Cezary Skrzyński <czars1988@gmail.com> - 4.2.00
|
|
|
|
- Update to new upstream release 4.2.00:
|
|
see https://github.com/kokkos/kokkos/blob/4.2.00/CHANGELOG.md
|
|
|
|
-------------------------------------------------------------------
|
|
Wed Jul 5 12:00:00 UTC 2023 - Vicente Adolfo Bolea Sanchez <vicente.bolea@gmail.com> - 4.1.00
|
|
|
|
- Update to new upstream release 4.1.00:
|
|
see https://github.com/kokkos/kokkos/blob/4.1.00/CHANGELOG.md
|
|
- Use a tarball as Source0 instead of an URL.
|
|
|
|
-------------------------------------------------------------------
|
|
Wed May 3 12:00:01 UTC 2023 - Vicente Adolfo Bolea Sanchez <vicente.bolea@gmail.com> - 4.0.01
|
|
|
|
- Update to new upstream release 4.0.01:
|
|
see https://github.com/kokkos/kokkos/blob/62d2b6c879b74b6ae7bd06eb3e5e80139c4708e6/CHANGELOG.md
|
|
- Use Ninja for building the project.
|
|
|
|
-------------------------------------------------------------------
|
|
Tue Apr 4 19:39:56 UTC 2023 - Cezary Skrzyński <czars1988@gmail.com> - 4.0.00
|
|
|
|
- Update to new upstream release 4.0.00:
|
|
see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md#400-2023-02-21
|
|
- Increase memory limits for build to avoid OpenMP tests running
|
|
out of memory
|
|
- Add kokkos-do-not-use-env-as-interpreter.patch to fix rpmlint
|
|
complaint about `env` being used as interpreter
|
|
|
|
-------------------------------------------------------------------
|
|
Tue Apr 26 10:40:11 UTC 2022 - Jan Engelhardt <jengelh@inai.de>
|
|
|
|
- Resolve rpmlint report "libkokkos3.x86_64: E:
|
|
shlib-policy-name-error SONAME: libkokkoscontainers.so.3.3.0,
|
|
expected package suffix: 3_3_0"
|
|
|
|
-------------------------------------------------------------------
|
|
Tue Oct 12 22:41:31 UTC 2021 - Stefan Brüns <stefan.bruens@rwth-aachen.de>
|
|
|
|
- Remove random constraint adjustments again, and use a global
|
|
lower limit of 6 GB of memory (minimum for the OMP tests).
|
|
- Limit the number of parallel jobs and OMP threads in the unit
|
|
tests, to keep the required memory bounded.
|
|
This way, even architectures with many cores and comparatively
|
|
low memory can build successfully.
|
|
|
|
-------------------------------------------------------------------
|
|
Sun Oct 3 16:28:39 UTC 2021 - Sarah Kriesch <sarah.kriesch@opensuse.org>
|
|
|
|
- New entries in constraints file because of missing build nodes with
|
|
18GB of memory for s390x and PPC:
|
|
- 2GB for memoryperjob at x86, riscv64 and s390x
|
|
- aarch64 3GB for memoryperjob because of "Out of memory" with 2GB
|
|
- PPC64 and PPC64le with 1GB for memoryperjob because of missing
|
|
node capacity
|
|
|
|
-------------------------------------------------------------------
|
|
Sat Dec 19 15:05:39 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- dropped 3308.patch - merged upstream
|
|
- Version bump to 3.3.00:
|
|
- Features:
|
|
- Require C++14 as minimum C++ standard. C++17 and C++20 are
|
|
supported too.
|
|
- HIP backend is nearly feature complete. Kokkos Dynamic Task
|
|
Graphs are missing.
|
|
- Major update for OpenMPTarget: many capabilities now work.
|
|
For details contact us.
|
|
- Added DPC++/SYCL backend: primary capabilites are working.
|
|
- Added Kokkos Graph API analogous to CUDA Graphs.
|
|
- Added parallel_scan support with TeamThreadRange
|
|
[gh#kokkos/kokkos#3536]
|
|
- Added Logical Memory Spaces [gh#kokkos/kokkos#3546]
|
|
- Added initial half precision support [gh#kokkos/kokkos#3439]
|
|
- Experimental feature: control cuda occupancy
|
|
[gh#kokkos/kokkos#3379]
|
|
- Implemented enhancements Backends and Archs:
|
|
- Add a64fx and fujitsu Compiler support
|
|
[gh#kokkos/kokkos#3614]
|
|
- Adding support for AMD gfx908 archictecture
|
|
[gh#kokkos/kokkos#3375]
|
|
- SYCL parallel_for MDRangePolicy [gh#kokkos/kokkos#3583]
|
|
- SYCL add parallel_scan [gh#kokkos/kokkos#3577]
|
|
- SYCL custom reductions [gh#kokkos/kokkos#3544]
|
|
- SYCL Enable container unit tests [gh#kokkos/kokkos#3550]
|
|
- SYCL feature level 5 [gh#kokkos/kokkos#3480]
|
|
- SYCL Feature level 4 (parallel_for) [gh#kokkos/kokkos#3474]
|
|
- SYCL feature level 3 [gh#kokkos/kokkos#3451]
|
|
- SYCL feature level 2 [gh#kokkos/kokkos#3447]
|
|
- OpenMPTarget: Hierarchial reduction for + operator on
|
|
scalars [gh#kokkos/kokkos#3504]
|
|
- OpenMPTarget hierarchical [gh#kokkos/kokkos#3411]
|
|
- HIP Add Impl::atomic_[store,load] [gh#kokkos/kokkos#3440]
|
|
- HIP enable global lock arrays [gh#kokkos/kokkos#3418]
|
|
- HIP Implement multiple occupancy paths for various HIP
|
|
kernel launchers [gh#kokkos/kokkos#3366]
|
|
- Implemented enhancements Policies:
|
|
- MDRangePolicy: Let it be semiregular [gh#kokkos/kokkos#3494]
|
|
- MDRangePolicy: Check narrowing conversion in construction
|
|
[gh#kokkos/kokkos#3527]
|
|
- MDRangePolicy: CombinedReducers support
|
|
[gh#kokkos/kokkos#3395]
|
|
- Kokkos Graph: Interface and Default Implementation
|
|
[gh#kokkos/kokkos#3362]
|
|
- Kokkos Graph: add Cuda Graph implementation
|
|
[gh#kokkos/kokkos#3369]
|
|
- TeamPolicy: implemented autotuning of team sizes and vector
|
|
lengths [gh#kokkos/kokkos#3206]
|
|
- RangePolicy: Initialize all data members in default
|
|
constructor [gh#kokkos/kokkos#3509]
|
|
- Implemented enhancements BuildSystem:
|
|
- Auto-generate core test files for all backends
|
|
[gh#kokkos/kokkos#3488]
|
|
- Avoid rewriting test files when calling cmake
|
|
[gh#kokkos/kokkos#3548]
|
|
- RULE_LAUNCH_COMPILE and RULE_LAUNCH_LINK system for
|
|
nvcc_wrapper [gh#kokkos/kokkos#3136]
|
|
- Adding -include as a known argument to nvcc_wrapper
|
|
[gh#kokkos/kokkos#3434]
|
|
- Install hpcbind script [gh#kokkos/kokkos#3402]
|
|
- cmake/kokkos_tribits.cmake: add parsing for args
|
|
[gh#kokkos/kokkos#3457]
|
|
- Implemented enhancements Tools:
|
|
- Changed namespacing of
|
|
Kokkos::Tools::Impl::Impl::tune_policy
|
|
[gh#kokkos/kokkos#3455]
|
|
- Delegate to an impl allocate/deallocate method to allow
|
|
specifying a SpaceHandle for MemorySpaces
|
|
[gh#kokkos/kokkos#3530]
|
|
- Use the Kokkos Profiling interface rather than the Impl
|
|
interface [gh#kokkos/kokkos#3518]
|
|
- Runtime option for tuning [gh#kokkos/kokkos#3459]
|
|
- Dual View Tool Events [gh#kokkos/kokkos#3326]
|
|
- Implemented enhancements Other:
|
|
- Abort on errors instead of just printing
|
|
[gh#kokkos/kokkos#3528]
|
|
- Enable C++14 macros unconditionally [gh#kokkos/kokkos#3449]
|
|
- Make ViewMapping trivially copyable [gh#kokkos/kokkos#3436]
|
|
- Rename struct ViewMapping to class [gh#kokkos/kokkos#3435]
|
|
- Replace enums in Kokkos_ViewMapping.hpp (removes -Wextra)
|
|
[gh#kokkos/kokkos#3422]
|
|
- Use bool for enums representing bools
|
|
[gh#kokkos/kokkos#3416]
|
|
- Fence active instead of default execution space instances
|
|
[gh#kokkos/kokkos#3388]
|
|
- Refactor parallel_reduce fence usage [gh#kokkos/kokkos#3359]
|
|
- Moved Space EBO helpers to Kokkos_EBO
|
|
[gh#kokkos/kokkos#3357]
|
|
- Add remove_cvref type trait [gh#kokkos/kokkos#3340]
|
|
- Adding identity type traits and update definition of
|
|
identity_t alias [gh#kokkos/kokkos#3339]
|
|
- Add is_specialization_of type trait [gh#kokkos/kokkos#3338]
|
|
- Make ScratchMemorySpace semi-regular [gh#kokkos/kokkos#3309]
|
|
- Optimize min/max atomics with early exit on no-op case
|
|
[gh#kokkos/kokkos#3265]
|
|
- Refactor Backend Development [gh#kokkos/kokkos#2941]
|
|
- Fixed bugs:
|
|
- Fixup MDRangePolicy construction from Kokkos arrays
|
|
[gh#kokkos/kokkos#3591]
|
|
- Add atomic functions for unsigned long long using gcc
|
|
built-in [gh#kokkos/kokkos#3588]
|
|
- Fixup silent pointless comparison with zero in
|
|
checked_narrow_cast (compiler workaround)
|
|
[gh#kokkos/kokkos#3566]
|
|
- Fixes for ROCm 3.9 [gh#kokkos/kokkos#3565]
|
|
- Fix windows build issues which crept in for the CUDA build
|
|
[gh#kokkos/kokkos#3532]
|
|
- HIP Fix atomics of large data types and clean up lock arrays
|
|
[gh#kokkos/kokkos#3529]
|
|
- Pthreads fix exception resulting from 0 grain size
|
|
[gh#kokkos/kokkos#3510]
|
|
- Fixup do not require atomic operation to be default
|
|
constructible [gh#kokkos/kokkos#3503]
|
|
- Fix race condition in HIP backend [gh#kokkos/kokkos#3467]
|
|
- Replace KOKKOS_DEBUG with KOKKOS_ENABLE_DEBUG
|
|
[gh#kokkos/kokkos#3458]
|
|
- Fix multi-stream team scratch space definition for HIP
|
|
[gh#kokkos/kokkos#3398]
|
|
- HIP fix template deduction [gh#kokkos/kokkos#3393]
|
|
- Fix compiling with HIP and C++17 [gh#kokkos/kokkos#3390]
|
|
- Fix sigFPE in HIP blocksize deduction
|
|
[gh#kokkos/kokkos#3378]
|
|
- Type alias change: replace CS with CTS to avoid conflicts
|
|
with NVSHMEM [gh#kokkos/kokkos#3348]
|
|
- Clang compilation of CUDA backend on Windows
|
|
[gh#kokkos/kokkos#3345]
|
|
- Fix HBW support [gh#kokkos/kokkos#3343]
|
|
- Added missing fences to unique token [gh#kokkos/kokkos#3260]
|
|
- Incompatibilities:
|
|
- Remove unused utilities (forward, move, and expand_variadic)
|
|
from Kokkos::Impl [gh#kokkos/kokkos#3535]
|
|
- Remove unused traits [gh#kokkos/kokkos#3534]
|
|
- HIP: Remove old HCC code [gh#kokkos/kokkos#3301]
|
|
- Prepare for deprecation of ViewAllocateWithoutInitializing
|
|
[gh#kokkos/kokkos#3264]
|
|
- Remove ROCm backend [gh#kokkos/kokkos#3148]
|
|
|
|
-------------------------------------------------------------------
|
|
Sun Sep 20 14:32:52 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- added 3308.patch to fix naming of printer-tool
|
|
- Version bump to 3.2.00:
|
|
- Implemented enhancements:
|
|
- HIP:Enable stream in HIP [gh#kokkos/kokkos#3163]
|
|
- HIP:Add support for shuffle reduction for the HIP backend
|
|
[gh#kokkos/kokkos#3154]
|
|
- HIP:Add implementations of missing HIPHostPinnedSpace
|
|
methods for LAMMPS [gh#kokkos/kokkos#3137]
|
|
- HIP:Require HIP 3.5.0 or higher [gh#kokkos/kokkos#3099]
|
|
- HIP:WorkGraphPolicy for HIP [gh#kokkos/kokkos#3096]
|
|
- OpenMPTarget: Significant update to the new experimental
|
|
backend. Requires C++17, works on Intel GPUs, reference
|
|
counting fixes. [gh#kokkos/kokkos#3169]
|
|
- Windows Cuda support [gh#kokkos/kokkos#3018]
|
|
- Pass -Wext-lambda-captures-this to NVCC when support for
|
|
__host__ __device__ lambda is enabled from CUDA 11
|
|
[gh#kokkos/kokkos#3241]
|
|
- Use explicit staging buffer for constant memory kernel
|
|
launches and cleanup host/device synchronization
|
|
[gh#kokkos/kokkos#3234]
|
|
- Various fixup to policies including making TeamPolicy
|
|
default constructible and making RangePolicy and TeamPolicy
|
|
assignable: [gh#kokkos/kokkos#3202], [gh#kokkos/kokkos#3203],
|
|
[gh#kokkos/kokkos#3196]
|
|
- Annotations for DefaultExectutionSpace and
|
|
DefaultHostExectutionSpace to use in static analysis
|
|
[gh#kokkos/kokkos#3189]
|
|
- Add documentation on using Spack to install Kokkos and
|
|
developing packages that depend on Kokkos
|
|
[gh#kokkos/kokkos#3187]
|
|
- Add OpenMPTarget backend flags for NVC++ compiler
|
|
[gh#kokkos/kokkos#3185]
|
|
- Move deep_copy/create_mirror_view on
|
|
Experimental::OffsetView into Kokkos:: namespace
|
|
[gh#kokkos/kokkos#3166]
|
|
- Allow for larger block size in HIP [gh#kokkos/kokkos#3165]
|
|
- View: Added names of Views to the different View
|
|
initialize/free kernels [gh#kokkos/kokkos#3159]
|
|
- Cuda: Caching cudaFunctorAttributes and whether L1/Shmem
|
|
prefer was set [gh#kokkos/kokkos#3151]
|
|
- BuildSystem: Improved performance in default configuration
|
|
by defaulting to Release build [gh#kokkos/kokkos#3131]
|
|
- Cuda: Update CUDA occupancy calculation
|
|
[gh#kokkos/kokkos#3124]
|
|
- Vector: Adding data() to Vector [gh#kokkos/kokkos#3123]
|
|
- BuildSystem: Add CUDA Ampere configuration support
|
|
[gh#kokkos/kokkos#3122]
|
|
- General: Apply [[noreturn]] to Kokkos::abort when applicable
|
|
[gh#kokkos/kokkos#3106]
|
|
- TeamPolicy: Validate storage level argument passed to
|
|
TeamPolicy::set_scratch_size() [gh#kokkos/kokkos#3098]
|
|
- BuildSystem: Make kokkos_has_string() function in
|
|
Makefile.kokkos case insensitive [gh#kokkos/kokkos#3091]
|
|
- Modify KOKKOS_FUNCTION macro for clang-tidy analysis
|
|
[gh#kokkos/kokkos#3087]
|
|
- Move allocation profiling to allocate/deallocate calls
|
|
[gh#kokkos/kokkos#3084]
|
|
- BuildSystem: FATAL_ERROR when attempting in-source build
|
|
[gh#kokkos/kokkos#3082]
|
|
- Change enums in ScatterView to types [gh#kokkos/kokkos#3076]
|
|
- HIP: Changes for new compiler/runtime
|
|
[gh#kokkos/kokkos#3067]
|
|
- Extract and use get_gpu [gh#kokkos/kokkos#3061] ,
|
|
[gh#kokkos/kokkos#3048]
|
|
- Add is_allocated to View-like containers
|
|
[gh#kokkos/kokkos#3059]
|
|
- Combined reducers for scalar references
|
|
[gh#kokkos/kokkos#3052]
|
|
- Add configurable capacity for UniqueToken
|
|
[gh#kokkos/kokkos#3051]
|
|
- Add installation testing [gh#kokkos/kokkos#3034]
|
|
- HIP: Add UniqueToken [gh#kokkos/kokkos#3020]
|
|
- Autodetect number of devices [gh#kokkos/kokkos#3013]
|
|
- Fixed bugs:
|
|
- Check error code from cudaStreamSynchronize in CUDA fences
|
|
[gh#kokkos/kokkos#3255]
|
|
- Fix issue with C++ standard flags when using nvcc\_wrapper
|
|
with PGI [gh#kokkos/kokkos#3254]
|
|
- Add missing threadfence in lock-based atomics
|
|
[gh#kokkos/kokkos#3208]
|
|
- Fix dedup of linker flags for shared lib on CMake <=3.12
|
|
[gh#kokkos/kokkos#3176]
|
|
- Fix memory leak with CUDA streams [gh#kokkos/kokkos#3170]
|
|
- BuildSystem: Fix OpenMP Target flags for Cray
|
|
[gh#kokkos/kokkos#3161]
|
|
- ScatterView: fix for OpenmpTarget remove inheritance from
|
|
reducers [gh#kokkos/kokkos#3162]
|
|
- BuildSystem: Set OpenMP flags according to host compiler
|
|
[gh#kokkos/kokkos#3127]
|
|
- OpenMP: Fix logic for nested omp in partition_master bug
|
|
[gh#kokkos/kokkos#3101]
|
|
- nvcc_wrapper: send --cudart to nvcc instead of host compiler
|
|
[gh#kokkos/kokkos#3092]
|
|
- BuildSystem: Fixes for Cuda/11 and c++17
|
|
[gh#kokkos/kokkos#3085]
|
|
- HIP: Fix print_configuration [gh#kokkos/kokkos#3080]
|
|
- Conditionally define get_gpu [gh#kokkos/kokkos#3072]
|
|
- Fix bounds for ranges in random number generator
|
|
[gh#kokkos/kokkos#3069]
|
|
- Fix Cuda minor arch check [gh#kokkos/kokkos#3035]
|
|
- BuildSystem: Add -expt-relaxed-constexpr flag to
|
|
nvcc_wrapper [gh#kokkos/kokkos#3021]
|
|
- Incompatibilities:
|
|
- Remove ETI support [gh#kokkos/kokkos#3157]
|
|
- Remove KOKKOS_INTERNAL_ENABLE_NON_CUDA_BACKEND
|
|
[gh#kokkos/kokkos#3147]
|
|
- Remove core/unit_test/config [gh#kokkos/kokkos#3146]
|
|
- Removed the preprocessor branch for KOKKOS_ENABLE_PROFILING
|
|
[gh#kokkos/kokkos#3115]
|
|
- Disable profiling with MSVC [gh#kokkos/kokkos#3066]
|
|
- Closed issues:
|
|
- Silent error (Validate storage level arg to
|
|
set_scratch_size) [gh#kokkos/kokkos#3097]
|
|
- Remove KOKKKOS_ENABLE_PROFILING Option
|
|
[gh#kokkos/kokkos#3095]
|
|
- Cuda 11 -> allow C++17 [gh#kokkos/kokkos#3083]
|
|
- In source build failure not explained
|
|
[gh#kokkos/kokkos#3081]
|
|
- Allow naming of Views for initialization kernel
|
|
[gh#kokkos/kokkos#3070]
|
|
- DefaultInit tests failing when using CTest resource
|
|
allocation feature [gh#kokkos/kokkos#3040]
|
|
- Add installation testing. [gh#kokkos/kokkos#3037]
|
|
- nvcc_wrapper needs to handle -expt-relaxed-constexpr flag
|
|
[gh#kokkos/kokkos#3017]
|
|
- CPU core oversubscription warning on macOS with OpenMP
|
|
backend [gh#kokkos/kokkos#2996]
|
|
- Default behavior of KOKKOS_NUM_DEVICES to use all devices
|
|
available [gh#kokkos/kokkos#2975]
|
|
- Assert blocksize > 0 [gh#kokkos/kokkos#2974]
|
|
- Add ability to assign kokkos profile function from
|
|
executable [gh#kokkos/kokkos#2973]
|
|
- ScatterView Support for the pre/post increment operator
|
|
[gh#kokkos/kokkos#2967]
|
|
- Compiler issue: Cuda build with clang 10 has errors with the
|
|
atomic unit tests [gh#kokkos/kokkos#3237]
|
|
- Incompatibility of flags for C++ standard with PGI v20.4 on
|
|
Power9/NVIDIA V100 system [gh#kokkos/kokkos#3252]
|
|
- Error configuring as subproject [gh#kokkos/kokkos#3140]
|
|
- CMake fails with Nvidia compilers when the GPU architecture
|
|
option is not supplied (Fix configure with OMPT and Cuda)
|
|
[gh#kokkos/kokkos#3207]
|
|
- PGI compiler being passed the gcc -fopenmp flag
|
|
[gh#kokkos/kokkos#3125]
|
|
- Cuda: Memory leak when using CUDA stream
|
|
[gh#kokkos/kokkos#3167]
|
|
- RangePolicy has an implicitly deleted assignment operator
|
|
[gh#kokkos/kokkos#3192]
|
|
- MemorySpace::allocate needs to have memory pool counting.
|
|
[gh#kokkos/kokkos#3064]
|
|
- Missing write fence for lock based atomics on CUDA
|
|
[gh#kokkos/kokkos#3038]
|
|
- CUDA compute capability version check problem
|
|
[gh#kokkos/kokkos#3026]
|
|
- Make DynRankView fencing consistent [gh#kokkos/kokkos#3014]
|
|
- nvcc_wrapper cant handle -Xcompiler -o out.o
|
|
[gh#kokkos/kokkos#2993]
|
|
- Reductions of non-trivial types of size 4 fail in CUDA shfl
|
|
operations [gh#kokkos/kokkos#2990]
|
|
- complex_double misalignment in reduce, clang+CUDA
|
|
[gh#kokkos/kokkos#2989]
|
|
- Span of degenerated (zero-length) subviews is not zero in
|
|
some special cases [gh#kokkos/kokkos#2979]
|
|
- Rank 1 custom layouts dont work as expected.
|
|
[gh#kokkos/kokkos#2840]
|
|
|
|
-------------------------------------------------------------------
|
|
Thu Aug 20 08:53:04 UTC 2020 - Martin Liška <mliska@suse.cz>
|
|
|
|
- Use memoryperjob constraint instead of %limit_build macro.
|
|
|
|
-------------------------------------------------------------------
|
|
Wed May 6 01:58:29 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- drop fix-StackTraceTest.patch, merged upstream
|
|
- Version bump to 3.1.01:
|
|
see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md
|
|
* Fixed bugs:
|
|
- Fix complex_double misalignment in reduce, clang+CUDA
|
|
[gh#kokkos/kokkos#2989]
|
|
- Fix compilation fails when profiling disabled and CUDA
|
|
enabled [gh#kokkos/kokkos#3001]
|
|
- Fix cuda reduction of non-trivial scalars of size 4
|
|
[gh#kokkos/kokkos#2990]
|
|
- Configure and install version file when building in Trilinos
|
|
[gh#kokkos/kokkos#2957]
|
|
- Fix OpenMPTarget build missing include and namespace
|
|
[gh#kokkos/kokkos#3000]
|
|
- fix typo in KOKKOS_SET_EXE_PROPERTY() [gh#kokkos/kokkos#2959]
|
|
- Fix non-zero span subviews of zero sized subviews
|
|
[gh#kokkos/kokkos#2979]
|
|
|
|
-------------------------------------------------------------------
|
|
Wed Apr 22 12:57:15 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- add missing Conflicts: marker
|
|
|
|
-------------------------------------------------------------------
|
|
Fri Apr 17 15:20:27 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- added fix-StackTraceTest.patch to fix StackTraceTest test
|
|
- Version bump to 3.1.00:
|
|
see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md
|
|
* Features:
|
|
- HIP Support for AMD
|
|
- OpenMPTarget Support with clang
|
|
- Windows VS19 (Serial) Support [gh#kokkos/kokkos#1533]
|
|
* Implemented enhancements:
|
|
- generate_makefile.bash should allow tests to be disabled
|
|
[gh#kokkos/kokkos#2886]
|
|
- clang/7+cuda/9 build -Werror-unused parameter error in
|
|
nightly test [gh#kokkos/kokkos#2884]
|
|
- ScatterView memory space is not user settable
|
|
[gh#kokkos/kokkos#2826]
|
|
- clang/8+cuda/10.0 build error with c++17
|
|
[gh#kokkos/kokkos#2809]
|
|
- warnings.... [gh#kokkos/kokkos#2805]
|
|
- Kokkos version in cpp define [gh#kokkos/kokkos#2787]
|
|
- Remove Defunct QThreads Backend [gh#kokkos/kokkos#2751]
|
|
- Improve Kokkos::fence behavior with multiple execution
|
|
spaces [gh#kokkos/kokkos#2659]
|
|
- polylithic(?) initialization of Kokkos
|
|
[gh#kokkos/kokkos#2658]
|
|
- Unnecessary(?) check for host execution space initialization
|
|
from Cuda initialization [gh#kokkos/kokkos#2652]
|
|
- Kokkos error reporting failures with CUDA GPUs in exclusive
|
|
mode [gh#kokkos/kokkos#2471]
|
|
- atomicMax equivalent (and other atomics)
|
|
[gh#kokkos/kokkos#2401]
|
|
- Fix alignment for Kokkos::complex [gh#kokkos/kokkos#2255]
|
|
- Warnings with Cuda 10.1 [gh#kokkos/kokkos#2206]
|
|
- dual view with Kokkos::ViewAllocateWithoutInitializing
|
|
[gh#kokkos/kokkos#2188]
|
|
- Check error code from
|
|
cudaOccupancyMaxActiveBlocksPerMultiprocessor
|
|
[gh#kokkos/kokkos#2172]
|
|
- Add non-member Kokkos::resize/realloc for DualView
|
|
[gh#kokkos/kokkos#2170]
|
|
- Construct DualView without initialization
|
|
[gh#kokkos/kokkos#2046]
|
|
- Expose is_assignable to determine if one view can be
|
|
assigned to another [gh#kokkos/kokkos#1936]
|
|
- profiling label [gh#kokkos/kokkos#1935]
|
|
- team_broadcast of bool failed on CUDA backend
|
|
[gh#kokkos/kokkos#1908]
|
|
- View static_extent [gh#kokkos/kokkos#660]
|
|
- Misleading Kokkos::Cuda::initialize ERROR message when
|
|
compiled for wrong GPU architecture [gh#kokkos/kokkos#1944]
|
|
- Cryptic Error When Malloc Fails [gh#kokkos/kokkos#2164]
|
|
- Drop support for intermediate standards in CMake
|
|
[gh#kokkos/kokkos#2336]
|
|
* Fixed bugs:
|
|
- DualView sync_device with length zero creates cuda errors
|
|
[gh#kokkos/kokkos#2946]
|
|
- building with nvcc and clang (or clang based XL) as host
|
|
compiler: "Kokkos::atomic_fetch_min(volatile int *, int)"
|
|
has already been defined [gh#kokkos/kokkos#2903]
|
|
- Cuda 9.1,10.1 debug builds failing due to
|
|
-Werror=unused-parameter [gh#kokkos/kokkos#2880]
|
|
- clang -Werror: Kokkos_FixedBufferMemoryPool.hpp:140:28:
|
|
error: unused parameter 'alloc_size' [gh#kokkos/kokkos#2869]
|
|
- intel/16.0.1, intel/17.0.1 nightly build failures with
|
|
debugging enabled [gh#kokkos/kokkos#2867]
|
|
- intel/16.0.1 debug build errors [gh#kokkos/kokkos#2863]
|
|
- xl/16.1.1 with cpp14, openmp build, nightly test failures
|
|
[gh#kokkos/kokkos#2856]
|
|
- Intel nightly test failures: team_vector
|
|
[gh#kokkos/kokkos#2852]
|
|
- Kokkos Views with intmax/2<N<intmax can hang during
|
|
construction [gh#kokkos/kokkos#2850]
|
|
- workgraph_fib test seg-faults with threads backend and hwloc
|
|
[gh#kokkos/kokkos#2797]
|
|
- cuda.view_64bit test hangs on Power8+Kepler37 system -
|
|
develop and 2.9.00 branches [gh#kokkos/kokkos#2771]
|
|
- device_type for Kokkos_Random ? [gh#kokkos/kokkos#2693]
|
|
- "More than one tag given" error in Experimental::require()
|
|
[gh#kokkos/kokkos#2608]
|
|
- Segfault on Marvell from our finalization stack
|
|
[gh#kokkos/kokkos#2542]
|
|
|
|
-------------------------------------------------------------------
|
|
Thu Feb 27 21:47:22 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- Version bump to 3.0.00 - see https://github.com/kokkos/kokkos/blob/master/CHANGELOG.md
|
|
Implemented enhancements:
|
|
+ BuildSystem: Standalone Modern CMake Support #2104
|
|
+ StyleFormat: ClangFormat Style #2157
|
|
+ Documentation: Document build system and CMake philosophy #2263
|
|
+ BuildSystem: Add Alias with Namespace Kokkos:: to Interal Libraries #2530
|
|
+ BuildSystem: Universal Kokkos find_package #2099
|
|
+ BuildSystem: Dropping support for Kokkos_{DEVICES,OPTIONS,ARCH} in CMake #2329
|
|
+ BuildSystem: Set Kokkos_DEVICES and Kokkos_ARCH variables in exported CMake configuration #2193
|
|
+ BuildSystem: Drop support for CUDA 7 and CUDA 8 #2489
|
|
+ BuildSystem: Drop CMake option SEPARATE_TESTS #2266
|
|
+ BuildSystem: Support expt-relaxed-constexpr same as expt-extended-lambda #2411
|
|
+ BuildSystem: Add Xnvlink to command line options allowed in nvcc_wrapper #2197
|
|
+ BuildSystem: Install Kokkos config files and target files to lib/cmake/Kokkos #2162
|
|
+ BuildSystem: nvcc_wrappers and c++ 14 #2035
|
|
+ BuildSystem: Kokkos version major/version minor (Feature request) #1930
|
|
+ BuildSystem: CMake namespaces (and other modern cmake cleanup) #1924
|
|
+ BuildSystem: Remove capability to install Kokkos via GNU Makefiles #2332
|
|
+ Documentation: Remove PDF ProgrammingGuide in Kokkos replace with link #2244
|
|
+ View: Add Method to Resize View without Initialization #2048
|
|
+ Vector: implement “insert” method for Kokkos_Vector (as a serial function on host) #2437
|
|
|
|
-------------------------------------------------------------------
|
|
Tue Feb 4 17:45:52 UTC 2020 - Christoph Junghans <junghans@votca.org>
|
|
|
|
- Initial commit
|
|
|