Accepting request 656046 from home:namtrac:branches:science
- Update to version 0.3.4 common: * The new, experimental thread-local memory allocation had inadvertently been left enabled for gmake builds in 0.3.3 despite the announcement. It is now disabled by default, and single-threaded builds will keep using the old allocator even if the USE_TLS option is turned on. * OpenBLAS will now provide enough buffer space for at least 50 threads by default. * The output of openblas_get_config() now contains the version number. * A serious thread safety bug in GEMV operation with small M and large N size has been fixed. * The code will now automatically call blas_thread_init after a fork if needed before handling a call to openblas_set_num_threads * Accesses to parallelized level3 functions from multiple callers are now serialized to avoid thread races (unless using OpenMP). * This should provide better performance than the known-threadsafe (but non-default) USE_SIMPLE_THREADED_LEVEL3 option. * When building LAPACK with gfortran, -frecursive is now (again) enabled by default to ensure correct behaviour. * The OpenBLAS version cblas.h now supports both CBLAS_ORDER and CBLAS_LAYOUT as the name of the matrix row/column order option. * Externally set LDFLAGS are now passed through to the final compile/link * steps to facilitate setting platform-specific linker flags. OBS-URL: https://build.opensuse.org/request/show/656046 OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=70
This commit is contained in:
parent
0737842b18
commit
f70e95f316
@ -1,3 +0,0 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:49d88f4494ae780e3d7fa51769c00d982d7cdb73e696054ac3baa81d42f13bab
|
||||
size 11821696
|
3
OpenBLAS-0.3.4.tar.gz
Normal file
3
OpenBLAS-0.3.4.tar.gz
Normal file
@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:4b4b4453251e9edb5f57465bf2b3cf67b19d811d50c8588cdf2ea1f201bb834f
|
||||
size 11846102
|
@ -1,3 +1,71 @@
|
||||
-------------------------------------------------------------------
|
||||
Fri Dec 7 12:29:27 UTC 2018 - Ismail Dönmez <idonmez@suse.com>
|
||||
|
||||
- Update to version 0.3.4
|
||||
common:
|
||||
* The new, experimental thread-local memory allocation had
|
||||
inadvertently been left enabled for gmake builds in 0.3.3
|
||||
despite the announcement. It is now disabled by default,
|
||||
and single-threaded builds will keep using the old
|
||||
allocator even if the USE_TLS option is turned on.
|
||||
* OpenBLAS will now provide enough buffer space for at least
|
||||
50 threads by default.
|
||||
* The output of openblas_get_config() now contains the version
|
||||
number.
|
||||
* A serious thread safety bug in GEMV operation with small M and
|
||||
large N size has been fixed.
|
||||
* The code will now automatically call blas_thread_init after
|
||||
a fork if needed before handling a call to
|
||||
openblas_set_num_threads
|
||||
* Accesses to parallelized level3 functions from multiple
|
||||
callers are now serialized to avoid thread races
|
||||
(unless using OpenMP).
|
||||
* This should provide better performance than the
|
||||
known-threadsafe (but non-default)
|
||||
USE_SIMPLE_THREADED_LEVEL3 option.
|
||||
* When building LAPACK with gfortran, -frecursive is now
|
||||
(again) enabled by default to ensure correct behaviour.
|
||||
* The OpenBLAS version cblas.h now supports both CBLAS_ORDER
|
||||
and CBLAS_LAYOUT as the name of the matrix row/column order
|
||||
option.
|
||||
* Externally set LDFLAGS are now passed through to the final
|
||||
compile/link
|
||||
* steps to facilitate setting platform-specific linker flags.
|
||||
* A potential race condition during the build of LAPACK
|
||||
(that would usually manifest itself as a failure to build
|
||||
TESTING/MATGEN) has been fixed.
|
||||
* xHEMV has been changed to stay single-threaded for small
|
||||
input sizes where the overhead of multithreading exceeds
|
||||
any possible gains
|
||||
* CSWAP and ZSWAP have been limited to a single thread
|
||||
except on ARMV8 or ThunderX hardware with sizable input.
|
||||
* Linker flags for the PGI compiler have been updated
|
||||
* Behaviour of AXPY with zero increments is now handled
|
||||
in the C interface, correcting the result on at least
|
||||
Intel Atom.
|
||||
* The result matrix from calling SGELSS with an all-zero
|
||||
input matrix is now zeroed completely.
|
||||
x86_64:
|
||||
* Autodetection of AMD Ryzen2 has been fixed (again).
|
||||
* CMAKE builds now support labeling of an INTERFACE64=1
|
||||
build of the library with the _64 suffix.
|
||||
* AVX512 version of DGEMM has been added and the
|
||||
AVX512 SGEMM kernel has been sped up by rewriting
|
||||
with C intrinsics
|
||||
* Fixed compilation on RHEL5/CENTOS5
|
||||
(issue with typename __WAIT_STATUS)
|
||||
armv8:
|
||||
* DYNAMic_ARCH support is now available for 64bit ARM
|
||||
* cross-compiling for ARMV8 under iOS now works.
|
||||
* cpu-specific code has been rearranged to make better
|
||||
use of both hardware commonalities and model-specific
|
||||
compiler optimizations.
|
||||
* XGENE1 has been removed as a TARGET, superseded by the
|
||||
improved generic ARMV8 support.
|
||||
armv7:
|
||||
* Older assembly mnemonics have been converted to UAL
|
||||
form to allow building with clang 7.0
|
||||
|
||||
-------------------------------------------------------------------
|
||||
Tue Oct 9 19:00:49 UTC 2018 - Dmitry Roshchin <dmitry_r@opensuse.org>
|
||||
|
||||
|
@ -18,8 +18,8 @@
|
||||
|
||||
%global flavor @BUILD_FLAVOR@%{nil}
|
||||
|
||||
%define _vers 0_3_3
|
||||
%define vers 0.3.3
|
||||
%define _vers 0_3_4
|
||||
%define vers 0.3.4
|
||||
%define pname openblas
|
||||
|
||||
%bcond_with ringdisabled
|
||||
|
Loading…
x
Reference in New Issue
Block a user