Accepting request 629943 from home:namtrac:branches:science
- Update to version 0.3.2
  common:
  * Fixes for regressions caused by the rewrite of the thread
    initialization code in 0.3.1
  x86_64:
  * Added autodetection of AMD Ryzen 2
  * Fixed build with older versions of MSVC
  power:
  * Fixed cpu autodetection for the BSDs
  mips64:
  * Fixed utest errors in AXPY, DSDOT, ROT and SWAP
- Version 0.3.1
  common:
  * Rewritten thread initialization code with significantly
    reduced overhead
  * Added CBLAS interfaces to the IxAMIN BLAS extension functions
  * Fixed the lapack-test target
  * CMAKE builds now create an OpenBLASConfig.cmake file
  * ZAXPY now uses a single thread for small input sizes
  * The LAPACK code was updated from Reference-LAPACK/lapack#253
  power:
  * Corrected CROT and ZROT behaviour with zero INC_X
  armv7:
  * Corrected xDOT behaviour with zero INC_X or INC_Y
  x86_64:
  * Retired some older targets of DYNAMIC_ARCH builds to a
    new option DYNAMIC_OLDER, this affects PENRYN,DUNNINGTON,
    OPTERON,OPTERON_SSE3,BOBCAT,ATOM and NANO (which will still
    be supported via the slower PRESCOTT kernels when this option
    is not set)

OBS-URL: https://build.opensuse.org/request/show/629943
OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=67
parent 0deb6f1a78
commit 868106f937
3
OpenBLAS-0.3.2.tar.gz
Normal file
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e8ba64f6b103c511ae13736100347deb7121ba9b41ba82052b1a018a65c0cb15
+size 11812530
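The three lines added for OpenBLAS-0.3.2.tar.gz are a Git LFS pointer rather than the tarball itself: they record the pointer spec version, the SHA-256 digest of the real file, and its size in bytes. As a minimal sketch of how a downloaded tarball could be checked against such a pointer, assuming Python is available; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names introduced here for illustration and are not part of Git LFS or the OBS tooling.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict.

    Hypothetical helper: only the three keys seen in this commit
    (version, oid, size) are expected.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the recorded size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so large tarballs are not loaded at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer content copied from the hunk above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e8ba64f6b103c511ae13736100347deb7121ba9b41ba82052b1a018a65c0cb15
size 11812530
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

With a local copy of the tarball, `matches_pointer("OpenBLAS-0.3.2.tar.gz", pointer)` would report whether its size and digest agree with the pointer. The size is compared first because it is cheap and rules out truncated downloads before any hashing is done.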
@@ -1,13 +0,0 @@
-Index: OpenBLAS-0.2.8/ctest/c_xerbla.c
-===================================================================
---- OpenBLAS-0.2.8.orig/ctest/c_xerbla.c
-+++ OpenBLAS-0.2.8/ctest/c_xerbla.c
-@@ -131,7 +131,7 @@ void F77_xerbla(char *srname, void *vinf
- int BLASFUNC(xerbla)(char *name, blasint *info, blasint length) {
-
- F77_xerbla(name, info);
--
-+ return 0;
- };
-
-
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5ef38b15d9c652985774869efd548b8e3e972e1e99475c673b25537ed7bcf394
-size 11637301
@@ -1,3 +1,102 @@
+-------------------------------------------------------------------
+Fri Aug 17 12:56:04 UTC 2018 - idonmez@suse.com
+
+- Update to version 0.3.2
+common:
+* Fixes for regressions caused by the rewrite of the thread
+initialization code in 0.3.1
+x86_64:
+* Added autodetection of AMD Ryzen 2
+* Fixed build with older versions of MSVC
+power:
+* Fixed cpu autodetection for the BSDs
+mips64:
+* Fixed utest errors in AXPY, DSDOT, ROT and SWAP
+- Version 0.3.1
+common:
+* Rewritten thread initialization code with significantly
+reduced overhead
+* Added CBLAS interfaces to the IxAMIN BLAS extension functions
+* Fixed the lapack-test target
+* CMAKE builds now create an OpenBLASConfig.cmake file
+* ZAXPY now uses a single thread for small input sizes
+* The LAPACK code was updated from Reference-LAPACK/lapack#253
+power:
+* Corrected CROT and ZROT behaviour with zero INC_X
+armv7:
+* Corrected xDOT behaviour with zero INC_X or INC_Y
+x86_64:
+* Retired some older targets of DYNAMIC_ARCH builds to a
+new option DYNAMIC_OLDER, this affects PENRYN,DUNNINGTON,
+OPTERON,OPTERON_SSE3,BOBCAT,ATOM and NANO (which will still
+be supported via the slower PRESCOTT kernels when this option
+is not set)
+* Added an option DYNAMIC_LIST that (used in conjunction with
+DYNAMIC_ARCH) allows to specify the list of x86_64 targets to
+include. Any target not on the list will be supported by
+the Sandybridge or Nehalem kernels if available, or by Prescott.
+* Improved SWITCH_RATIO on Haswell for increased GEMM throughput
+* Added initial support for Intel Skylake X, including an AVX512
+SGEMM kernel
+* Added autodetection of Intel Cannon Lake series as Skylake X
+* Added a default L2 cache size for hypervisors that return zero
+here (Chromebook)
+* Fixed a name clash with recent Windows10 headers that broke the
+build with (at least) recent mingw from MSYS2
+* Fixed a link error in mixed clang/gfortran builds with OpenMP
+* Updated the OSX deployment target to 10.8
+* Switched on parallel make for builds on MS Windows by default
+x86:
+* Fixed SSWAP and DSWAP behaviour with zero INC_X and INC_Y
+- Version 0.3.0
+common:
+* Fixed some more thread race and locking bugs
+* Added preliminary support for calling an OpenMP build of the
+library from multiple threads
+* Removed performance impact of thread locks added in 0.2.20
+on OpenMP code
+* General code cleanup
+* Optimized DSDOT implementation
+* Improved thread distribution for GEMM
+* Corrected IMATCOPY/OMATCOPY implementation
+* Fixed out-of-bounds accesses in the multithreaded xBMV/xPMV
+and SYMV implementations
+* Cmake build improvements
+* pkgconfig file now contains build options
+* openblas_get_config() now reports USE_OPENMP and NUM_THREADS
+settings used for the build
+* Corrections and improvements for systems with more than 64 cpus
+* LAPACK code updated to 3.8.0 including later fixes
+* Added ReLAPACK, a recursive implementation of several LAPACK functions
+* Rewrote ROTMG to handle cases that the netlib code failed to address
+* Disabled (broken) multithreading code for xTRMV
+* corrected prototypes of complex CBLAS functions to make our
+cblas.h match the generally accepted standard
+* Shared memory access failures on startup are now handled more gracefully
+* Restored utests from earlier releases (and made them pass on all
+affected systems)
+sparc:
+* several fixes for cpu autodetection
+arm:
+* Added support for CortexA53 and A72
+* Added autodetection for ThunderX2T99
+* Made most optimized kernels the default for generic ARMv8 targets
+x86_64:
+* Parallelized DDOT kernel for Haswell
+* Changed alignment directives in assembly kernels to boost performance on OSX
+* Fixed register handling in the GEMV microkernels (bug exposed by gcc7)
+* Added support for building on OpenBSD and Dragonfly
+* Updated compiler options to work with Intel release 2018
+* Support fully optimized build with clang/flang on Microsoft Windows
+* Fixed building on AIX
+ibm z:
+* added optimized BLAS 1/2 functions
+mips:
+* Fixed cpu autodetection helper code
+* Added mips32 1004K cpu (Mediatek MT7621 and similar SoC)
+* Added mips64 I6500 cpu
+- Remove c_xerbla_no-void-return.patch: fixed upstream.
+
 -------------------------------------------------------------------
 Tue Jan 30 18:19:33 CET 2018 - ro@suse.de
 
@@ -18,8 +18,8 @@
 
 %global flavor @BUILD_FLAVOR@%{nil}
 
-%define _vers 0_2_20
-%define vers 0.2.20
+%define _vers 0_3_2
+%define vers 0.3.2
 %define pname openblas
 
 %bcond_with ringdisabled
@@ -122,17 +122,15 @@ Summary: An optimized BLAS library based on GotoBLAS2
 License: BSD-3-Clause
 Group: Productivity/Scientific/Math
 Url: http://www.openblas.net
-Source0: https://github.com/xianyi/OpenBLAS/archive/v%{version}.tar.gz#/%{pname}-%{version}.tar.gz
+Source0: https://github.com/xianyi/OpenBLAS/archive/v%{version}.tar.gz#/OpenBLAS-%{version}.tar.gz
 Source1: README.SUSE
 Source2: README.HPC.SUSE
-# PATCH-FIX-UPSTREAM c_xerbla_no-void-return.patch
-Patch1: c_xerbla_no-void-return.patch
 # PATCH-FIX-UPSTREAM openblas-noexecstack.patch
-Patch2: openblas-noexecstack.patch
+Patch1: openblas-noexecstack.patch
 # PATCH-FIX-UPSTREADM fix-arm64-cpuid-return.patch
-Patch4: fix-arm64-cpuid-return.patch
+Patch2: fix-arm64-cpuid-return.patch
 # PATCH port
-Patch5: openblas-s390.patch
+Patch3: openblas-s390.patch
 
 BuildRoot: %{_tmppath}/%{name}-%{version}-build
 
@@ -238,8 +236,7 @@ This package contains headers for OpenBLAS.
 %setup -q -n OpenBLAS-%{version}
 %patch1 -p1
 %patch2 -p1
-%patch4 -p1
-%patch5 -p1
+%patch3 -p1
 %ifarch s390
 sed -i -e "s@m32@m31@" Makefile.system
 %endif
@@ -450,7 +447,8 @@ fi
 %defattr(-,root,root,-)
 %{p_libdir}/lib%{libname}.so
 %if %{with hpc}
-%doc Changelog.txt GotoBLAS* LICENSE README.md README.HPC.SUSE
+%license LICENSE
+%doc Changelog.txt GotoBLAS* README.md README.HPC.SUSE
 %hpc_pkgconfig_file
 %{p_cmakedir}/
 %{p_includedir}/
@@ -464,7 +462,8 @@ fi
 %if 0%{?build_devel}
 %files -n %{pname}-devel
 %defattr(-,root,root,-)
-%doc Changelog.txt GotoBLAS* LICENSE README.md README.SUSE
+%license LICENSE
+%doc Changelog.txt GotoBLAS* README.md README.SUSE
 %{p_libdir}/libopenblas.so
 %dir %{p_libdir}/cmake
 %{p_cmakedir}/