common: * Fixed a race condition on thread shutdown in non-OpenMP builds * Fixed custom BUFFERSIZE option getting ignored in gmake builds * Fixed CMAKE compilation of the TRMM kernels for GENERIC platforms * Added CBLAS interfaces for CROTG, ZROTG, CSROT and ZDROT * Improved performance of OMATCOPY_RT across all platforms * Changed perl scripts to use env instead of a hardcoded /usr/bin/perl * Fixed potential misreading of the GCC compiler version in the build scripts * Fixed convergence problems in LAPACK complex GGEV/GGES (Reference-LAPACK #477) * Reduced the stacksize requirements for running the LAPACK testsuite (Reference-LAPACK #335) RISC V: * Fixed compilation on RISCV (missing entry in getarch) POWER: * Fixed compilation for DYNAMIC_ARCH with clang and with older gcc versions * Added support for compilation on FreeBSD/ppc64le * Added optimized POWER10 kernels for SSCAL, DSCAL, CSCAL, ZSCAL * Added optimized POWER10 kernels for SROT, DROT, CDOT, SASUM, DASUM * Improved SSWAP, DSWAP, CSWAP, ZSWAP performance on POWER10 * Improved SCOPY and CCOPY performance on POWER10 * Improved SGEMM and DGEMM performance on POWER10 * Added support for compilation with the NVIDIA HPC compiler x86_64: * Added an optimized bfloat16 GEMM kernel for Cooperlake * Added CPUID autodetection for Intel Rocket Lake and Tiger Lake cpus * Improved the performance of SASUM,DASUM,SROT,DROT on AMD Ryzen cpus * Added support for compilation with the NAG Fortran compiler * Fixed recognition of the AMD AOCC compiler * Fixed compilation for DYNAMIC_ARCH with clang on Windows * Added support for running the BLAS/CBLAS tests on Windows OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=120
40 lines
1.3 KiB
Diff
40 lines
1.3 KiB
Diff
Index: OpenBLAS-0.3.14/c_check
|
|
===================================================================
|
|
--- OpenBLAS-0.3.14.orig/c_check
|
|
+++ OpenBLAS-0.3.14/c_check
|
|
@@ -12,7 +12,7 @@ $hostarch = "x86_64" if ($hostarch eq "a
|
|
$hostarch = "arm" if ($hostarch ne "arm64" && $hostarch =~ /^arm.*/);
|
|
$hostarch = "arm64" if ($hostarch eq "aarch64");
|
|
$hostarch = "power" if ($hostarch =~ /^(powerpc|ppc).*/);
|
|
-$hostarch = "zarch" if ($hostarch eq "s390x");
|
|
+$hostarch = "zarch" if ($hostarch =~ /^s390/);
|
|
|
|
#$tmpf = new File::Temp( UNLINK => 1 );
|
|
$binary = $ENV{"BINARY"};
|
|
Index: OpenBLAS-0.3.14/ctest.c
|
|
===================================================================
|
|
--- OpenBLAS-0.3.14.orig/ctest.c
|
|
+++ OpenBLAS-0.3.14/ctest.c
|
|
@@ -117,7 +117,7 @@ ARCH_X86_64
|
|
ARCH_POWER
|
|
#endif
|
|
|
|
-#if defined(__s390x__) || defined(__zarch__)
|
|
+#if defined(__s390x__) || defined(__s390__) || defined(__zarch__)
|
|
ARCH_ZARCH
|
|
#endif
|
|
|
|
Index: OpenBLAS-0.3.14/getarch.c
|
|
===================================================================
|
|
--- OpenBLAS-0.3.14.orig/getarch.c
|
|
+++ OpenBLAS-0.3.14/getarch.c
|
|
@@ -1333,7 +1333,7 @@ USE OF THIS SOFTWARE, EVEN IF ADVISED OF
|
|
#define OPENBLAS_SUPPORTED
|
|
#endif
|
|
|
|
-#if defined(__zarch__) || defined(__s390x__)
|
|
+#if defined(__zarch__) || defined(__s390x__) || defined(__s390__)
|
|
#define ZARCH
|
|
#include "cpuid_zarch.c"
|
|
#define OPENBLAS_SUPPORTED
|