* General: + Reworked the unfinished implementation of `HUGETLB` from GotoBLAS for allocating huge memory pages as buffers on suitable systems. + Changed the unfinished implementation of `GEMM3M` for the generic target on all architectures to at least forward to regular GEMM. + Improved multithreaded `GEMM` performance for large non-skinny matrices. + Improved BLAS3 performance on larger multicore systems through improved parallelism. + Improved performance of the initial memory allocation by reducing locking overhead. + Improved performance of `GBMV` at small problem sizes by introducing a size barrier for the switch to multithreading. + Added an implementation of the `CBLAS_GEMM_BATCH` extension. + Fixed corner cases involving the handling of NAN and INFINITY arguments in `?SCAL` on all architectures. + Fixed NAN handling and potential accuracy issues in compilations with Intel ICX by supplying a suitable fp-model option by default. + It is now possible to register a callback function that replaces the built-in support for multithreading with an external backend like TBB (`openblas_set_threads_callback_function`). + Fixed potential duplication of suffixes in shared library naming. + Improved C compiler detection by the build system to tolerate more naming variants for gcc builds. + Fixed an unnecessary dependency of the utest on CBLAS. + Fixed spurious error reports from the BLAS extensions `utest`. + Fixed unwanted invocation of the `GEMM3M` tests in cross- compilation. + Fixed a flaw in the makefile build that could lead to the OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=184
40 lines
984 B
Diff
40 lines
984 B
Diff
From: Egbert Eich <eich@suse.com>
|
|
Date: Wed Nov 30 20:14:53 2022 +0100
|
|
Subject: Handle s390 correctly
|
|
Patch-mainline: Not yet
|
|
Git-commit: f1761f16899756e4da71df35b82772bcbcc33460
|
|
References:
|
|
|
|
Signed-off-by: Egbert Eich <eich@suse.com>
|
|
Signed-off-by: Egbert Eich <eich@suse.de>
|
|
---
|
|
c_check | 2 +-
|
|
ctest.c | 2 +-
|
|
2 files changed, 2 insertions(+), 2 deletions(-)
|
|
diff --git a/c_check b/c_check
|
|
index 01d4f4a7..7db8bb42 100755
|
|
--- a/c_check
|
|
+++ b/c_check
|
|
@@ -11,7 +11,7 @@ case "$hostarch" in
|
|
arm*) [ "$hostarch" = "arm64" ] || hostarch='arm' ;;
|
|
aarch64) hostarch=arm64 ;;
|
|
powerpc*|ppc*) hostarch=power ;;
|
|
- s390x) hostarch=zarch ;;
|
|
+ s390*) hostarch=zarch ;;
|
|
esac
|
|
|
|
makefile="$1"
|
|
diff --git a/ctest.c b/ctest.c
|
|
index df628b1d..f09571b1 100644
|
|
--- a/ctest.c
|
|
+++ b/ctest.c
|
|
@@ -121,7 +121,7 @@ ARCH_X86_64
|
|
ARCH_POWER
|
|
#endif
|
|
|
|
-#if defined(__s390x__) || defined(__zarch__)
|
|
+#if defined(__s390x__) || defined(__s390__) || defined(__zarch__)
|
|
ARCH_ZARCH
|
|
#endif
|
|
|