openblas/openblas-noexecstack.patch

29 lines
1.2 KiB
Diff
Raw Normal View History

- Update to version 0.3.14 common: * Fixed a race condition on thread shutdown in non-OpenMP builds * Fixed custom BUFFERSIZE option getting ignored in gmake builds * Fixed CMAKE compilation of the TRMM kernels for GENERIC platforms * Added CBLAS interfaces for CROTG, ZROTG, CSROT and ZDROT * Improved performance of OMATCOPY_RT across all platforms * Changed perl scripts to use env instead of a hardcoded /usr/bin/perl * Fixed potential misreading of the GCC compiler version in the build scripts * Fixed convergence problems in LAPACK complex GGEV/GGES (Reference-LAPACK #477) * Reduced the stacksize requirements for running the LAPACK testsuite (Reference-LAPACK #335) RISC V: * Fixed compilation on RISCV (missing entry in getarch) POWER: * Fixed compilation for DYNAMIC_ARCH with clang and with older gcc versions * Added support for compilation on FreeBSD/ppc64le * Added optimized POWER10 kernels for SSCAL, DSCAL, CSCAL, ZSCAL * Added optimized POWER10 kernels for SROT, DROT, CDOT, SASUM, DASUM * Improved SSWAP, DSWAP, CSWAP, ZSWAP performance on POWER10 * Improved SCOPY and CCOPY performance on POWER10 * Improved SGEMM and DGEMM performance on POWER10 * Added support for compilation with the NVIDIA HPC compiler x86_64: * Added an optimized bfloat16 GEMM kernel for Cooperlake * Added CPUID autodetection for Intel Rocket Lake and Tiger Lake cpus * Improved the performance of SASUM,DASUM,SROT,DROT on AMD Ryzen cpus * Added support for compilation with the NAG Fortran compiler * Fixed recognition of the AMD AOCC compiler * Fixed compilation for DYNAMIC_ARCH with clang on Windows * Added support for running the BLAS/CBLAS tests on Windows OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=120
2021-03-18 08:47:05 +00:00
Index: OpenBLAS-0.3.14/exports/Makefile
Accepting request 814610 from home:namtrac:branches:science - Update to version 0.3.10 common: * Improved thread locking behaviour in blas_server and parallel getrf * Imported bugfix 394 from LAPACK (spurious reference to "XERBL" due to overlong lines) * Imported bugfix 403 from LAPACK (compile option "recursive" required for correctness with Intel and PGI) * Imported bugfix 408 from LAPACK (wrong scaling in ZHEEQUB) * Imported bugfix 411 from LAPACK (infinite loop in LARGV/LARTG/LARTGP) * Fixed mismatches between BUFFERSIZE and GEMM_UNROLL parameters that could lead to crashes at large matrix sizes * Restored internal soname in dynamic libraries on FreeBSD and Dragonfly * Added API (openblas_setaffinity) to set thread affinity programmatically on Linux * Added initial infrastructure for half-precision floating point (bfloat16) support with a generic implementation of SHGEMM * Added CMAKE build system support for building the cblas_Xgemm3m functions * Fixed CMAKE support for building in a path with embedded spaces * Fixed CMAKE (non)handling of NO_EXPRECISION and MAX_STACK_ALLOC * Fixed GCC version detection in the Makefiles * Allowed overriding the names of AR, AS and LD in Makefile builds POWER: * fixed big-endian POWER8 ELFv2 builds on FreeBSD * Fixed GCC version checks and DYNAMIC_ARCH builds on POWER9 * Fixed CMAKE build support for POWER9 * fixed a potential race condition in the thread buffer allocation * Worked around LAPACK test failures on PPC G4 MIPS: * fixed a potential race condition in the thread buffer allocation OBS-URL: https://build.opensuse.org/request/show/814610 OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=92
2020-06-15 05:33:39 +00:00
===================================================================
- Update to version 0.3.14 common: * Fixed a race condition on thread shutdown in non-OpenMP builds * Fixed custom BUFFERSIZE option getting ignored in gmake builds * Fixed CMAKE compilation of the TRMM kernels for GENERIC platforms * Added CBLAS interfaces for CROTG, ZROTG, CSROT and ZDROT * Improved performance of OMATCOPY_RT across all platforms * Changed perl scripts to use env instead of a hardcoded /usr/bin/perl * Fixed potential misreading of the GCC compiler version in the build scripts * Fixed convergence problems in LAPACK complex GGEV/GGES (Reference-LAPACK #477) * Reduced the stacksize requirements for running the LAPACK testsuite (Reference-LAPACK #335) RISC V: * Fixed compilation on RISCV (missing entry in getarch) POWER: * Fixed compilation for DYNAMIC_ARCH with clang and with older gcc versions * Added support for compilation on FreeBSD/ppc64le * Added optimized POWER10 kernels for SSCAL, DSCAL, CSCAL, ZSCAL * Added optimized POWER10 kernels for SROT, DROT, CDOT, SASUM, DASUM * Improved SSWAP, DSWAP, CSWAP, ZSWAP performance on POWER10 * Improved SCOPY and CCOPY performance on POWER10 * Improved SGEMM and DGEMM performance on POWER10 * Added support for compilation with the NVIDIA HPC compiler x86_64: * Added an optimized bfloat16 GEMM kernel for Cooperlake * Added CPUID autodetection for Intel Rocket Lake and Tiger Lake cpus * Improved the performance of SASUM,DASUM,SROT,DROT on AMD Ryzen cpus * Added support for compilation with the NAG Fortran compiler * Fixed recognition of the AMD AOCC compiler * Fixed compilation for DYNAMIC_ARCH with clang on Windows * Added support for running the BLAS/CBLAS tests on Windows OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=120
2021-03-18 08:47:05 +00:00
--- OpenBLAS-0.3.14.orig/exports/Makefile
+++ OpenBLAS-0.3.14/exports/Makefile
- Update to version 0.3.11 common: * Reduced the default BLAS3_MEM_ALLOC_THRESHOLD (used as an upper limit for placing temporary arrays on the stack) to be compatible with a stack size of 1mb (as imposed by the JAVA runtime library) * Added mixed-precision dot function SBDOT and utility functions shstobf16, shdtobf16, sbf16tos and dbf16tod to convert between single or double precision float arrays and bfloat16 arrays * Fixed prototypes of LAPACK_?ggsvp and LAPACK_?ggsvd functions in lapack.h * Fixed underflow and rounding errors in LAPACK SLANV2 and DLANV2 (causing miscalculations in e.g. SHSEQR/DHSEQR, LAPACK issue #263) * Fixed workspace calculation in LAPACK ?GELQ (LAPACK issue #415) * Fixed several bugs in the LAPACK testsuite * Improved performance of TRMM and TRSM for certain problem sizes * Fixed infinite recursions and workspace miscalculations in ReLAPACK * CMAKE builds no longer require pkg-config for creating the .pc file * Makefile builds no longer misread NO_CBLAS=0 or NO_LAPACK=0 as enabling these options * Fixed detection of gfortran when invoked through an mpi wrapper * Improve thread reinitialization performance with OpenMP after a fork * Added support for building only the subset of the library required for a particular precision by specifying BUILD_SINGLE, BUILD_DOUBLE * Optional function name prefixes and suffixes are now correctly reflected in the generated cblas.h * Added CMAKE build support for the LAPACK and multithreading tests power: * Added optimized support for POWER10 * Added support for compiling for POWER8 in 32bit mode * Added support for compilation with LLVM/clang OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=102
2020-10-21 05:48:02 +00:00
@@ -179,6 +179,7 @@ else ifeq ($(F_COMPILER), FLANG)
Accepting request 814610 from home:namtrac:branches:science - Update to version 0.3.10 common: * Improved thread locking behaviour in blas_server and parallel getrf * Imported bugfix 394 from LAPACK (spurious reference to "XERBL" due to overlong lines) * Imported bugfix 403 from LAPACK (compile option "recursive" required for correctness with Intel and PGI) * Imported bugfix 408 from LAPACK (wrong scaling in ZHEEQUB) * Imported bugfix 411 from LAPACK (infinite loop in LARGV/LARTG/LARTGP) * Fixed mismatches between BUFFERSIZE and GEMM_UNROLL parameters that could lead to crashes at large matrix sizes * Restored internal soname in dynamic libraries on FreeBSD and Dragonfly * Added API (openblas_setaffinity) to set thread affinity programmatically on Linux * Added initial infrastructure for half-precision floating point (bfloat16) support with a generic implementation of SHGEMM * Added CMAKE build system support for building the cblas_Xgemm3m functions * Fixed CMAKE support for building in a path with embedded spaces * Fixed CMAKE (non)handling of NO_EXPRECISION and MAX_STACK_ALLOC * Fixed GCC version detection in the Makefiles * Allowed overriding the names of AR, AS and LD in Makefile builds POWER: * fixed big-endian POWER8 ELFv2 builds on FreeBSD * Fixed GCC version checks and DYNAMIC_ARCH builds on POWER9 * Fixed CMAKE build support for POWER9 * fixed a potential race condition in the thread buffer allocation * Worked around LAPACK test failures on PPC G4 MIPS: * fixed a potential race condition in the thread buffer allocation OBS-URL: https://build.opensuse.org/request/show/814610 OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=92
2020-06-15 05:33:39 +00:00
else
ifneq ($(C_COMPILER), LSB)
$(CC) $(CFLAGS) $(LDFLAGS) -shared -o ../$(LIBSONAME) \
+ -Wl,-z,noexecstack \
-Wl,--whole-archive $< -Wl,--no-whole-archive \
-Wl,-soname,$(INTERNALNAME) $(EXTRALIB)
$(CC) $(CFLAGS) $(LDFLAGS) -w -o linktest linktest.c ../$(LIBSONAME) $(FEXTRALIB) && echo OK.
- Update to version 0.3.11 common: * Reduced the default BLAS3_MEM_ALLOC_THRESHOLD (used as an upper limit for placing temporary arrays on the stack) to be compatible with a stack size of 1mb (as imposed by the JAVA runtime library) * Added mixed-precision dot function SBDOT and utility functions shstobf16, shdtobf16, sbf16tos and dbf16tod to convert between single or double precision float arrays and bfloat16 arrays * Fixed prototypes of LAPACK_?ggsvp and LAPACK_?ggsvd functions in lapack.h * Fixed underflow and rounding errors in LAPACK SLANV2 and DLANV2 (causing miscalculations in e.g. SHSEQR/DHSEQR, LAPACK issue #263) * Fixed workspace calculation in LAPACK ?GELQ (LAPACK issue #415) * Fixed several bugs in the LAPACK testsuite * Improved performance of TRMM and TRSM for certain problem sizes * Fixed infinite recursions and workspace miscalculations in ReLAPACK * CMAKE builds no longer require pkg-config for creating the .pc file * Makefile builds no longer misread NO_CBLAS=0 or NO_LAPACK=0 as enabling these options * Fixed detection of gfortran when invoked through an mpi wrapper * Improve thread reinitialization performance with OpenMP after a fork * Added support for building only the subset of the library required for a particular precision by specifying BUILD_SINGLE, BUILD_DOUBLE * Optional function name prefixes and suffixes are now correctly reflected in the generated cblas.h * Added CMAKE build support for the LAPACK and multithreading tests power: * Added optimized support for POWER10 * Added support for compiling for POWER8 in 32bit mode * Added support for compilation with LLVM/clang OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=102
2020-10-21 05:48:02 +00:00
@@ -207,6 +208,7 @@ else
../$(LIBSONAME) : ../$(LIBNAME).renamed linktest.c
endif
$(CC) $(CFLAGS) $(LDFLAGS) -shared -o ../$(LIBSONAME) \
+ -Wl,-z,noexecstack \
-Wl,--whole-archive $< -Wl,--no-whole-archive \
$(FEXTRALIB) $(EXTRALIB)
$(CC) $(CFLAGS) $(LDFLAGS) -w -o linktest linktest.c ../$(LIBSONAME) $(FEXTRALIB) && echo OK.
- Update to version 0.3.11 common: * Reduced the default BLAS3_MEM_ALLOC_THRESHOLD (used as an upper limit for placing temporary arrays on the stack) to be compatible with a stack size of 1mb (as imposed by the JAVA runtime library) * Added mixed-precision dot function SBDOT and utility functions shstobf16, shdtobf16, sbf16tos and dbf16tod to convert between single or double precision float arrays and bfloat16 arrays * Fixed prototypes of LAPACK_?ggsvp and LAPACK_?ggsvd functions in lapack.h * Fixed underflow and rounding errors in LAPACK SLANV2 and DLANV2 (causing miscalculations in e.g. SHSEQR/DHSEQR, LAPACK issue #263) * Fixed workspace calculation in LAPACK ?GELQ (LAPACK issue #415) * Fixed several bugs in the LAPACK testsuite * Improved performance of TRMM and TRSM for certain problem sizes * Fixed infinite recursions and workspace miscalculations in ReLAPACK * CMAKE builds no longer require pkg-config for creating the .pc file * Makefile builds no longer misread NO_CBLAS=0 or NO_LAPACK=0 as enabling these options * Fixed detection of gfortran when invoked through an mpi wrapper * Improve thread reinitialization performance with OpenMP after a fork * Added support for building only the subset of the library required for a particular precision by specifying BUILD_SINGLE, BUILD_DOUBLE * Optional function name prefixes and suffixes are now correctly reflected in the generated cblas.h * Added CMAKE build support for the LAPACK and multithreading tests power: * Added optimized support for POWER10 * Added support for compiling for POWER8 in 32bit mode * Added support for compilation with LLVM/clang OBS-URL: https://build.opensuse.org/package/show/science/openblas?expand=0&rev=102
2020-10-21 05:48:02 +00:00
@@ -226,6 +228,7 @@ ifeq ($(OSNAME), SunOS)
so : ../$(LIBSONAME)
$(CC) $(CFLAGS) $(LDFLAGS) -shared -o ../$(LIBSONAME) \
+ -Wl,-z,noexecstack \
-Wl,--whole-archive ../$(LIBNAME) -Wl,--no-whole-archive $(EXTRALIB)
$(CC) $(CFLAGS) $(LDFLAGS) -w -o linktest linktest.c ../$(LIBSONAME) $(FEXTRALIB) && echo OK.
rm -f linktest