forked from pool/onednn
Accepting request 895561 from home:Guillaume_G:branches:science:machinelearning
- Update to 2.2.2, changes: * Fixed performance regression in fp32 forward inner product for shapes with number of output channels equal to 1 for processors with Intel AVX-512 support (714b1fd) * Fixed performance regression in forward convolutions with groups for processors with Intel AVX-512 support(3555d4a) * Removed -std=c++11 build flag for DPC++ headers (1fcb867) * Fixed buffer access in initializing workspace in RNN implementation on GPU (9b03091) * Fixed fix a bug in convolution with 1x1 kernel and mixed strides on processors with Intel AVX-512 support (d0b3e3f) * Used getauxval for Linux to get CPU features on for AArch64 systems (25c4cea) * Added -fp-model=precise build flag for DPC++ code (3e40e5e) * Fixed out-of-bounds writes in elementwise primitive on Intel Processor Graphics (bcf823c) - Fix build with Arm Compute Library: * onednn-1045.patch OBS-URL: https://build.opensuse.org/request/show/895561 OBS-URL: https://build.opensuse.org/package/show/science:machinelearning/onednn?expand=0&rev=8
This commit is contained in:
parent
6531f11827
commit
a14ab9290a
21
onednn-1045.patch
Normal file
21
onednn-1045.patch
Normal file
@ -0,0 +1,21 @@
|
||||
From a94acd4e2dfaf51552dd2a60b059df1c1f14e452 Mon Sep 17 00:00:00 2001
|
||||
From: Alexandre Truong <alexandre.truong@arm.com>
|
||||
Date: Wed, 28 Apr 2021 10:32:35 +0100
|
||||
Subject: [PATCH] cpu: aarch64: missing include for arm_compute::Scheduler
|
||||
|
||||
---
|
||||
src/cpu/aarch64/acl_indirect_gemm_convolution.hpp | 1 +
|
||||
1 file changed, 1 insertion(+)
|
||||
|
||||
diff --git a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
||||
index 86d2bed73..040311f8c 100644
|
||||
--- a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
||||
+++ b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
||||
@@ -26,6 +26,7 @@
|
||||
|
||||
#include "arm_compute/runtime/FunctionDescriptors.h"
|
||||
#include "arm_compute/runtime/NEON/NEFunctions.h"
|
||||
+#include "arm_compute/runtime/Scheduler.h"
|
||||
|
||||
namespace dnnl {
|
||||
namespace impl {
|
@ -1,3 +0,0 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:3faf3b7811dd37835169a9d5c57f17fd591f062029851186f9b11db117b9d1d9
|
||||
size 9533694
|
3
onednn-2.2.2.tar.gz
Normal file
3
onednn-2.2.2.tar.gz
Normal file
@ -0,0 +1,3 @@
|
||||
version https://git-lfs.github.com/spec/v1
|
||||
oid sha256:01e651eb20ca1943bbdf756804fd02d13e5ff3c84b89e3aa5d40abdeb5bb07ee
|
||||
size 9533876
|
@ -1,3 +1,25 @@
|
||||
-------------------------------------------------------------------
|
||||
Wed May 26 13:29:27 UTC 2021 - Guillaume GARDET <guillaume.gardet@opensuse.org>
|
||||
|
||||
- Update to 2.2.2, changes:
|
||||
* Fixed performance regression in fp32 forward inner product for
|
||||
shapes with number of output channels equal to 1 for processors
|
||||
with Intel AVX-512 support (714b1fd)
|
||||
* Fixed performance regression in forward convolutions with groups
|
||||
for processors with Intel AVX-512 support(3555d4a)
|
||||
* Removed -std=c++11 build flag for DPC++ headers (1fcb867)
|
||||
* Fixed buffer access in initializing workspace in RNN
|
||||
implementation on GPU (9b03091)
|
||||
* Fixed fix a bug in convolution with 1x1 kernel and mixed
|
||||
strides on processors with Intel AVX-512 support (d0b3e3f)
|
||||
* Used getauxval for Linux to get CPU features on for AArch64
|
||||
systems (25c4cea)
|
||||
* Added -fp-model=precise build flag for DPC++ code (3e40e5e)
|
||||
* Fixed out-of-bounds writes in elementwise primitive on
|
||||
Intel Processor Graphics (bcf823c)
|
||||
- Fix build with Arm Compute Library:
|
||||
* onednn-1045.patch
|
||||
|
||||
-------------------------------------------------------------------
|
||||
Tue Apr 13 07:53:16 UTC 2021 - Guillaume GARDET <guillaume.gardet@opensuse.org>
|
||||
|
||||
|
@ -31,12 +31,14 @@
|
||||
|
||||
%define libname libdnnl2
|
||||
Name: onednn
|
||||
Version: 2.2.1
|
||||
Version: 2.2.2
|
||||
Release: 0
|
||||
Summary: Intel(R) Math Kernel Library for Deep Neural Networks
|
||||
License: Apache-2.0
|
||||
URL: https://01.org/onednn
|
||||
Source0: https://github.com/oneapi-src/oneDNN/archive/v%{version}/%{name}-%{version}.tar.gz
|
||||
# PATCH-FIX-UPSTREAM - https://github.com/oneapi-src/oneDNN/pull/1045
|
||||
Patch1: onednn-1045.patch
|
||||
BuildRequires: cmake
|
||||
BuildRequires: doxygen
|
||||
BuildRequires: fdupes
|
||||
|
Loading…
Reference in New Issue
Block a user