forked from pool/onednn

Accepting request 895561 from home:Guillaume_G:branches:science:machinelearning

- Update to 2.2.2, changes:
  * Fixed performance regression in fp32 forward inner product for
  shapes with number of output channels equal to 1 for processors
  with Intel AVX-512 support (714b1fd)
  * Fixed performance regression in forward convolutions with groups
  for processors with Intel AVX-512 support (3555d4a)
  * Removed -std=c++11 build flag for DPC++ headers (1fcb867)
  * Fixed buffer access in initializing workspace in RNN
  implementation on GPU (9b03091)
  * Fixed a bug in convolution with 1x1 kernel and mixed
  strides on processors with Intel AVX-512 support (d0b3e3f)
  * Used getauxval for Linux to get CPU features on AArch64
  systems (25c4cea)
  * Added -fp-model=precise build flag for DPC++ code (3e40e5e)
  * Fixed out-of-bounds writes in elementwise primitive on
  Intel Processor Graphics (bcf823c)
- Fix build with Arm Compute Library:
  * onednn-1045.patch

OBS-URL: https://build.opensuse.org/request/show/895561
OBS-URL: https://build.opensuse.org/package/show/science:machinelearning/onednn?expand=0&rev=8
Guillaume GARDET 2021-05-27 06:10:38 +00:00 committed by Git OBS Bridge
parent 6531f11827
commit a14ab9290a
5 changed files with 49 additions and 4 deletions

onednn-1045.patch Normal file

@ -0,0 +1,21 @@
From a94acd4e2dfaf51552dd2a60b059df1c1f14e452 Mon Sep 17 00:00:00 2001
From: Alexandre Truong <alexandre.truong@arm.com>
Date: Wed, 28 Apr 2021 10:32:35 +0100
Subject: [PATCH] cpu: aarch64: missing include for arm_compute::Scheduler
---
src/cpu/aarch64/acl_indirect_gemm_convolution.hpp | 1 +
1 file changed, 1 insertion(+)
diff --git a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
index 86d2bed73..040311f8c 100644
--- a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
+++ b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
@@ -26,6 +26,7 @@
#include "arm_compute/runtime/FunctionDescriptors.h"
#include "arm_compute/runtime/NEON/NEFunctions.h"
+#include "arm_compute/runtime/Scheduler.h"
namespace dnnl {
namespace impl {


@ -1,3 +0,0 @@
version https://git-lfs.github.com/spec/v1
oid sha256:3faf3b7811dd37835169a9d5c57f17fd591f062029851186f9b11db117b9d1d9
size 9533694

onednn-2.2.2.tar.gz Normal file

@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:01e651eb20ca1943bbdf756804fd02d13e5ff3c84b89e3aa5d40abdeb5bb07ee
size 9533876
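
The three lines above are a Git LFS pointer, not the tarball itself: `oid` carries the SHA-256 of the real file and `size` its length in bytes. As a minimal sketch, assuming the tarball has been fetched separately, a downloaded file could be checked against the pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or OBS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into (sha256_hex, size_in_bytes)."""
    # Each pointer line is "<key> <value>", e.g. "size 9533876".
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, oid = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    return oid, int(fields["size"])


def matches_pointer(data, pointer_text):
    """Return True if the bytes in `data` match the pointer's size and hash."""
    oid, size = parse_lfs_pointer(pointer_text)
    return len(data) == size and hashlib.sha256(data).hexdigest() == oid


# Pointer content copied from the hunk above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:01e651eb20ca1943bbdf756804fd02d13e5ff3c84b89e3aa5d40abdeb5bb07ee
size 9533876
"""
```

With the tarball on disk, `matches_pointer(open("onednn-2.2.2.tar.gz", "rb").read(), POINTER)` would be expected to return `True` for an intact download; the local file path here is an assumption.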


@ -1,3 +1,25 @@
-------------------------------------------------------------------
Wed May 26 13:29:27 UTC 2021 - Guillaume GARDET <guillaume.gardet@opensuse.org>
- Update to 2.2.2, changes:
* Fixed performance regression in fp32 forward inner product for
shapes with number of output channels equal to 1 for processors
with Intel AVX-512 support (714b1fd)
* Fixed performance regression in forward convolutions with groups
for processors with Intel AVX-512 support (3555d4a)
* Removed -std=c++11 build flag for DPC++ headers (1fcb867)
* Fixed buffer access in initializing workspace in RNN
implementation on GPU (9b03091)
* Fixed a bug in convolution with 1x1 kernel and mixed
strides on processors with Intel AVX-512 support (d0b3e3f)
* Used getauxval for Linux to get CPU features on AArch64
systems (25c4cea)
* Added -fp-model=precise build flag for DPC++ code (3e40e5e)
* Fixed out-of-bounds writes in elementwise primitive on
Intel Processor Graphics (bcf823c)
- Fix build with Arm Compute Library:
* onednn-1045.patch
-------------------------------------------------------------------
Tue Apr 13 07:53:16 UTC 2021 - Guillaume GARDET <guillaume.gardet@opensuse.org>


@ -31,12 +31,14 @@
%define libname libdnnl2
Name: onednn
-Version: 2.2.1
+Version: 2.2.2
Release: 0
Summary: Intel(R) Math Kernel Library for Deep Neural Networks
License: Apache-2.0
URL: https://01.org/onednn
Source0: https://github.com/oneapi-src/oneDNN/archive/v%{version}/%{name}-%{version}.tar.gz
+# PATCH-FIX-UPSTREAM - https://github.com/oneapi-src/oneDNN/pull/1045
+Patch1: onednn-1045.patch
BuildRequires: cmake
BuildRequires: doxygen
BuildRequires: fdupes