forked from pool/onednn
Guillaume GARDET
a14ab9290a
- Update to 2.2.2, changes: * Fixed performance regression in fp32 forward inner product for shapes with number of output channels equal to 1 for processors with Intel AVX-512 support (714b1fd) * Fixed performance regression in forward convolutions with groups for processors with Intel AVX-512 support(3555d4a) * Removed -std=c++11 build flag for DPC++ headers (1fcb867) * Fixed buffer access in initializing workspace in RNN implementation on GPU (9b03091) * Fixed fix a bug in convolution with 1x1 kernel and mixed strides on processors with Intel AVX-512 support (d0b3e3f) * Used getauxval for Linux to get CPU features on for AArch64 systems (25c4cea) * Added -fp-model=precise build flag for DPC++ code (3e40e5e) * Fixed out-of-bounds writes in elementwise primitive on Intel Processor Graphics (bcf823c) - Fix build with Arm Compute Library: * onednn-1045.patch OBS-URL: https://build.opensuse.org/request/show/895561 OBS-URL: https://build.opensuse.org/package/show/science:machinelearning/onednn?expand=0&rev=8
22 lines
796 B
Diff
22 lines
796 B
Diff
From a94acd4e2dfaf51552dd2a60b059df1c1f14e452 Mon Sep 17 00:00:00 2001
|
|
From: Alexandre Truong <alexandre.truong@arm.com>
|
|
Date: Wed, 28 Apr 2021 10:32:35 +0100
|
|
Subject: [PATCH] cpu: aarch64: missing include for arm_compute::Scheduler
|
|
|
|
---
|
|
src/cpu/aarch64/acl_indirect_gemm_convolution.hpp | 1 +
|
|
1 file changed, 1 insertion(+)
|
|
|
|
diff --git a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
|
index 86d2bed73..040311f8c 100644
|
|
--- a/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
|
+++ b/src/cpu/aarch64/acl_indirect_gemm_convolution.hpp
|
|
@@ -26,6 +26,7 @@
|
|
|
|
#include "arm_compute/runtime/FunctionDescriptors.h"
|
|
#include "arm_compute/runtime/NEON/NEFunctions.h"
|
|
+#include "arm_compute/runtime/Scheduler.h"
|
|
|
|
namespace dnnl {
|
|
namespace impl {
|