forked from pool/onednn
Guillaume GARDET
a14ab9290a
- Update to 2.2.2, changes:
  * Fixed performance regression in fp32 forward inner product for shapes with number of output channels equal to 1 for processors with Intel AVX-512 support (714b1fd)
  * Fixed performance regression in forward convolutions with groups for processors with Intel AVX-512 support (3555d4a)
  * Removed -std=c++11 build flag for DPC++ headers (1fcb867)
  * Fixed buffer access in initializing workspace in RNN implementation on GPU (9b03091)
  * Fixed a bug in convolution with 1x1 kernel and mixed strides on processors with Intel AVX-512 support (d0b3e3f)
  * Used getauxval for Linux to get CPU features on AArch64 systems (25c4cea)
  * Added -fp-model=precise build flag for DPC++ code (3e40e5e)
  * Fixed out-of-bounds writes in elementwise primitive on Intel Processor Graphics (bcf823c)
- Fix build with Arm Compute Library:
  * onednn-1045.patch

OBS-URL: https://build.opensuse.org/request/show/895561
OBS-URL: https://build.opensuse.org/package/show/science:machinelearning/onednn?expand=0&rev=8

||
---|---|---|
_constraints | ||
.gitattributes | ||
.gitignore | ||
onednn-2.2.2.tar.gz | ||
onednn-1045.patch | ||
onednn.changes | ||
onednn.spec |