forked from pool/htscodecs
Accepting request 1136375 from home:StefanBruens:branches:science
- Update to 1.6.0: * Changes + Replaced the rANS codec SIMD gathers with simulated gathers via scalar memory fetches. This helps AMD Zen4, but importantly it also fixes a disastrous performance regression caused by Intel's DownFall microcode fix. + There is an impact on pre-DownFall speeds, but we should focus on patched CPUs as a priority. + A small speed up to the rans_F_to_s3 function used by order-0 rans decode. + Small speed up to SIMD rans32x16 order-1 encoder by reducing cache misses. Also sped up the rans4x8 order-1 encoder, particularly on AMD Zen4. + Now supports building with "zig cc" * Bug fixes + Improve robustness of name tokeniser when given non 7-bit ASCII and on machines where "char" defaults to unsigned. + Also fixed a 1 byte buffer read-overrun in name tokeniser. + Fix name tokeniser encoder failure with some duplicated streams. + Fixed rans_set_cpu to work multiple times, as well as reinstating the ability to change decode and encode side independently (accidentally lost in commit 958032c). No effect on usage, but it improves the test coverage. + Added a round-trip fuzz tester to test the ability to encode. The old fuzz testing was decode streams only. + Fixed bounds checking in rans_uncompress_O0_32x16_avx2, fixing buffer read overruns. + Removed undefined behaviour in transpose_and_copy(), fixing zig cc builds. OBS-URL: https://build.opensuse.org/request/show/1136375 OBS-URL: https://build.opensuse.org/package/show/science/htscodecs?expand=0&rev=7
This commit is contained in:
parent
edd8dd578a
commit
77314a0d53
@ -1,3 +0,0 @@
|
|||||||
version https://git-lfs.github.com/spec/v1
|
|
||||||
oid sha256:34def7341d01a374583d61c9012a80f6b40af078b11bfcaacba1a2412e34dab3
|
|
||||||
size 3498542
|
|
3
htscodecs-1.6.0.tar.gz
Normal file
3
htscodecs-1.6.0.tar.gz
Normal file
@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:4b4abb6dfa192f179386c1d49256892a134297ecb4dd25ad4c346e9dde1108c0
|
||||||
|
size 3715332
|
@ -1,3 +1,37 @@
|
|||||||
|
-------------------------------------------------------------------
|
||||||
|
Wed Jan 3 01:32:19 UTC 2024 - Stefan Brüns <stefan.bruens@rwth-aachen.de>
|
||||||
|
|
||||||
|
- Update to 1.6.0:
|
||||||
|
* Changes
|
||||||
|
+ Replaced the rANS codec SIMD gathers with simulated gathers
|
||||||
|
via scalar memory fetches. This helps AMD Zen4, but
|
||||||
|
importantly it also fixes a disastrous performance regression
|
||||||
|
caused by Intel's DownFall microcode fix.
|
||||||
|
+ There is an impact on pre-DownFall speeds, but we should
|
||||||
|
focus on patched CPUs as a priority.
|
||||||
|
+ A small speed up to the rans_F_to_s3 function used by
|
||||||
|
order-0 rans decode.
|
||||||
|
+ Small speed up to SIMD rans32x16 order-1 encoder by reducing
|
||||||
|
cache misses. Also sped up the rans4x8 order-1 encoder,
|
||||||
|
particularly on AMD Zen4.
|
||||||
|
+ Now supports building with "zig cc"
|
||||||
|
* Bug fixes
|
||||||
|
+ Improve robustness of name tokeniser when given non 7-bit
|
||||||
|
ASCII and on machines where "char" defaults to unsigned.
|
||||||
|
+ Also fixed a 1 byte buffer read-overrun in name tokeniser.
|
||||||
|
+ Fix name tokeniser encoder failure with some duplicated
|
||||||
|
streams.
|
||||||
|
+ Fixed rans_set_cpu to work multiple times, as well as
|
||||||
|
reinstating the ability to change decode and encode side
|
||||||
|
independently (accidentally lost in commit 958032c). No
|
||||||
|
effect on usage, but it improves the test coverage.
|
||||||
|
+ Added a round-trip fuzz tester to test the ability to
|
||||||
|
encode. The old fuzz testing was decode streams only.
|
||||||
|
+ Fixed bounds checking in rans_uncompress_O0_32x16_avx2,
|
||||||
|
fixing buffer read overruns.
|
||||||
|
+ Removed undefined behaviour in transpose_and_copy(),
|
||||||
|
fixing zig cc builds.
|
||||||
|
|
||||||
-------------------------------------------------------------------
|
-------------------------------------------------------------------
|
||||||
Thu May 4 14:47:16 UTC 2023 - Andrea Manzini <andrea.manzini@suse.com>
|
Thu May 4 14:47:16 UTC 2023 - Andrea Manzini <andrea.manzini@suse.com>
|
||||||
|
|
||||||
|
@ -1,7 +1,7 @@
|
|||||||
#
|
#
|
||||||
# spec file for package htscodecs
|
# spec file for package htscodecs
|
||||||
#
|
#
|
||||||
# Copyright (c) 2023 SUSE LLC
|
# Copyright (c) 2024 SUSE LLC
|
||||||
#
|
#
|
||||||
# All modifications and additions to the file contributed by third parties
|
# All modifications and additions to the file contributed by third parties
|
||||||
# remain the property of their copyright owners, unless otherwise agreed
|
# remain the property of their copyright owners, unless otherwise agreed
|
||||||
@ -18,7 +18,7 @@
|
|||||||
|
|
||||||
%define sonum 2
|
%define sonum 2
|
||||||
Name: htscodecs
|
Name: htscodecs
|
||||||
Version: 1.5.0
|
Version: 1.6.0
|
||||||
Release: 0
|
Release: 0
|
||||||
Summary: C library for custom compression for CRAM and other formats
|
Summary: C library for custom compression for CRAM and other formats
|
||||||
License: MIT
|
License: MIT
|
||||||
|
Loading…
Reference in New Issue
Block a user