------------------------------------------------------------------- Tue Nov 24 10:03:21 UTC 2020 - Luigi Baldoni - Update to version 0.8.0 * Improve the performance by using a picture buffer pool; * ARM32 optimizations for 8bit bitdepth for ipred paeth, smooth, cfl * ARM32 optimizations for 10/12/16bit bitdepth for mc_avg/mask/w_avg, put/prep 8tap/bilin, wiener and CDEF filters * ARM64 optimizations for cfl_ac 444 for all bitdepths * x86 optimizations for MC 8-tap, mc_scaled in AVX2 * x86 optimizations for CDEF in SSE and {put/prep}_{8tap/bilin} in SSSE3 - Bump soversion to 5 - Drop dav1d-nasm-2.15.patch (merged upstream) ------------------------------------------------------------------- Tue Sep 1 11:11:55 UTC 2020 - Dominique Leuenberger - Add dav1d-nasm-2.15.patch: Fix compilation with nasm 2.15. ------------------------------------------------------------------- Mon Jun 22 08:13:31 UTC 2020 - aloisio@gmx.com - Update to version 0.7.1 * ARM32 NEON optimizations for itxfm, which can give up to 28% speedup, and MSAC * SSE2 optimizations for prep_bilin and prep_8tap * AVX2 optimizations for MC scaled * Fix a clamping issue in motion vector projection * Fix an issue on some specific Haswell CPU on ipred_z AVX2 functions * Improvements on the dav1dplay utility player to support resizing ------------------------------------------------------------------- Wed May 20 16:50:41 UTC 2020 - Luigi Baldoni - Update to verison 0.7.0 * Faster refmv implementation gaining up to 12% speed while -25% of RAM (Single Thread) * 10b/12b ARM64 optimizations are mostly complete: + ipred (paeth, smooth, dc, pal, filter, cfl) + itxfm (only 10b) * AVX2/SSSE3 for non-4:2:0 film grain and for mc.resize * AVX2 for cfl4:4:4 * AVX-512 CDEF filter * ARM64 8b improvements for cfl_ac and itxfm * ARM64 implementation for emu_edge in 8b/10b/12b * ARM32 implementation for emu_edge in 8b * Improvements on the dav1dplay utility player to support 10 bit, non-4:2:0 pixel formats and film grain on the GPU ------------------------------------------------------------------- Fri Mar 6 07:20:04 UTC 2020 - Luigi Baldoni - Update to version 0.6.0 * New ARM64 optimizations for the 10/12bit depth: + mc_avg, mc_w_avg, mc_mask + mc_put/mc_prep 8tap/bilin + mc_warp_8x8 + mc_w_mask + mc_blend + wiener + SGR + loopfilter + cdef * New AVX-512 optimizations for prep_bilin, prep_8tap, cdef_filter, mc_avg/w_avg/mask * New SSSE3 optimizations for film grain * New AVX2 optimizations for msac_adapt16 * Fix rare mismatches against the reference decoder, notably because of clipping * Improvements on ARM64 on msac, cdef and looprestoration optimizations * Improvements on AVX2 optimizations for cdef_filter * Improvements in the C version for itxfm, cdef_filter - Bump sover to 4 ------------------------------------------------------------------- Wed Dec 4 19:03:37 UTC 2019 - Luigi Baldoni - Update to version 0.5.2 * ARM32 optimizations for loopfilter, ipred_dc|h|v * Add section-5 raw OBU demuxer * Improve the speed by reducing the L2 cache collisions * Fix minor issues ------------------------------------------------------------------- Sat Oct 26 05:39:14 UTC 2019 - Luigi Baldoni - Update to version 0.5.1 * SSE2 optimizations for CDEF, wiener and warp_affine * NEON optimizations for SGR on ARM32 * Fix mismatch issue in x86 asm in inverse identity transforms * Fix build issue in ARM64 assembly if debug info was enabled * Add a workaround for Xcode 11 -fstack-check bug - Dropped arm64_ipred_symbols_aligned.patch (merged upstream) ------------------------------------------------------------------- Fri Oct 11 09:43:36 UTC 2019 - Luigi Baldoni - Update to version 0.5.0 Medium release fixing regressions and minor issues, and improving speed significantly: * Export ITU T.35 metadata * Speed improvements on blend_ on ARM * Speed improvements on decode_coef and MSAC * NEON optimizations for blend*, w_mask_, ipred functions for ARM64 * NEON optimizations for CDEF and warp on ARM32 * SSE2 optimizations for MSAC hi_tok decoding * SSSE3 optimizations for deblocking loopfilters and warp_affine * AVX-2 optimizations for film grain and ipred_z2 * SSE4 optimizations for warp_affine * VSX optimizations for wiener * Fix inverse transform overflows in x86 and NEON asm * Fix integer overflows with large frames * Improve film grain generation to match reference code * Improve compatibility with older binutils for ARM * More advanced Player example in tools - Bump soversion to 3 - Added arm64_ipred_symbols_aligned.patch to fix aarch64 build ------------------------------------------------------------------- Mon Aug 5 14:55:40 UTC 2019 - Luigi Baldoni - Update to version 0.4.0 * Fix playback with unknown OBUs * Add an option to limit the maximum frame size * SSE2 and ARM64 optimizations for MSAC * Improve speed on 32bits systems * Optimization in obmc blend * Reduce RAM usage significantly * The initial PPC SIMD code, cdef_filter * NEON optimizations for blend functions on ARM * NEON optimizations for w_mask functions on ARM * NEON optimizations for inverse transforms on ARM64 * Improve handling of malloc failures * Simple Player example in tools - Dropped dav1d.armv6.patch (merged upstream) - Bumped SOVERSION to 2 ------------------------------------------------------------------- Mon May 13 19:48:51 UTC 2019 - olaf@aepfle.de - Added dav1d.armv6.patch (disables armv7 asm for armv6 builds) ------------------------------------------------------------------- Sat May 11 16:06:40 UTC 2019 - Luigi Baldoni - Update to version 0.3.1 * Fix a buffer overflow in frame-threading mode on SSSE3 CPUs * Reduce binary size, notably on Windows * SSSE3 optimizations for ipred_filter * ARM optimizations for MSAC ------------------------------------------------------------------- Mon Apr 29 18:07:47 UTC 2019 - Luigi Baldoni - Update to version 0.3.0 * Fixes an annoying crash on SSSE3 that happened in the itx functions ------------------------------------------------------------------- Fri Apr 19 07:48:06 UTC 2019 - Luigi Baldoni - Update to version 0.2.2 * Large improvement on MSAC decoding with SSE, bringing 4-6% speed increase The impact is important on SSSE3, SSE4 and AVX-2 cpus * SSSE3 optimizations for all blocks size in itx * SSSE3 optimizations for ipred_paeth and ipref_cfl (420, 422 and 444) * Speed improvements on CDEF for SSE4 CPUs * NEON optimizations for SGR and loop filter * Minor crashes, improvements and build changes ------------------------------------------------------------------- Tue Apr 2 06:43:21 UTC 2019 - Dominique Leuenberger - Add baselibs.conf: ffmpeg, which is the main consumer of Dav1d, produces -32bit packages that would be uninstallable otherwise. ------------------------------------------------------------------- Tue Mar 12 22:23:22 UTC 2019 - Luigi Baldoni - Update to version 0.2.1 * SSSE3 optimization for cdef_dir * AVX-2 improvements of the existing CDEF optimizations * NEON improvements of the existing CDEF and wiener optimizations * Clarification about the numbering/versionning scheme ------------------------------------------------------------------- Mon Mar 4 17:31:53 UTC 2019 - Luigi Baldoni - Update to version 0.2.0 * ARM64 and ARM optimizations using NEON instructions * SSSE3 optimizations for both 32 and 64bits * More AVX-2 assembly, reaching almost completion * Fix installation of includes * Rewrite inverse transforms to avoid overflows * Snap packaging for Linux * Updated API (ABI and API break) * Fixes for un-decodable samples ------------------------------------------------------------------- Thu Dec 13 13:21:36 UTC 2018 - Jan Engelhardt - Redo description and mention SIMD acceleration. ------------------------------------------------------------------- Thu Dec 13 11:53:50 UTC 2018 - Luigi Baldoni - Moved license file to library package ------------------------------------------------------------------- Tue Dec 11 18:25:04 UTC 2018 - Luigi Baldoni - Initial stable package (v0.1.0)