highway/highway.changes

161 lines
6.6 KiB
Plaintext
Raw Permalink Normal View History

-------------------------------------------------------------------
Sun Feb 18 20:26:43 UTC 2024 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.1.0
* Add BitCastScalar, DispatchedTarget, Foreach
* Add Div/Mod and MaskedDiv/ModOr, SaturatedAbs, SaturatedNeg
* Add InterleaveWholeLower/Upper, Dup128VecFromValues
* Add IsInteger, IsIntegerLaneType, RemoveVolatile, RemoveCvRef
* Add MaskedAdd/Sub/Mul/Div/Gather/Min/Max/SatAdd/SatSubOr
* Add MaskFalse, IfNegativeThenNegOrUndefIfZero, PromoteEven/OddTo
* Add ReduceMin/Max, 8-bit reductions, f16 <-> f64 conversions
* Add Span, AlignedArray, matrix-vector mul
* Add SumsOf2/4, I8 SumsOf8, SumsOfAdjQuadAbsDiff,
SumsOfShuffledQuadAbsDiff
* Extend Dot to f32*bf16, FMA to integer
* Fix: RVV 8-bit overflow, UB in vqsort, big-endian bugs, PPC HTM
* New targets: HWY_Z14, HWY_Z15
-------------------------------------------------------------------
Fri Sep 22 01:30:49 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.7
* Add LoadNOr, GatherIndexN, ScatterIndexN
* Add additional float<->int conversions
* Codegen improvements for 8-bit shift, PPC Compress/Expand
-------------------------------------------------------------------
Fri Aug 11 15:57:12 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.6
* Add MaskedGatherIndex, MaskedScatterIndex, LoadN, StoreN,
SatWidenMulPairwiseAdd, SumOfMulQuadAccumulate,
PromoteUpperLowerTo.
* Add F64 for Wasm, F64 AbsDiff
* Validate all D args in x86 function signatures
-------------------------------------------------------------------
Wed Jul 19 16:33:36 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.5
* Add Insert/ExtractBlock, BroadcastBlock/Lane, NumBlocks
* Add integer Le/Ge and [Neg]MulAdd, extend DemoteTo/PromoteTo
* Add Leading/TrailingZeroCount, HighestSetBitIndex, ReverseBits
* Add MaskedLoadOr, tuple Get/Set/Create, ReduceSum,
WidenMulPairwiseAdd
* Add [ZeroExtend]ResizeBitCast, BitwiseIfThenElse,
Find[Known]LastTrue
* Add AESRoundInv, AESKeyGenAssist
* Add contrib/math Atan2/SinCos, contrib/unroller
* Add fp16/bf16 support (Armv8, SVE, RVV), HWY_DYNAMIC_POINTER
* Add OrderedTruncate2To, Per4LaneBlockShuffle,
TwoTablesLookupLanes
* Add SlideUp/Down[Blocks/Lanes], Slide1Up/Down, ReverseLaneBytes
* Add SetBeforeFirst, SetAtOrBefore/AfterFirst, SetOnlyFirst
* Add 8-bit Reverse2/4/8, Shl/Shr, RotateRight, Reverse, Mul
* Add 8/16-bit DupEven/Odd, TableLookupLanes
* Add F64 ApproximateReciprocal[Sqrt], 32/64-bit SaturatedAdd/Sub
-------------------------------------------------------------------
Wed May 24 09:10:51 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update memory limiter from 900 to 1400/process.
-------------------------------------------------------------------
Fri May 12 19:37:01 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Add memory-constraints to build
-------------------------------------------------------------------
Wed May 10 10:35:45 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Add no-forced-inline.diff [boo#1211093]
-------------------------------------------------------------------
Fri Mar 17 17:09:46 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.4
* Add PPC8..10, SSE2, AVX3_ZEN4, NEON_WITHOUT_AES targets
* Add Expand, LoadExpand, integer AbsDiff, SumsOf8AbsDiff
* Improved Half/Twice support, codegen for Shift*Same
* Faster KV128 sorting
* Update RISC-V V intrinsics for 1.0-draft
- Remove arm-disable-runtime-dispatch.patch (appears merged)
-------------------------------------------------------------------
Thu Jan 19 16:21:05 UTC 2023 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.3
* Add RearrangeToOddPlusEven, Xor3, 8-bit CompressStore,
HWY_ASSUME
* Add contrib/bit_pack for 8/16-bit lanes
* Update for new RVV intrinsics; faster WASM min/max and
extmul/q15mul
-------------------------------------------------------------------
Thu Dec 15 11:32:45 UTC 2022 - Simon Vogl <simon.vogl@gmx.net>
- Added missing baselibs.conf so that 32bit library packages
become available
-------------------------------------------------------------------
Wed Nov 23 23:15:10 UTC 2022 - Bruno Pitrus <brunopitrus@hotmail.com>
- Fix the library being built for incorrect microarchitecture
on armv{6,7}hl.
* add arm-disable-runtime-dispatch.patch to fix compiler error
- Actually run the testsuite.
-------------------------------------------------------------------
Tue Nov 8 14:21:25 UTC 2022 - Jan Engelhardt <jengelh@inai.de>
- Have armv7 build succeed again.
-------------------------------------------------------------------
Tue Nov 1 08:29:19 UTC 2022 - Jan Engelhardt <jengelh@inai.de>
- Update to release 1.0.2
* Add ExclusiveNeither, FindKnownFirstTrue, Ne128
* Add 16-bit SumOfLanes/ReorderWidenMulAccumulate/ReorderDemote2To
* Faster sort for low-entropy input, improved pivot selection
* Support static dispatch to SVE2_128 and SVE_256
- Leap just needs a modern gcc, no need for clang
-------------------------------------------------------------------
Wed Oct 26 16:47:07 UTC 2022 - Tom Mbrt <tom.mbrt@googlemail.com>
- Fix build on openSUSE Leap by using clang
-------------------------------------------------------------------
Thu Sep 22 09:20:14 UTC 2022 - Enrico Belleri <idesmi@protonmail.com>
- Update to 1.0.1:
* Add Eq128, i64 Mul, unsigned->float ConvertTo
* Faster sort for few unique keys, more robust pivot selection
* Fix: floating-point generator for sort tests, Min/MaxOfLanes for i16
* Fix: avoid always_inline in debug, link atomic
* GCC warnings: string.h, maybe-uninitialized, ignored-attributes
* GCC warnings: preprocessor int overflow, spurious use-after-free/overflow
* Doc: <=HWY_AVX3, Full32/64/128, how to use generic-inl
* ABI change: 64-bit target values, more room for expansion
* Add CompressBlocksNot, CompressNot, Lt128Upper, Min/Max128Upper, TruncateTo
* Add HWY_SVE2_128 target
* Sort speedups especially for 128-bit
* Documentation clarifications
* Faster NEON CountTrue/FindFirstTrue/AllFalse/AllTrue
* Improved SVE codegen
* Fix u16x8 ConcatEven/Odd, SSSE3 i64 Lt
* MSVC 2017 workarounds
* Support for runtime dispatch on Arm/GCC/Linux
-------------------------------------------------------------------
Wed Jul 13 10:09:10 UTC 2022 - Guillaume GARDET <guillaume.gardet@opensuse.org>
- Use GCC11 instead of default GCC12 to build on aarch64 Tumbleweed
until fixed upstream - https://github.com/google/highway/issues/776
-------------------------------------------------------------------
Sat Jul 9 12:19:06 UTC 2022 - Jan Engelhardt <jengelh@inai.de>
- Initial package (version 0.17.0) for build.opensuse.org