Skip to content

1.2.0

Latest
Compare
Choose a tag to compare
@jan-wassenberg jan-wassenberg released this 31 May 17:04
· 171 commits to master since this release
  • Add InterleaveEven/InterleaveOdd, BitShuffle, GatherIndexNOr

  • Add IsNegative, IfNegativeThenElseZero, IfNegativeThenZeroElse

  • Add NEON_BF16, HWY_VERSION_GE/LT, HWY_EXPORT_T/HWY_DYNAMIC_DISPATCH_T

  • Add PromoteInRangeTo/ConvertInRangeTo/DemoteInRangeTo

  • Add Rol/Ror, RotateLeft/RotateLeftSame/RotateRightSame

  • Add SatWidenMulPairwiseAccumulate, SatWidenMulAccumFixedPoint

  • Add stats.h, bit_set.h, IsEitherNaN

  • Add UI8/UI32/UI64 MulHigh, I64 MulEven/MulOdd/Mul128

  • Add WidenMulAccumulate, MulEvenAdd, MulOddAdd

  • contrib/bit_pack: support 32/64-bit lanes

  • contrib/math: Add Exp2, Hypot

  • contrib/matvec: Add MatVecAdd

  • contrib/sort: Add VQ/HeapSelect, partial sort

  • contrib/topology: add affinity, detect topology/cache size/CPU name

  • Enable runtime dispatch for NEON/RVV, bazel modules, abort handler

  • Remove DASSERT for negative Gather indices

  • Support opting out of GUnit dependency

  • Use SPR/ZEN4 bf16 dot product

  • Known GCC 13 RVV issue: parts of sort_test and bit_pack_test disabled

  • Known Clang RVV/QEMU issue: incorrect rounding mode in upper/lower halves