Revision ac908f6 by Thomas Dinges (soc-2014-cycles)
May 13, 2014, 08:37 (GMT)
Cycles: Add an AVX2 CPU kernel.

New optimized kernel, which requires the AVX2 and FMA3 instruction sets.
At the moment the speedup is small (~2%), as we rely only on GCC and Clang automatic optimization; dedicated intrinsics can be added later.
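
For illustration only, a minimal sketch of what a dedicated-intrinsics code path could look like. It is not taken from this commit or from the Cycles source: `madd`, `madd_avx2` and `madd_scalar` are hypothetical names. It assumes GCC or Clang, uses the AVX2/FMA3 path only on x86-64 when the CPU reports both features at runtime, and falls back to plain scalar code otherwise.

```c
#include <stddef.h>

/* Scalar reference path: y[i] = a[i] * b[i] + y[i]. Hypothetical helper. */
static void madd_scalar(const float *a, const float *b, float *y, size_t n)
{
    for (size_t i = 0; i < n; i++)
        y[i] = a[i] * b[i] + y[i];
}

#if defined(__x86_64__) && (defined(__GNUC__) || defined(__clang__))
#include <immintrin.h>

/* AVX2 + FMA3 path. The target attribute enables these instruction sets
 * for this function only, so the file needs no -mavx2/-mfma flags. */
__attribute__((target("avx2,fma")))
static void madd_avx2(const float *a, const float *b, float *y, size_t n)
{
    size_t i = 0;
    /* Process 8 floats per iteration with one fused multiply-add. */
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);
        __m256 vb = _mm256_loadu_ps(b + i);
        __m256 vy = _mm256_loadu_ps(y + i);
        _mm256_storeu_ps(y + i, _mm256_fmadd_ps(va, vb, vy));
    }
    /* Remaining elements that do not fill a full 8-wide vector. */
    for (; i < n; i++)
        y[i] = a[i] * b[i] + y[i];
}

/* Runtime dispatch: use the AVX2 path only if the CPU supports it. */
void madd(const float *a, const float *b, float *y, size_t n)
{
    if (__builtin_cpu_supports("avx2") && __builtin_cpu_supports("fma"))
        madd_avx2(a, b, y, n);
    else
        madd_scalar(a, b, y, n);
}
#else
/* Non-x86 or other compilers: scalar path only. */
void madd(const float *a, const float *b, float *y, size_t n)
{
    madd_scalar(a, b, y, n);
}
#endif
```

Note that a fused multiply-add rounds once instead of twice, so results of the two paths can differ in the last bit for general inputs.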

D482 would be a good basis for further improvements.

Full Hash: ac908f6c1f6d77790d2645104d4ba9a139937317
Parent Commit: 6d62837
Lines Changed: +176, -0

