ETK Performance Optimization Working Group Discussion: https://docs.einsteintoolkit.org/et-docs/Working_Group_on_Performance_Optimization
Has anyone else used SLEEF (http://sleef.org/) as a fast SIMD math library? Are there any other good open-source alternatives out there?
Also, in case you didn't catch the news, the latest glibc (2.27) includes new, faster math functions optimized for AVX2/FMA (contributed by Intel). Speed-ups of 50% on Skylake are reported for some functions. See https://www.phoronix.com/scan.php?page=news_item&px=Intel-AVX2-FMA-Math-Glibc-2.27 and https://lwn.net/Articles/746327/ for details.
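In case it helps anyone check their own machines or clusters, here is a rough sketch of a glibc version check in Python. The helper names `parse_version` and `has_glibc_at_least` are made up for this example. `platform.libc_ver()` is in the standard library and reports the C library the Python executable is linked against, which is usually (but not necessarily) the system one. Having glibc 2.27 or newer only means the faster code paths are available; whether they are actually used presumably still depends on the CPU.

```python
import platform
import re


def parse_version(text):
    """Extract (major, minor) as ints from a string such as '2.27'."""
    return tuple(int(n) for n in re.findall(r"\d+", text)[:2])


def has_glibc_at_least(version_text, minimum=(2, 27)):
    """Return True if version_text names a release >= minimum."""
    version = parse_version(version_text)
    # An empty tuple means no version number was found.
    return bool(version) and version >= minimum


if __name__ == "__main__":
    # libc_ver() returns ('', '') when it cannot identify the C library.
    lib, version = platform.libc_ver()
    if lib == "glibc":
        print(f"glibc {version}; 2.27 or newer: {has_glibc_at_least(version)}")
    else:
        print("Could not detect glibc on this system")
```

Comparing versions as integer tuples rather than strings matters here, because a plain string comparison would rank "2.9" above "2.27".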