Software Optimizations Become Simple with Top-Down Analysis on Intel Skylake - Ahmad Yasin @ IDF'15
How TMA* Addresses Performance Analysis Challenges through the IceLake Processor / Ahmad Yasin
Understanding CPU Microarchitecture to Increase Performance
Unlocking Modern CPU Power - Next-Gen C++ Optimization Techniques - Fedor G Pikus - C++Now 2024
"Simple Code" Follow-up Part 1: A (Very) Simplified CPU Diagram
Bringing Choice, Automation and Performance to ML Deployment with Apache TVM and the OctoML Platform
MIC 2018 - Tensorflow optimizations and performance tuning for Intel platforms
Simple Code, High Performance
Profiling your application with Intel Vtune Amplifier ǀ Paulius Velesko, Intel
Live #15: Braindump on how optimizations interact with CPU microarchitecture
Up and Away: JDK Optimizations
Harnessing Intel Processor Trace on Windows for fuzzing and dynamic analysis
Optimization With the Help of CPU Counters
[ENG] Alexander Komarov: "Benchmarking and tuning NFV"
CPU Architectures: Why Best for Productivity is NOT the Best for Gaming! (Remastered)
Maximum Performance, Minimum Effort: Intel® Performance Libraries
Accelerate Transformer inference on CPU with Optimum and ONNX
SAN19-417 Performance Engineering for Arm Supercomputers
How to use Apache TVM to optimize your ML models
Non-Uniform Memory Architecture (NUMA): A Nearly Unfathomable Morass of Arcana - Fedor Pikus CppNow