DL Boost

Intel's Deep Learning Boost (DL Boost) is a marketing name for instruction set architecture (ISA) features on x86-64 processors designed to improve performance on deep learning tasks such as training and inference.[1]

Features

DL Boost consists of two sets of features:

AVX-512 VNNI, 4VNNIW, or AVX-VNNI: Vector Neural Network Instructions that provide fast integer multiply-accumulate operations, mainly for convolutional neural networks.
AVX-512 BF16: lower-precision bfloat16 floating-point numbers for generally faster computation. Operations provided include conversion to/from float32 and dot product.
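
The two feature sets can be illustrated with a small software model. The Python sketch below is an illustration only, not Intel's implementation: the function names are hypothetical, the integer routine is modeled on a single 32-bit lane of the VNNI instruction VPDPBUSD (unsigned 8-bit values multiplied by signed 8-bit values and accumulated into a 32-bit sum), and the bfloat16 conversion assumes round-to-nearest-even while ignoring hardware details such as denormal handling.

```python
import struct


def vnni_dot_lane(acc, a_u8, b_s8):
    """Model one 32-bit lane of a VNNI-style multiply-accumulate.

    Four unsigned 8-bit values are multiplied by four signed 8-bit values;
    the products are summed and added to a signed 32-bit accumulator,
    wrapping on overflow (no saturation).
    """
    if len(a_u8) != 4 or len(b_s8) != 4:
        raise ValueError("each lane takes exactly four byte pairs")
    if not all(0 <= x <= 255 for x in a_u8):
        raise ValueError("a_u8 must hold unsigned 8-bit values")
    if not all(-128 <= y <= 127 for y in b_s8):
        raise ValueError("b_s8 must hold signed 8-bit values")
    total = (acc + sum(x * y for x, y in zip(a_u8, b_s8))) & 0xFFFFFFFF
    # Reinterpret the wrapped 32-bit pattern as a signed integer.
    return total - (1 << 32) if total >= (1 << 31) else total


def float32_to_bf16_bits(x):
    """Return the 16-bit bfloat16 pattern for x (round-to-nearest-even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    is_nan = (bits & 0x7F800000) == 0x7F800000 and (bits & 0x007FFFFF) != 0
    if is_nan:
        # Keep the sign and exponent, force a quiet-NaN mantissa bit.
        return ((bits >> 16) | 0x0040) & 0xFFFF
    # Add 0x7FFF plus the lowest kept bit so that ties round to even.
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    return ((bits + rounding_bias) >> 16) & 0xFFFF


def bf16_bits_to_float32(b):
    """Widen a bfloat16 pattern to float32 by appending 16 zero bits."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]


def bf16_dot(acc, xs, ys):
    """Dot product with inputs rounded to bfloat16 first.

    Accumulation uses Python floats, so results are not bit-exact with
    hardware that accumulates in float32.
    """
    for x, y in zip(xs, ys):
        xb = bf16_bits_to_float32(float32_to_bf16_bits(x))
        yb = bf16_bits_to_float32(float32_to_bf16_bits(y))
        acc += xb * yb
    return acc
```

In this model, bfloat16 keeps the sign and 8-bit exponent of float32 but only 7 explicit mantissa bits, so a value such as 3.14159 is stored as 3.140625, and widening back to float32 is a plain 16-bit shift.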

DL Boost was introduced with the Cascade Lake architecture, which added AVX-512 VNNI; AVX-512 BF16 followed with Cooper Lake.
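
Because these extensions are present only on some processors, software usually checks for them at run time before relying on them. The Python sketch below shows one way to do this on Linux by reading the `flags` line of `/proc/cpuinfo`. The helper names are hypothetical, and the flag spellings (`avx512_vnni`, `avx512_4vnniw`, `avx_vnni`, `avx512_bf16`) are assumed to match those reported by the running kernel; other operating systems expose this information differently.

```python
# Assumed Linux /proc/cpuinfo flag names mapped to the feature names used above.
DL_BOOST_FLAGS = {
    "avx512_vnni": "AVX-512 VNNI",
    "avx512_4vnniw": "AVX-512 4VNNIW",
    "avx_vnni": "AVX-VNNI",
    "avx512_bf16": "AVX-512 BF16",
}


def dl_boost_features(cpuinfo_text):
    """Return the sorted DL Boost feature names found in cpuinfo-style text."""
    found = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            found.update(value.split())
    return sorted(name for flag, name in DL_BOOST_FLAGS.items() if flag in found)


def read_cpuinfo(path="/proc/cpuinfo"):
    """Read the cpuinfo file, returning an empty string if it is unavailable."""
    try:
        with open(path, encoding="utf-8", errors="replace") as handle:
            return handle.read()
    except OSError:
        return ""


if __name__ == "__main__":
    features = dl_boost_features(read_cpuinfo())
    print(", ".join(features) if features else "no DL Boost features detected")
```

Parsing is kept separate from file access so that the detection logic can be exercised with sample text on machines that lack `/proc/cpuinfo`.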

A TensorFlow-based benchmark run on Google Cloud Platform's Compute Engine showed improved performance and reduced cost compared with previous CPUs and with GPUs, especially for small batch sizes.[2]

Notes

[1] "Intel Deep Learning Boost" Product Overview, p. 3
[2] Samantha Gurriero, "Machine Learning Optimisation: What is the Best Hardware on GCP?", Datatonic

External links

Deep Learning Boost at Intel
Andres Rodrigues et al., "Lower Numerical Precision Deep Learning Inference and Training", Intel White paper
Intel and ML (2017), from Intel's Developer Relations Division

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
