This repository contains links to pre-trained models, sample scripts, best practices, and step-by-step tutorials for many popular open-source machine learning models optimized by Intel to run on Intel® Xeon® Scalable processors and Intel® Data Center GPUs.
Model packages and containers for running the Model Zoo's workloads can be found at the Intel® Developer Catalog. The purpose of the Model Zoo is to:
- Demonstrate the AI workloads and deep learning models Intel has optimized and validated to run on Intel hardware
- Show how to efficiently execute, train, and deploy Intel-optimized models
- Make it easy to get started running Intel-optimized models on Intel hardware in the cloud or on bare metal
DISCLAIMER: These scripts are not intended for benchmarking Intel platforms. For any performance and/or benchmarking information on specific Intel platforms, visit https://www.intel.ai/blog.
Intel is committed to respecting human rights and avoiding complicity in human rights abuses, a policy reflected in the Intel Global Human Rights Principles. Accordingly, by accessing the Intel material on this platform you agree that you will not use the material in a product or application that causes or contributes to a violation of an internationally recognized human right.
The Model Zoo for Intel® Architecture is licensed under Apache License Version 2.0.
To the extent that any public datasets are referenced by Intel or accessed using tools or code on this site, those datasets are provided by the third party indicated as the data source. Intel does not create the data or datasets and does not warrant their accuracy or quality. By accessing the public dataset(s), you agree to the terms associated with those datasets and that your use complies with the applicable license.
Please check the list of datasets used in the Model Zoo for Intel® Architecture in the datasets directory.
Intel expressly disclaims the accuracy, adequacy, or completeness of any public datasets and is not liable for any errors, omissions, or defects in the data, or for any reliance on the data. Intel is not liable for any damages relating to your use of public datasets.
The model documentation in the tables below has information on the prerequisites to run each model. The model scripts run on Linux. Certain models can also run on bare metal on Windows. For more information and a list of models that are supported on Windows, see the documentation here.
Instructions are also available for running select models on Sapphire Rapids.
For best performance on Intel® Data Center GPU Flex and Max Series, check the list of supported workloads. It provides instructions for running inference and training using Intel® Extension for PyTorch or Intel® Extension for TensorFlow.
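As a rough illustration only (not the Model Zoo's own launch scripts), the sketch below shows the common Intel® Extension for PyTorch pattern for BFloat16 inference on CPU. The tiny `torch.nn.Sequential` model is a hypothetical stand-in for any PyTorch workload in the tables that follow:

```python
import torch
import intel_extension_for_pytorch as ipex

# Toy stand-in for a real workload (e.g. one of the PyTorch models below).
model = torch.nn.Sequential(torch.nn.Linear(64, 64), torch.nn.ReLU()).eval()

# ipex.optimize applies operator/kernel optimizations; dtype=torch.bfloat16
# prepares the model for BFloat16 inference on supported Xeon processors.
model = ipex.optimize(model, dtype=torch.bfloat16)

x = torch.randn(8, 64)  # placeholder input
with torch.no_grad(), torch.autocast("cpu", dtype=torch.bfloat16):
    y = model(x)
```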
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
3D U-Net | TensorFlow | Inference | FP32 | BRATS 2018 |
3D U-Net MLPerf* | TensorFlow | Inference | FP32 BFloat16 Int8 | BRATS 2019 |
3D U-Net MLPerf* Sapphire Rapids | TensorFlow | Inference | FP32 BFloat16 Int8 BFloat32 | BRATS 2019 |
MaskRCNN | TensorFlow | Inference | FP32 | MS COCO 2014 |
UNet | TensorFlow | Inference | FP32 | |
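Some rows above (the 3D U-Net MLPerf* variants) list BFloat16 precision. Outside the Model Zoo's scripts, one generic way to enable BFloat16 compute in TensorFlow is the stock Keras mixed-precision API; the toy convolutional network below is only a placeholder, not 3D U-Net itself:

```python
import tensorflow as tf

# Run layer computations in BFloat16 while keeping variables in FP32.
tf.keras.mixed_precision.set_global_policy("mixed_bfloat16")

# Placeholder network standing in for a segmentation model such as U-Net.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(64, 64, 1)),
    tf.keras.layers.Conv2D(8, 3, padding="same", activation="relu"),
    tf.keras.layers.Conv2D(1, 1, dtype="float32"),  # keep outputs in FP32
])
print(model.layers[0].compute_dtype)  # bfloat16 under the global policy
```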
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
BERT large | TensorFlow | Inference | FP32 BFloat16 FP16 | SQuAD |
BERT large | TensorFlow | Training | FP32 BFloat16 FP16 | SQuAD and MRPC |
BERT large Sapphire Rapids | TensorFlow | Inference | FP32 BFloat16 Int8 BFloat32 | SQuAD |
BERT large Sapphire Rapids | TensorFlow | Training | FP32 BFloat16 BFloat32 | SQuAD |
DistilBERT base | TensorFlow | Inference | FP32 BFloat16 Int8 FP16 | SST-2 |
BERT base | PyTorch | Inference | FP32 BFloat16 | BERT Base SQuAD1.1 |
BERT large | PyTorch | Inference | FP32 Int8 BFloat16 BFloat32 | BERT Large SQuAD1.1 |
BERT large | PyTorch | Training | FP32 BFloat16 BFloat32 | preprocessed text dataset |
DistilBERT base | PyTorch | Inference | FP32 Int8 BFloat16 BFloat32 | DistilBERT Base SQuAD1.1 |
RNN-T | PyTorch | Inference | FP32 BFloat16 BFloat32 | RNN-T dataset |
RNN-T | PyTorch | Training | FP32 BFloat16 BFloat32 | RNN-T dataset |
RoBERTa base | PyTorch | Inference | FP32 BFloat16 | RoBERTa Base SQuAD 2.0 |
T5 | PyTorch | Inference | FP32 Int8 | |
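Several of the PyTorch rows above list Int8 precision. The Model Zoo's Int8 models ship with their own calibrated recipes; purely as a hedged illustration of the general idea, stock PyTorch's dynamic quantization converts `Linear` layers (which dominate Transformer-style models such as BERT and DistilBERT) to Int8:

```python
import torch

# Toy stand-in for a Transformer-style encoder dominated by Linear layers.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 128),
    torch.nn.ReLU(),
    torch.nn.Linear(128, 2),
).eval()

# Post-training dynamic quantization: weights become Int8, activations are
# quantized on the fly at inference time.
qmodel = torch.quantization.quantize_dynamic(
    model, {torch.nn.Linear}, dtype=torch.qint8
)

with torch.no_grad():
    logits = qmodel(torch.randn(4, 128))
```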
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
BERT | TensorFlow | Inference | FP32 | MRPC |
GNMT* | TensorFlow | Inference | FP32 | MLPerf GNMT model benchmarking dataset |
Transformer_LT_mlperf* | TensorFlow | Inference | FP32 BFloat16 Int8 | WMT English-German dataset |
Transformer_LT_mlperf* Sapphire Rapids | TensorFlow | Inference | FP32 BFloat16 Int8 BFloat32 | WMT English-German dataset |
Transformer_LT_mlperf* | TensorFlow | Training | FP32 BFloat16 | WMT English-German dataset |
Transformer_LT_mlperf* Sapphire Rapids | TensorFlow | Training | FP32 BFloat16 BFloat32 | WMT English-German dataset |
Transformer_LT_Official | TensorFlow | Inference | FP32 | WMT English-German dataset |
Transformer_LT_Official | TensorFlow Serving | Inference | FP32 | |
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
DIEN | TensorFlow | Inference | FP32 BFloat16 | DIEN dataset |
DIEN Sapphire Rapids | TensorFlow | Inference | FP32 BFloat16 BFloat32 | DIEN dataset |
DIEN | TensorFlow | Training | FP32 | DIEN dataset |
DIEN Sapphire Rapids | TensorFlow | Training | FP32 BFloat16 BFloat32 | DIEN dataset |
NCF | TensorFlow | Inference | FP32 | MovieLens 1M |
Wide & Deep | TensorFlow | Inference | FP32 | Census Income dataset |
Wide & Deep Large Dataset | TensorFlow | Inference | Int8 FP32 | Large Kaggle Display Advertising Challenge dataset |
Wide & Deep Large Dataset | TensorFlow | Training | FP32 | Large Kaggle Display Advertising Challenge dataset |
DLRM | PyTorch | Inference | FP32 Int8 BFloat16 BFloat32 | Criteo Terabyte |
DLRM | PyTorch | Training | FP32 BFloat16 BFloat32 | Criteo Terabyte |
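The training rows above list BFloat16 precision. As a minimal sketch of the mixed-precision mechanics only (the actual DLRM and DIEN recipes are considerably more involved), a single BFloat16 training step on CPU with `torch.autocast` looks like this:

```python
import torch

model = torch.nn.Linear(16, 1)  # toy stand-in for a recommendation model
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.randn(32, 16), torch.randn(32, 1)

opt.zero_grad()
with torch.autocast("cpu", dtype=torch.bfloat16):
    loss = torch.nn.functional.mse_loss(model(x), y)
# Parameters stay FP32, so BFloat16 training typically needs no loss scaling.
loss.backward()
opt.step()
```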
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
WaveNet | TensorFlow | Inference | FP32 | |
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
TransNetV2 | PyTorch | Inference | FP32 BFloat16 | Synthetic Data |
Model | Framework | Mode | Model Documentation | Benchmark/Test Dataset |
---|---|---|---|---|
AlphaFold2 | PyTorch | Inference | FP32 | AF2Dataset |
* Indicates that the model is part of the MLPerf suite of models and will be supported long-term.
Model | Framework | Mode | GPU Type | Model Documentation |
---|---|---|---|---|
ResNet 50 v1.5 | TensorFlow | Inference | Flex Series | Int8 |
ResNet 50 v1.5 | PyTorch | Inference | Flex Series | Int8 |
SSD-MobileNet* | TensorFlow | Inference | Flex Series | Int8 |
SSD-MobileNet | PyTorch | Inference | Flex Series | Int8 |
YOLO V4 | PyTorch | Inference | Flex Series | Int8 |
ResNet 50 v1.5 | TensorFlow | Inference | Max Series | Int8 FP32 FP16 |
ResNet 50 v1.5 | TensorFlow | Training | Max Series | BFloat16 |
ResNet 50 v1.5 | PyTorch | Inference | Max Series | Int8 |
ResNet 50 v1.5 | PyTorch | Training | Max Series | BFloat16 |
BERT large | PyTorch | Inference | Max Series | FP16 |
BERT large | PyTorch | Training | Max Series | BFloat16 |
BERT large | TensorFlow | Inference | Max Series | FP32 FP16 |
BERT large | TensorFlow | Training | Max Series | BFloat16 |
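For the GPU rows above, the following is a hedged sketch of the typical device placement, assuming an Intel® Extension for PyTorch build with XPU (Intel GPU) support, which registers the `"xpu"` device; it is not the Model Zoo's own launch flow:

```python
import torch
import intel_extension_for_pytorch as ipex  # registers the "xpu" device on GPU builds

# Toy stand-in for a real workload such as ResNet 50 v1.5 or BERT large.
model = torch.nn.Linear(64, 64).eval().to("xpu")
model = ipex.optimize(model, dtype=torch.float16)  # FP16, as in the Flex/Max rows

x = torch.randn(8, 64).to("xpu")
with torch.no_grad(), torch.autocast("xpu", dtype=torch.float16):
    y = model(x)
```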
If you would like to add a new benchmarking script, please use this guide.