This is the reference implementation for the MLPerf Inference Graph Neural Network benchmark. The reference implementation currently uses the Deep Graph Library (DGL) with PyTorch as the backbone of the model.
Hardware requirements: the minimum requirements to run this benchmark are ~600GB of RAM and ~2.3TB of disk. These requirements stem from the need to create a memory map for the graph features instead of loading them into memory all at once.
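The memory-mapping idea can be illustrated with a small sketch: features live in a flat binary file on disk, and only the rows actually accessed are paged in. Everything here (file name, feature width, float32 layout) is a hypothetical stand-in for the real pre-generated feature binaries, not the benchmark's actual file format.

```python
import mmap
import os
import struct
import tempfile

FEAT_DIM = 4    # hypothetical feature width; the real IGBH embeddings are much wider
FLOAT_SIZE = 4  # bytes per float32

# Create a tiny stand-in feature file (the real file is far too large for RAM).
path = os.path.join(tempfile.mkdtemp(), "node_feat.bin")
with open(path, "wb") as f:
    for node_id in range(10):
        f.write(struct.pack(f"{FEAT_DIM}f", *[float(node_id)] * FEAT_DIM))

def load_node_features(path, node_id, feat_dim=FEAT_DIM):
    """Read a single node's feature row via mmap, without loading the whole file."""
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            offset = node_id * feat_dim * FLOAT_SIZE
            raw = mm[offset:offset + feat_dim * FLOAT_SIZE]
            return struct.unpack(f"{feat_dim}f", raw)

print(load_node_features(path, 7))  # only this row is paged in from disk
```

The same pattern scales to the full dataset: the OS pages in only the slices of the feature file that the sampled subgraphs touch.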
model | accuracy | dataset | model source | precision | notes |
---|---|---|---|---|---|
RGAT | 0.7286 | IGBH | Illinois Graph Benchmark | fp32 | - |
Data | Description | Task |
---|---|---|
IGBH | Illinois Graph Benchmark Heterogeneous is a graph dataset consisting of one heterogeneous graph with 547,306,935 nodes and 5,812,005,639 edges. Node types: Author, Conference, FoS, Institute, Journal, Paper. A subset of 1% of the paper nodes is randomly chosen as the validation dataset using the split seeds script. The validation dataset is used as the input queries for the SUT; however, the whole dataset is needed to run the benchmark, since all the graph connections are required to achieve the quality target. | Node Classification |
IGBH (calibration) | We sampled 5,000 nodes from the training paper nodes of IGBH for the calibration dataset. We provide the node IDs and the script to generate them (using the --calibration flag). | Node Classification |
Please see the new docs site for an automated way to run this benchmark across the different available implementations and do an end-to-end submission, with or without Docker.

You can also run `pip install mlc-scripts` and then use the `mlcr` commands given in the later sections to download the model and datasets.
Set the following helper variables:

```bash
export ROOT_INFERENCE=$PWD/inference
export GRAPH_FOLDER=$PWD/inference/graph/R-GAT/
export LOADGEN_FOLDER=$PWD/inference/loadgen
export MODEL_PATH=$PWD/inference/graph/R-GAT/model/
```
Clone the repository with its submodules:

```bash
git clone --recurse-submodules https://github.com/mlcommons/inference.git --depth 1
```
For NVIDIA GPU based runs:

```bash
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
```

For CPU based runs:

```bash
pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cpu
```
Install requirements:

```bash
cd $GRAPH_FOLDER
pip install -r requirements.txt
```

Install loadgen:

```bash
cd $LOADGEN_FOLDER
CFLAGS="-std=c++14" python setup.py install
```
Install the PyTorch Geometric packages, matching the installed torch version:

```bash
export TORCH_VERSION=$(python -c "import torch; print(torch.__version__)")
pip install torch-geometric torch-scatter torch-sparse -f https://data.pyg.org/whl/torch-${TORCH_VERSION}.html
```
Install DGL. For NVIDIA GPU based runs:

```bash
pip install dgl -f https://data.dgl.ai/wheels/torch-2.1/cu121/repo.html
```

For CPU based runs:

```bash
pip install dgl -f https://data.dgl.ai/wheels/torch-2.1/repo.html
```
```bash
mlcr get,ml-model,rgat --outdirname=<path_to_download>
```
To run Rclone on Windows, you can download the executable here. To install Rclone on Linux/macOS/BSD systems, run:

```bash
sudo -v ; curl https://rclone.org/install.sh | sudo bash
```

Once Rclone is installed, run the following command to authenticate with the bucket:

```bash
rclone config create mlc-inference s3 provider=Cloudflare access_key_id=f65ba5eef400db161ea49967de89f47b secret_access_key=fbea333914c292b854f14d3fe232bad6c5407bf0ab1bebf78833c2b359bdfd2b endpoint=https://c2686074cb2caf5cbaf6d134bdba8b47.r2.cloudflarestorage.com
```
You can then navigate in the terminal to your desired download directory and run the following command to download the fp32 checkpoint:

```bash
rclone copy mlc-inference:mlcommons-inference-wg-public/R-GAT/RGAT.pt $MODEL_PATH -P
```
MLC command:

```bash
mlcr get,dataset,igbh,_debug --outdirname=<path to download>
```
Download Dataset:

```bash
cd $GRAPH_FOLDER
python3 tools/download_igbh_test.py
```

Split Seeds:

```bash
cd $GRAPH_FOLDER
python3 tools/split_seeds.py --path igbh --dataset_size tiny
```
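Conceptually, the seed split shuffles the paper-node IDs with a fixed seed and carves off a validation fraction (1% for the full dataset). The sketch below is a hypothetical illustration of that idea, not the code in `tools/split_seeds.py`:

```python
import random

def split_seeds(num_paper_nodes, val_fraction=0.01, seed=42):
    """Illustrative split: deterministically shuffle paper-node ids and
    reserve `val_fraction` of them as the validation seeds."""
    rng = random.Random(seed)  # fixed seed makes the split reproducible
    node_ids = list(range(num_paper_nodes))
    rng.shuffle(node_ids)
    n_val = int(num_paper_nodes * val_fraction)
    val_ids = sorted(node_ids[:n_val])
    train_ids = sorted(node_ids[n_val:])
    return train_ids, val_ids

train, val = split_seeds(100_000)
print(len(val))  # 1% of the hypothetical paper nodes
```

Because the seed is fixed, every run produces the same validation set, which is what lets the benchmark's input queries stay consistent across submissions.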
Warning: This script will download 2.2TB of data.

MLC command:

```bash
mlcr get,dataset,igbh,_full --outdirname=<path to download>
```

Download Dataset:

```bash
cd $GRAPH_FOLDER
./tools/download_igbh_full.sh igbh/
```

Split Seeds:

```bash
cd $GRAPH_FOLDER
python3 tools/split_seeds.py --path igbh --dataset_size full
```
The calibration dataset contains 5,000 nodes from the training paper nodes of the IGBH dataset. We provide the node IDs and the script to generate them (using the `--calibration` flag).

MLC command:

```bash
mlcr get,dataset,igbh,_full,_calibration --outdirname=<path to download>
```
```bash
# Go to the benchmark folder
cd $GRAPH_FOLDER

# Run the benchmark with DGL
python3 main.py --dataset igbh-dgl-tiny --dataset-path igbh/ --profile debug-dgl [--model-path <path_to_ckpt>] [--device <cpu or gpu>] [--dtype <fp16 or fp32>] [--scenario <SingleStream, MultiStream, Server or Offline>]
```
```bash
# Go to the benchmark folder
cd $GRAPH_FOLDER

# Run the benchmark with DGL
python3 main.py --dataset igbh-dgl --dataset-path igbh/ --profile rgat-dgl-full [--model-path <path_to_ckpt>] [--device <cpu or gpu>] [--dtype <fp16 or fp32>] [--scenario <SingleStream, MultiStream, Server or Offline>]
```
```bash
mlcr process,mlperf,accuracy,_igbh --result_dir=<Path to directory where files are generated after the benchmark run>
```
Please click here to view the Python script for evaluating accuracy for the IGBH dataset.
Not implemented yet
Add the `--accuracy` flag to the command to run the benchmark in accuracy mode:

```bash
python3 main.py --dataset igbh --dataset-path igbh/ --accuracy --model-path model/ [--device <cpu or gpu>] [--dtype <fp16 or fp32>] [--scenario <SingleStream, MultiStream, Server or Offline>] [--layout <COO, CSC or CSR>]
```
NOTE: For official submissions you should submit the results of the accuracy run in a file called `accuracy.txt` with the following format:

```
accuracy=<accuracy>%, good=<number_of_good_samples>, total=<number_of_total_samples>
hash=<hash>
```
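A small sketch of producing and checking the accuracy line is below. The exact decimal precision of the percentage is an assumption here; the accuracy-evaluation script provided with the benchmark is what actually generates this file.

```python
import re

def format_accuracy(good, total):
    """Render the accuracy line in the accuracy.txt format shown above.
    (Three decimal places is an illustrative choice, not a spec requirement.)"""
    accuracy = 100.0 * good / total
    return f"accuracy={accuracy:.3f}%, good={good}, total={total}"

def parse_accuracy(line):
    """Parse an accuracy line back into (accuracy, good, total)."""
    m = re.fullmatch(r"accuracy=([\d.]+)%, good=(\d+), total=(\d+)", line)
    if m is None:
        raise ValueError(f"malformed accuracy line: {line!r}")
    return float(m.group(1)), int(m.group(2)), int(m.group(3))

line = format_accuracy(good=7286, total=10000)
print(line)  # accuracy=72.860%, good=7286, total=10000
```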
CPU: Build the Docker image:

```bash
docker build . -f dockerfile.cpu -t rgat-cpu
```

Run the Docker container:

```bash
docker run --rm -it -v $(pwd):/root rgat-cpu
```

Run the benchmark inside the Docker container:

```bash
python3 main.py --dataset igbh-dgl --dataset-path igbh/ --profile rgat-dgl-full --device cpu [--model-path <path_to_ckpt>] [--dtype <fp16 or fp32>] [--scenario <SingleStream, MultiStream, Server or Offline>]
```
GPU: Build the Docker image:

```bash
docker build . -f dockerfile.gpu -t rgat-gpu
```

Run the Docker container:

```bash
docker run --rm -it -v $(pwd):/workspace/root --gpus all rgat-gpu
```

Go inside the root folder and run the benchmark inside the Docker container:

```bash
cd root
python3 main.py --dataset igbh-dgl --dataset-path igbh/ --profile rgat-dgl-full --device gpu [--model-path <path_to_ckpt>] [--dtype <fp16 or fp32>] [--scenario <SingleStream, MultiStream, Server or Offline>]
```
NOTE: For official submissions, this benchmark is required to run in equal issue mode. Please make sure that the flag `rgat.*.sample_concatenate_permutation` is set to one in the mlperf.conf file when loadgen is built.
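For reference, the relevant entry in mlperf.conf would look like the following sketch; check the mlperf.conf shipped with loadgen for the exact surrounding entries:

```
# Equal issue mode for R-GAT (required for official submissions)
rgat.*.sample_concatenate_permutation = 1
```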