# SDXL IRs and Scripts

## SDXL end-to-end benchmarking

1. Check out and compile IREE with a release build, then put its tools on your path: `export PATH=/path/to/iree/build/release/tools:$PATH`.
2. Compile the full SDXL model: `./compile-txt2img.sh gfx942`, where `gfx942` is the target architecture for MI300X.
3. Run the benchmark: `./benchmark-txt2img.sh N /path/to/weights/irpa`, where `N` is the GPU index. A consolidated version of these steps is sketched after this list.
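
The sketch below strings the three steps together, assuming a local checkout of this repository. The IREE clone location, build directory layout, and CMake options are assumptions; consult IREE's build documentation for the GPU backend options your IREE version needs.

```bash
#!/usr/bin/env bash
# Consolidated sketch of the three steps above. Paths and CMake options are
# assumptions; enable the ROCm/HIP backend options per IREE's build docs.
set -euo pipefail

# 1. Check out and build IREE in release mode, then put its tools on PATH.
git clone --recursive https://github.com/iree-org/iree.git
cmake -S iree -B iree/build/release -DCMAKE_BUILD_TYPE=Release
cmake --build iree/build/release
export PATH="$PWD/iree/build/release/tools:$PATH"

# 2. Compile the full SDXL txt2img pipeline for MI300X (gfx942).
./compile-txt2img.sh gfx942

# 3. Benchmark on GPU 0; the IRPA path is a placeholder for your weights file.
./benchmark-txt2img.sh 0 /path/to/weights/irpa
```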

## Model IRs and weights

> [!CAUTION]
> IRs in the following table might be stale. Use the ones in the `base_ir/` directory instead.

> [!NOTE]
> SDXL-turbo differs from SDXL only in its usage and training/weights. The model architecture, and therefore the weights-stripped MLIR, is the same.

The stripped MLIRs are meant to be compiled together with the matching splat or real-weight IRPA parameters; a usage sketch follows the tables.

| Variant | Submodel | MLIR (No Weights) (Config A) | safetensors | Splat IRPA | MLIR (No Weights) (Config B) |
| --- | --- | --- | --- | --- | --- |
| SDXL1.0 1024x1024 (f16, BS1, len64) | UNet + attn | Torch, Linalg | - | - | Azure |
| | UNet + PNDMScheduler | Azure | - | - | - |
| | Clip1 | Azure | - | - | - |
| | Clip2 | Azure | - | - | - |
| | VAE decode + attn | Azure | - | - | Azure |
| | VAE encode + attn | [GCloud][sdxl-1-1024x1024-f16-stripped-weight-vae-encode] | Same as decode | - | - |
| SDXL1.0 1024x1024 (f32, BS1, len64) | UNet + attn | Azure | Azure | Azure | Azure |
| | Clip1 | Azure | Azure | Azure | - |
| | Clip2 | Azure | Azure | Azure | - |
| | VAE decode + attn | Azure | Azure | Azure | Azure |

SDXL compiled pipeline IRPAs (f16):

| Submodel | IRPA |
| --- | --- |
| UNet | scheduled_unet_f16.irpa |
| Prompt Encoder (CLIP1 + CLIP2) | prompt_encoder_f16.irpa |
| VAE | vae_decode_f16.irpa |
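
For context on how these artifacts fit together: a weights-stripped MLIR is compiled to a `.vmfb` with `iree-compile`, and the splat or real-weight IRPA is supplied at run time as a parameter archive. The sketch below is illustrative only: the file names, function name, and parameter scope are placeholders, and flag spellings (ROCm/HIP target, device URI) vary across IREE releases, so defer to `compile-txt2img.sh` and `benchmark-txt2img.sh` in this repository for the options actually used.

```bash
# Illustrative only: file names are placeholders, the function name and inputs
# depend on the exported submodel, and flag spellings differ across IREE
# releases -- the scripts in this repo are the source of truth.
iree-compile unet_stripped.mlir \
  --iree-hal-target-backends=rocm \
  --iree-rocm-target-chip=gfx942 \
  -o unet.vmfb

# Benchmark with the splat IRPA standing in for real weights; substitute the
# real-weight IRPA when output quality matters.
iree-benchmark-module \
  --device=rocm://0 \
  --module=unet.vmfb \
  --parameters=model=unet_splat.irpa \
  --function=main
  # plus --input= flags matching the exported function's signature
```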
