stable diffusion xl 1.0 tensorrt LLM Model

Introduction

The Stable Diffusion XL 1.0 TensorRT repository contains TensorRT versions (sdxl, sdxl-lcm, sdxl-lcmlora) of Stable Diffusion XL 1.0, developed in collaboration with NVIDIA. These optimized versions enhance speed and efficiency for text-to-image generation tasks.

Architecture

Developer: Stability AI
Model Type: Diffusion-based text-to-image generative model
Base Model: SDXL base 1.0 and SDXL refiner 1.0 converted for NVIDIA TensorRT optimized inference
License: CreativeML Open RAIL++-M License

Training

Performance improvements are demonstrated through timing and throughput comparisons at 1024x1024 resolution for 30 steps:

A10: 13% timing improvement, 20% throughput improvement
A100: 26% timing improvement, 33% throughput improvement
H100: 41% timing improvement, 70% throughput improvement

For the Latent Consistency Model (LCM) version:

A100: Total time 426.16 ms
H100: Total time 234.22 ms

Guide: Running Locally

Setup Environment: Use a TensorRT NGC container.

git clone https://github.com/rajeevsrao/TensorRT.git
cd TensorRT
git checkout release/9.2
docker run --rm -it --gpus all -v $PWD:/workspace nvcr.io/nvidia/pytorch:23.11-py3 /bin/bash

Download SDXL TensorRT Files:

git lfs install 
git clone https://huggingface.co/stabilityai/stable-diffusion-xl-1.0-tensorrt
cd stable-diffusion-xl-1.0-tensorrt
git lfs pull
cd ..

Install Libraries:

cd demo/Diffusion
python3 -m pip install --upgrade pip
pip3 install -r requirements.txt
python3 -m pip install --pre --upgrade --extra-index-url https://pypi.nvidia.com tensorrt

Inference with TensorRT:

SDXL:

python3 demo_txt2img_xl.py "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" --build-static-batch --use-cuda-graph --num-warmup-runs 1 --width 1024 --height 1024 --denoising-steps 30 --onnx-base-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-base --onnx-refiner-dir /workspace/stable-diffusion-xl-1.0-tensorrt/sdxl-1.0-refiner

SDXL-LCM:

python3 demo_txt2img_xl.py "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" --version=xl-1.0 --onnx-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm --engine-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm/engine-sdxl-lcm-nocfg --scheduler LCM --denoising-steps 4 --guidance-scale 0.0 --seed 42

SDXL-LCMLORA:

python3 demo_txt2img_xl.py "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" --version=xl-1.0 --onnx-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcmlora --engine-dir /workspace/stable-diffusion-xl-1.0-tensorrt/lcm/engine-sdxl-lcmlora-nocfg --scheduler LCM --lora-path latent-consistency/lcm-lora-sdxl --lora-scale 1.0 --denoising-steps 4 --guidance-scale 0.0 --seed 42

License

The model is licensed under the CreativeML Open RAIL++-M License.

More Related APIs in Text To Image