librediffusion

A C++ / CUDA / TensorRT implementation of StreamDiffusion

Implemented in ossia score

Benchmarks

On an RTX 5090 at 1 step:

SDXL Turbo 1024x1024: stable 26 fps

SD Turbo 512x512: stable 96 fps

SDXS: above 600 fps
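
For real-time use it can help to read these figures as a per-frame time budget. The sketch below is only arithmetic on the numbers listed above (frame time in ms = 1000 / fps). The dictionary and function names are illustrative and not part of librediffusion.

```python
# Benchmark figures copied from the list above (RTX 5090, 1 step).
BENCHMARKS_FPS = {
    "SDXL Turbo 1024x1024": 26,
    "SD Turbo 512x512": 96,
    "SDXS": 600,  # reported as "above 600 fps", so the frame time is an upper bound
}


def frame_time_ms(fps: float) -> float:
    """Return the time spent per frame, in milliseconds, at a given frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


for name, fps in BENCHMARKS_FPS.items():
    print(f"{name}: {frame_time_ms(fps):.1f} ms per frame")
```

This gives roughly 38.5 ms per frame for SDXL Turbo, 10.4 ms for SD Turbo, and under 1.7 ms for SDXS.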

Models must first be converted to TensorRT engines with the Python script train-lora.py:

$ uv run train-lora.py --model stabilityai/sd-turbo --min-batch 1 --max-batch 1 --opt-batch 1 --min-resolution 512 --max-resolution 1024 --output ./engines-sd-turbo
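
When converting several models, the command above can be assembled programmatically. The following is a minimal sketch: `build_convert_command` is a hypothetical helper, not part of this repository, and it uses only the flags shown in the command above. The ordering checks (min <= opt <= max) are an assumption based on how TensorRT optimization profiles are normally specified; the script itself may validate differently.

```python
import shlex


def build_convert_command(model, output_dir, batch=(1, 1, 1), resolution=(512, 1024)):
    """Build the argument list for the conversion command shown above.

    batch is (min, opt, max); resolution is (min, max).
    Hypothetical helper: only the flags from the README command are used.
    """
    min_b, opt_b, max_b = batch
    min_r, max_r = resolution
    # Assumption: ranges must be ordered, as in a TensorRT optimization profile.
    if not (1 <= min_b <= opt_b <= max_b):
        raise ValueError("batch sizes must satisfy 1 <= min <= opt <= max")
    if not (0 < min_r <= max_r):
        raise ValueError("resolutions must satisfy 0 < min <= max")
    return [
        "uv", "run", "train-lora.py",
        "--model", model,
        "--min-batch", str(min_b),
        "--max-batch", str(max_b),
        "--opt-batch", str(opt_b),
        "--min-resolution", str(min_r),
        "--max-resolution", str(max_r),
        "--output", output_dir,
    ]


cmd = build_convert_command("stabilityai/sd-turbo", "./engines-sd-turbo")
print(shlex.join(cmd))  # pass cmd to subprocess.run(cmd, check=True) to execute it
```

With the default arguments this reproduces the command above; it does not run the conversion unless the list is handed to `subprocess.run`.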
