@mzeynali

What does this PR do?

This PR adds a training script for Latent Consistency Model (LCM) distillation applied to InstructPix2Pix with Stable Diffusion XL. This enables fast, few-step image editing (1-4 steps) while preserving the output quality of instruction-based editing.

Key Features

  • LCM Distillation Pipeline: Implements teacher-student distillation where a pre-trained InstructPix2Pix SDXL model (teacher) guides training of a lightweight student model capable of single-step inference
  • 8-Channel U-Net Support: Properly handles InstructPix2Pix's concatenated input (noisy latent + original image latent)
  • Time Conditioning: Adds guidance scale embedding to student U-Net for flexible inference
  • EMA Target Network: Uses exponential moving average for stable training targets
  • DDIM Solver Integration: Implements multi-step teacher predictions with classifier-free guidance
  • Flexible Loss Functions: Supports both L2 and Huber loss for robust training
  • Production-Ready: Includes validation, checkpointing, mixed precision, gradient checkpointing, and xFormers support
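The 8-channel input and the guidance-scale conditioning from the feature list can be sketched as follows. This is an illustrative sketch, not the PR's actual code: the function name `guidance_scale_embedding`, the embedding dimension, and the `w` range are assumptions modeled on common LCM implementations.

```python
import math
import torch

def guidance_scale_embedding(w: torch.Tensor, dim: int = 256) -> torch.Tensor:
    """Sinusoidal embedding of the guidance scale w, in the style of the
    LCM paper's time-conditioning; fed to the student U-Net alongside t."""
    w = w * 1000.0  # scale w before embedding, as in common implementations
    half = dim // 2
    freqs = torch.exp(-math.log(10000.0) * torch.arange(half) / half)
    args = w[:, None].float() * freqs[None]
    return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)  # (batch, dim)

batch = 2
noisy_latents = torch.randn(batch, 4, 64, 64)  # noised latent of the edited image
image_latents = torch.randn(batch, 4, 64, 64)  # latent of the original input image
# InstructPix2Pix concatenates both along the channel axis -> 8-channel U-Net input
unet_input = torch.cat([noisy_latents, image_latents], dim=1)

# sample a guidance scale w ~ U[w_min, w_max] and embed it (bounds are illustrative)
w = torch.rand(batch) * (15.0 - 1.0) + 1.0
w_emb = guidance_scale_embedding(w)
```

The embedded `w` lets a single student network emulate classifier-free guidance at arbitrary scales without the teacher's two forward passes.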

Training Algorithm

  1. Sample timestep from DDIM schedule
  2. Add noise to latents and sample guidance scale $w \in [w_{min}, w_{max}]$
  3. Student makes single-step prediction from noisy latents
  4. Teacher performs multi-step DDIM prediction with CFG
  5. Target network (EMA of student) generates stable training target
  6. Compute loss between student and target predictions
  7. Update the student parameters, then EMA-update the target network
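The steps above can be condensed into a minimal, self-contained sketch. Toy linear modules stand in for the SDXL U-Nets, one additive update stands in for the teacher's multi-step DDIM solve, and the pseudo-Huber loss follows the LCM recipe; all names, shapes, and hyperparameters here are illustrative, not the PR's API.

```python
import copy
import torch

torch.manual_seed(0)
student = torch.nn.Linear(8, 8)    # stands in for the student U-Net
target = copy.deepcopy(student)    # EMA target network (step 5)
teacher = torch.nn.Linear(8, 8)    # frozen teacher
opt = torch.optim.AdamW(student.parameters(), lr=1e-4)

x = torch.randn(16, 8)             # toy stand-in for the noisy 8-channel input

with torch.no_grad():
    # step 4: teacher prediction with classifier-free guidance
    # (the real script runs a multi-step DDIM solve here)
    cond, uncond = teacher(x), teacher(torch.zeros_like(x))
    w = 7.5
    teacher_pred = uncond + w * (cond - uncond)
    x_prev = x + 0.1 * teacher_pred          # toy one-step DDIM move toward t-1
    target_pred = target(x_prev)             # step 5: stable training target

# step 3 + 6: single-step student prediction, pseudo-Huber loss vs. target
student_pred = student(x)
huber_c = 0.001
loss = torch.sqrt((student_pred - target_pred) ** 2 + huber_c**2).mean() - huber_c

# step 7: update the student, then EMA-update the target network
opt.zero_grad()
loss.backward()
opt.step()
ema_decay = 0.95
with torch.no_grad():
    for p_t, p_s in zip(target.parameters(), student.parameters()):
        p_t.mul_(ema_decay).add_(p_s, alpha=1 - ema_decay)
```

The EMA target (rather than the student itself) is what makes the consistency objective stable: the target moves slowly, so the student is regressed toward a nearly fixed function at each step.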

Use Case

This script allows researchers and practitioners to create fast InstructPix2Pix SDXL models that perform high-quality image editing in as few as 4 inference steps instead of 50+, making real-time image editing applications feasible.

Who can review?

@yiyixuxu
