
Conversation

@cpersson-amd

This PR implements the following:

  • TransformerEngine flash attention for WAN training and inference (a brief usage sketch follows below).
  • A new FSDP sharding parallelism optimized for use on GPUs (also sketched below).
  • Minor changes to allow training with Flax version 0.11.2.

The code has been tested on GPUs with WAN 2.1 (training and inference) and Flux (training only).
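For context, here is a minimal sketch of the kind of fused flash-attention call this enables on GPU. It uses JAX's built-in jax.nn.dot_product_attention with the cuDNN backend as a stand-in; the PR itself integrates TransformerEngine's fused attention, whose module names and arguments are not reproduced here.

```python
# Illustrative only: a fused (flash) attention call on GPU via JAX's
# built-in jax.nn.dot_product_attention. The PR wires in TransformerEngine's
# fused attention instead; this stand-in just shows the shape of the call.
import jax
import jax.numpy as jnp

B, T, N, H = 2, 1024, 16, 64  # batch, sequence length, heads, head dim
kq, kk, kv = jax.random.split(jax.random.PRNGKey(0), 3)
q = jax.random.normal(kq, (B, T, N, H), dtype=jnp.bfloat16)
k = jax.random.normal(kk, (B, T, N, H), dtype=jnp.bfloat16)
v = jax.random.normal(kv, (B, T, N, H), dtype=jnp.bfloat16)

# implementation="cudnn" requests the fused cuDNN flash-attention kernel and
# raises if it is unavailable; pass implementation=None to let JAX choose.
out = jax.nn.dot_product_attention(q, k, v, is_causal=False,
                                   implementation="cudnn")
print(out.shape)  # (2, 1024, 16, 64)
```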
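Similarly, a sketch of FSDP-style sharding in JAX: parameters are split along a dedicated mesh axis so each GPU holds only a slice of every weight. The axis name "fsdp", the mesh shape, and the parameter shapes below are assumptions for illustration, not the names used in this PR.

```python
# Illustrative only: FSDP-style parameter sharding with a dedicated "fsdp"
# mesh axis. Axis/parameter names and shapes are assumptions, not this PR's.
import jax
import jax.numpy as jnp
from jax.experimental import mesh_utils
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

num_devices = jax.device_count()
devices = mesh_utils.create_device_mesh((num_devices,))
mesh = Mesh(devices, axis_names=("fsdp",))

# Shard a weight matrix along its first dimension across the "fsdp" axis;
# the second dimension stays replicated (None).
w = jnp.ones((8192, 4096), dtype=jnp.bfloat16)
w_sharded = jax.device_put(w, NamedSharding(mesh, P("fsdp", None)))
print(w_sharded.sharding)
```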

@google-cla

google-cla bot commented Dec 16, 2025

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

cpersson-amd marked this pull request as draft on December 17, 2025 at 00:18
cpersson-amd marked this pull request as ready for review on December 17, 2025 at 10:21
cpersson-amd reopened this on December 17, 2025