Skip to content

Releases: linkedin/Liger-Kernel

v0.8.0

30 Apr 23:01
c4b16d4

Choose a tag to compare

Liger-Kernel v0.8.0

Highlights ✨

🚀 MoE Training Acceleration via LigerExperts

A new fused MoE expert kernel (LigerExperts, backed by LigerFusedMoEFunction) inspired by SonicMoE (arXiv:2512.14080). It replaces the eager per-expert loop in HuggingFace MoE blocks with a Triton grouped-GEMM + fused SwiGLU + token aggregation path, with a memory-efficient backward.

Qwen3-30B-A3B fine-tuning on 2 × H100 (max seq len 32768, BS=1, GA=8): 8.24× tokens/sec, 8.19× training step, 4.10× eval, ~1% memory savings.

Auto-patched into Mixtral, Qwen3-MoE, Qwen3-Next, Qwen3.5-MoE, Qwen3-VL-MoE, GLM4V-MoE, and HunYuan-MoE-V1. (#1179, #1192)

🤖 Claude Code Skills for Kernel Authoring (.claude/skills/)

Three first-party Claude Code skills now ship with the repo to make extending Liger-Kernel dramatically easier:

  • liger-kernel-dev (#1170) — builds a production-ready Triton kernel end-to-end from a PyTorch op (file, URL, snippet, or NL): ops, module wrapper, functional API, tests, benchmark, exports.
  • liger-autopatch (#1167, #1177) — adds Liger support for a new HuggingFace model: reads modeling_*.py, generates lce_forward, monkey-patch entry, and convergence tests.
  • liger-kernel-perf (#1185) — profiles an existing kernel, generates and benchmarks optimization variants (Ampere / Hopper / Blackwell-aware), applies the winner.

Several PRs in this release were authored using these skills (e.g. #1165, #1166, #1171, #1187).

🧩 New Model Support

  • Qwen3.5 MoE (#1109) and Qwen3.5 dense (#1123)
  • Qwen3.5 multimodalQwen3_5ForConditionalGeneration (#1150)
  • Nemotron (#1165), with Liger ReLU² wired in (#1176)
  • Ministral (#1166)
  • Gemma 4 dense text, 31B-targeted (#1196)
  • Falcon H1 SwiGLU multipliers (#1201)

Ascend NPU Backend Support

Liger-Kernel now supports the Ascend NPU backend. Huge thanks to the Ascend team for the sustained effort across this release. See #969 for the full tracking issue and roadmap.


What's Changed

Read more

v0.7.0

12 Feb 22:00
7644a0f

Choose a tag to compare

🚀 Liger-Kernel Now Fully Supports Transformers v5

We’ve added full support for Transformers v5!
🔗 #994

Liger now supports all 🤗 Transformers versions ≥ 4.52.0, including the latest v5 release.

Broader compatibility. Seamless upgrades. No version headaches.

Thanks to all the contributors!

What's Changed

New Contributors

Full Changelog: v0.6.5...v0.7.0

v0.6.5

04 Feb 02:01
81f932a

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.6.4...v0.6.5

v0.6.4 release

21 Nov 22:48
0a62700

Choose a tag to compare

Highlights

New model architecture:
Qwen3-VL, hunyuanv1, Olmo3

New algorithm:
DAPO loss

Optimizations:
Layernorm backward, Tiled MLP

What's Changed

New Contributors

Full Changelog: v0.6.3...v0.6.4

v0.6.3 release

27 Oct 18:30
d5648bf

Choose a tag to compare

Highlights in this release:

New model architecture supports:
SmolVLM2, GLM4.5V, InternVL3, Falcon-H1, Qwen-Next

New algorithm:
GSPO

What's Changed

New Contributors

Full Changelog: v0.6.2...v0.6.3

v0.6.2

22 Aug 00:15
77a4c1a

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.6.1...v0.6.2

v0.6.1

28 Jul 18:36
7705dcc

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.6.0...v0.6.1

v0.6.0: New Attention Operators, Cosine Similarity Loss, Llama 4, and VLM Patching Updates

09 Jul 05:05
66570b1

Choose a tag to compare

Highlights

This release introduces significant improvements to Liger-Kernel, including new operators, support for Llama 4 models, more robust benchmarking automation, and key fixes for patching of vision-language models (VLMs) due to recent transformers refactoring.

Key Changes

New Features & Improvements

Bug Fixes

Documentation & CI Fixes

Benchmarking & Automation

New Contributors

Full Changelog: v0.5.10...v0.6.0

v0.5.10: Qwen3 MOE support, Sparsemax kernel, bug fixes

22 May 17:52
44a8f2f

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.5.9...v0.5.10

v0.5.9: Adds XPU Setup, GLM-4 & Qwen3 Model Support, Key Bugfixes

04 May 19:47
f19068f

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.5.8...v0.5.9