Skip to content

flagos-ai/FlagScale

Repository files navigation

github+banner-20260130

[中文版|English]

Important

2026/03 UPDATE

v1.0.0 is now officially released as the first stable version. The codebase has been significantly refactored since v1.0.0-alpha.0. The hardware-specific (multi-chip) support has been moved into plugin repositories such as TransformerEngine-FL and vllm-plugin-FL. These plugins build on top of FlagOS, a unified open-source AI system software stack. If you are using or upgrading from a version earlier than v1.0.0-alpha.0, please use the main-legacy branch. It will continue to receive critical bug fixes and minor updates for a period of time.

Overview

FlagScale is a core component of FlagOS — a unified, open-source AI system software stack that fosters an open technology ecosystem by seamlessly integrating various models, systems, and chips. Following the principle of "develop once, migrate across various chips", FlagOS aims to unlock the full computational potential of hardware, break down barriers between different chip software stacks, and effectively reduce migration costs.To get started, see the Getting Started Guide.

As the central toolkit of this ecosystem, FlagScale provides a unified interface covering the complete lifecycle of large language models, multimodal models, and embodied AI models. It integrates multiple open-source backend engines under a single configuration and CLI interface, supporting key workflows including model training, reinforcement learning, and inference — with consistent operation across diverse chip vendors.

Within the FlagOS ecosystem, FlagScale works together with several other components:

  • FlagOS Plugins – hardware-adapted integrations of upstream AI frameworks
  • FlagCX – a scalable and adaptive cross-chip communication library
  • FlagOS-Robo related – infrastructure for embodied AI workloads

FlagOS plugin projects are built on top of widely used upstream open-source frameworks and extend them to support multiple AI chips. These plugins provide hardware compatibility and runtime integration for training, reinforcement learning, and inference.

The following table lists the mapping between FlagOS plugins and their corresponding upstream projects.

Task FlagOS Plugin Projects Upstream Projects
Training Megatron-LM-FL
TransformerEngine-FL
Megatron-LM
TransformerEngine
Reinforcement Learning VeRL-FL veRL
Serve / Inference vllm-plugin-FL vllm

Resources

Support List

Training

Model Example config File
DeepSeek-V3 16b_a3b.yaml
Qwen2/2.5/3 235b_a22b.yaml
Qwen2.5-VL 7b.yaml
QwQ 32b.yaml
LLaMA2 7b.yaml
LLaMA3/3.1 70b.yaml
LLaVA-OneVision 7b.yaml
LLaVA1.5 7b.yaml
Mixtral 8x7b.yaml
RWKV 7b.yaml
Aquila 7b.yaml
... ...

Serve/Inference

Model Example config File
DeepSeek-V3 671b.yaml
DeepSeek-R1 671b.yaml
Qwen2.5 72b.yaml
Qwen3 8b.yaml
Qwen2.5-VL 32b_instruct.yaml
Qwen3-Omni 30b.yaml
QwQ 32b.yaml
Grok2 270b.yaml
Kimi-K2 1t.yaml
... ...

Contribution

join our WeChat Group

开源小助手

License

This project is licensed under the Apache License (Version 2.0). This project also contains other third-party components under other open-source licenses.

About

FlagScale is a large model toolkit based on open-sourced projects.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

 
 
 

Contributors