[Inquiry] Appreciation for InteractiveOmni and questions about training code & GRPO support

Hello InteractiveOmni Team,

First of all, we would like to express our sincere appreciation for your outstanding work on **InteractiveOmni**. The model's performance is impressive, and we find the **Audio-Visual Multi-turn Dialogue** capability particularly robust and well suited to the research tasks our team is currently working on.

We would like to dig deeper into this project and have a few questions about future updates:

1. **Training Code Release**: Do you have any plans to open-source the training code? Having access to the training pipeline would be incredibly helpful for us to understand the model better and adapt it to our specific scenarios.

2. **Framework Support**: Are there any plans to support popular training frameworks such as **ms-swift** or **LLaMA-Factory**? Integration with these frameworks would greatly facilitate the fine-tuning and deployment process for the community.

3. **RL / GRPO Support**: Given the recent advances in multimodal RL, are you considering support for reinforcement learning methods such as **GRPO (Group Relative Policy Optimization)** for this model? We believe this could further enhance the model's reasoning and interaction capabilities.

Thank you again for your contribution to the open-source community! Looking forward to your response.

Best regards,
