Popular repositories Loading
-
multimodal-light-agent
multimodal-light-agent Public(更新中,尚未完善)本项目是我自主开发的小型多模态轻量 Agent 系统,面向视频内容智能问答场景。系统以视频、图像、文本等多模态输入为基础,通过意图理解、任务路由、工具调用与模块调度,整合 OpenCV 视频处理、视觉分析与大模型文本生成能力,探索跨模态信息融合与智能回答生成的完整执行流程。
Python 3
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.