Data Scientist, AI/ML engineer building production multimodal systems — voice, vision, and LLM-driven applications.
- 🔭 Currently working on: Face Recognition · Voice STT & TTS · LLM-based news intelligence · Fault diagnosis
- 🧠 Focus: Large Language Models, multi-agent systems, and multimodal AI (voice + vision), deployed at production scale on Kubernetes
- 🌱 Exploring: real-time streaming TTS and agentic workflows
Languages: Python · Java · SQL · Shell AI/ML: LLMs · multi-agent systems · ASR/TTS · multimodal (voice + vision) Backend: FastAPI · Spring Boot · Prefect Data: ClickHouse · PostgreSQL Infra: Docker · Kubernetes · GitLab CI/CD
- GitHub: @wuxuedaifu
- Email: wuxuedaifu@gmail.com
