Reproducible local LLM setup and benchmark evidence for AMD Strix Halo / Ryzen AI MAX+ 395: 63-98.5 t/s direct Qwen MoE, 101.1 t/s MTP.
-
Updated
May 31, 2026 - Python
Reproducible local LLM setup and benchmark evidence for AMD Strix Halo / Ryzen AI MAX+ 395: 63-98.5 t/s direct Qwen MoE, 101.1 t/s MTP.
A production-grade Python SDK for the Lemonade LLM backend. Featuring auto-discovery, port scanning, and a low-overhead client architecture. Powering the Sorana AI workspace.
Sample MCP Server for weather queries - Reference implementation for ASUS and OEM partners
Add a description, image, and links to the ai-pc topic page so that developers can more easily learn about it.
To associate your repository with the ai-pc topic, visit your repo's landing page and select "manage topics."