nano-vllm

Star

Here are 5 public repositories matching this topic...

slwang-ustc / nano-vllm-v1

Star

Nano vLLM with vLLM v1's request scheduling strategy and chunked prefill

nlp deep-learning inference pytorch transformer llm vllm llm-inference nano-vllm

Updated Jan 26, 2026
Python

izmttk / ullm

Star

Lightweight LLM inference engine inspired by nano-vllm, with radix-tree based prefix cache, tp & pp, cuda graph, openai api, async scheduling, and more.

deep-learning inference pytorch transformer llm llm-serving nano-vllm

Updated Mar 29, 2026
Python

linzm1007 / llm_infer_learning

Star

大模型推理学习，vllm、sglang、nano-vllm学习记录

infer vllm sglang nano-vllm

Updated Dec 7, 2025

uttera / uttera-tts-vllm

Star

High-throughput TTS server based on vLLM continuous batching. VoxCPM2 and future Transformer TTS models. Optimized for cloud deployment and multi-tenant serving.

python text-to-speech concurrency self-hosted tts voice-cloning fastapi privacy-focused openai-api vllm local-ai open-webui nano-vllm openclaw personality-tuning voxcpm2 uttera

Updated Apr 23, 2026
Python

pradhankukiran / vox-populi

Star

Personal text-to-speech webapp powered by VoxCPM2 — voice design, controllable cloning, and ultimate cloning. Next.js on Vercel + Modal GPU.

python text-to-speech typescript ai nextjs modal tts voice-synthesis tailwindcss voice-cloning fastapi huggingface shadcn-ui gpu-inference nextjs16 nano-vllm openbmb voxcpm2

Updated May 14, 2026
TypeScript

Improve this page

Add a description, image, and links to the nano-vllm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the nano-vllm topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nano-vllm

Here are 5 public repositories matching this topic...

slwang-ustc / nano-vllm-v1

izmttk / ullm

linzm1007 / llm_infer_learning

uttera / uttera-tts-vllm

pradhankukiran / vox-populi

Improve this page

Add this topic to your repo