Skip to content
#

ollamma

Here are 4 public repositories matching this topic...

Language: All
Filter by language

Intelligent LLM router that dynamically routes prompts between local Ollama (Qwen) and cloud models (Gemini) using complexity scoring, semantic caching, and cost-aware decisioning.

  • Updated May 15, 2026
  • Python

Improve this page

Add a description, image, and links to the ollamma topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ollamma topic, visit your repo's landing page and select "manage topics."

Learn more