You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🖨️ Automated scanner document processor with AI-powered naming and WebDav integration. Receives scans via FTP, extracts text using Vision AI, generates intelligent filenames with Ollama AI, and uploads to your cloud storage.
AI-powered browser automation agent using a dual-LLM architecture. The orchestrator (qwen3-vl-32k) creates execution plans from screenshots, while the executor (llama3.1:8b) translates steps into browser actions using an accessibility tree for reliable element selection. Local, private, powered by Ollama.
Next-gen AI Optical Music Recognition (OMR) platform. Convert sheet music images into playable ABC notation instantly using Google Gemini 3 Pro Vision. Built with React 19, TypeScript, and Tailwind.
AI Nutrition Vision analyzes food images using OpenAI Vision to detect food items and produce detailed nutrition insights (calories, protein, fat, serving size, etc.) with clean Streamlit UI.
🔍 A CLIP-powered image similarity finder built with Streamlit — upload a query image and find the most visually similar matches from a gallery using deep visual embeddings.
🎶 Transform sheet music into interactive digital content with Resonote, leveraging advanced Optical Music Recognition for seamless musical score analysis.