A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
Updated Apr 4, 2026
Group Evolving Agents: Open-Ended Self-Improvement via Experience Sharing
SutroYaro — Sutro Group research workspace for energy-efficient AI training. Point any coding agent at the repo and it becomes a research agent. 34 experiments, eval environment, weekly catch-ups, multi-researcher workflow.
Faraday: An Autonomous Web Research Agent (LangGraph/Streamlit). 🕵️‍♀️ Investigates queries using dynamic tools (Tavily, Google, NewsAPI, etc.), gathers multi-source info, and synthesizes structured reports in a Streamlit UI. Features an agentic workflow and source tracking.
🤖 CodeForge AI: An autonomous multi-agent coding system powered by LangGraph for agentic software development and automated workflows. SOTA custom agentic GraphRag, shared-state memory, auto-model routing for cost optimization, and a range of custom tooling.
Lightweight Python CLI for the Exa API (Search, Contents, Find Similar, Answer, Research, Context) with JSON-first output, SSE streaming, and model-aware polling. LLM‑agnostic: integrate with OpenAI Agents SDK/Codex CLI or Claude tool use by invoking CLI commands, no MCP server required.
🤖 Build and interact with Claude Agent using this Python SDK for seamless integration and efficient asynchronous querying.
An advanced agentic workflow implementation using LangGraph and LangChain, featuring iterative research, autonomous planning, and persistent state management for high-quality content generation.
Autonomous ML research loops for Claude Code with mechanical anti-fabrication guards.
Modular multi-agent AI architecture for deep research, long-context reasoning, and reliable execution.
A tool that uses AI agents to help with job applications.
Provider-agnostic multi-agent orchestration runtime with LangGraph, MCP tools, CLI, FastAPI, evidence capture, and citation-aware outputs.
Organize genealogy research with structured AI prompts, vault templates, and workflows for source-backed family history work.
Compare AdaL and Claude Code on Autoresearch benchmarks to find better hyperparameters, run more experiments, and converge faster.