Give any AI agent the ability to crawl, observe, and analyze websites.
5 skills, 1 agent, and an MCP server — packaged as a plugin that follows the Agent Skills open standard. The skills are plain Markdown files that encode domain judgment: when to use which settings, how to interpret observations, what constitutes a finding. The plugin format is just the distribution mechanism.
| Requirement | Description |
|---|---|
| Crawlio | macOS app, installed and running — download |
| CrawlioMCP | MCP server binary (see below) |
| AI tool | Any tool with MCP support (Claude Code, Gemini CLI, Cursor, Windsurf, etc.) |
cd /path/to/Crawlio-app
swift build -c release --product CrawlioMCPBinary lands at .build/release/CrawlioMCP.
Claude Code — plugin install:
claude plugin install /path/to/crawlio-pluginGemini CLI — add to your MCP server config:
{
"mcpServers": {
"crawlio": {
"command": "CrawlioMCP"
}
}
}Other MCP clients (Cursor, Windsurf, etc.) — copy the .mcp.json contents into your client's MCP config. The skills in skills/ work as standalone Markdown instructions in any agent that supports them.
ln -sf /path/to/Crawlio-app/.build/release/CrawlioMCP /usr/local/bin/CrawlioMCPOr edit .mcp.json to use a full path:
{
"mcpServers": {
"crawlio": {
"command": "/path/to/Crawlio-app/.build/release/CrawlioMCP"
}
}
}Launch the Crawlio macOS app. It starts a local HTTP control server automatically.
| Skill | Description |
|---|---|
crawl-site |
Crawl with intelligent config, monitoring, and retry |
extract-and-export |
Full pipeline: crawl, extract, export in 7 formats |
observe |
Query the observation timeline with filters |
finding |
Create and query evidence-backed findings |
audit-site |
Multi-pass site audit with findings report |
Crawl a website with intelligent configuration. Detects site type (static, SPA, CMS, docs), optimizes settings, monitors progress, retries failures, and reports results.
/crawlio:crawl-site https://example.com
End-to-end pipeline: crawl a site, extract structured content (clean HTML, markdown, metadata), and export in any of 7 formats.
/crawlio:extract-and-export https://docs.stripe.com 5 warc
Supported formats: folder zip singleHTML warc pdf extracted deploy
Query the observation log — the append-only timeline of everything Crawlio saw during a crawl. Filter by host, source, operation type, or time range.
/crawlio:observe example.com
Create and query evidence-backed findings. Record insights with observation IDs as evidence that persist across sessions.
/crawlio:finding
Full site audit: crawl, capture enrichment, analyze observations across multiple passes, and produce a findings report with prioritized recommendations.
/crawlio:audit-site https://example.com
A custom agent (agents/site-auditor.md) for systematic multi-pass site analysis:
- Reconnaissance — detect site type, configure settings
- Crawl — download with monitoring and failure retry
- Analysis — structure, errors, enrichment, synthesis (4 passes)
- Report — evidence-backed findings with prioritized recommendations
AI Agent ──skill──► CrawlioMCP ──HTTP──► Crawlio App
(stdio MCP) (macOS, 127.0.0.1)
│
▼
observations.jsonl
(per-project timeline)
Skills encode judgment — when to use which settings, how to interpret observations, what constitutes a finding.
MCP server handles mechanics — HTTP calls, file reads, protocol bridging.
This separation is what makes the plugin forkable: swap the judgment layer for your domain, keep the same mechanics.
For deeper analysis, install the Crawlio Agent Chrome extension. It captures browser-side intelligence (framework detection, network requests, console logs, DOM snapshots) that enriches the observation log.
This plugin is designed to be forked. See FORKING.md for a guide on creating domain-specific versions:
- SEO Auditor — meta tags, heading hierarchy, structured data, internal linking
- Security Scanner — HTTPS enforcement, security headers, exposed endpoints
- Competitive Analysis — multi-site framework comparison, third-party services
- Content Migration Planner — URL mapping, redirect chains, content volume