Transilience AI Community Security Tools

Open-source Claude Code skills and agents for AI-powered penetration testing, bug bounty hunting, AI threat testing, and security reconnaissance

Quick Start | Skills & Agents | Architecture | Contributing | Website

Announcement

Practice Makes Perfect: Teaching an AI to Hack by Learning from Its Mistakes (March 2026)

We built an autonomous pentesting agent that scores 100% (104/104) on a published CTF benchmark suite — using only structured markdown skill files, no fine-tuning. Starting from a bare 89.4% baseline, we ran a simple loop roughly 15 times: run the benchmarks, find a failure, diagnose the missing technique, write it into a skill file, and run again. The same skills transfer cross-model: Claude Sonnet 4.6 reaches 96.2% and Claude Haiku 4.5 reaches 62.5%. This repository contains the full skill set described in the paper.

Read the paper (PDF)

Overview

Transilience AI Community Tools is a consolidated Claude Code security testing suite — 23 skills, 8 agents, and 2 tool integrations that cover the full penetration testing lifecycle from reconnaissance to reporting.

Why Choose Transilience Community Tools?

AI-Powered Automation — Claude orchestrates intelligent security testing workflows
Complete OWASP Coverage — 100% OWASP Top 10 + OWASP LLM Top 10
Professional Reporting — CVSS 3.1, CWE, MITRE ATT&CK, Transilience-branded PDF reports
Playwright Integration — Browser automation for client-side vulnerability testing
Payload-Enriched References — 160+ reference files with inline PayloadsAllTheThings techniques
Open Source — MIT licensed for commercial and personal use

Quick Start

1. Clone and enter the project

git clone https://github.com/transilienceai/communitytools.git
cd communitytools/projects/pentest

2. Install tools (optional but recommended)

# Browser automation (XSS, CSRF, clickjacking testing)
.claude/tools/playwright/install.sh

# CLI tools (nmap, sqlmap, nikto, gobuster, ffuf, testssl)
.claude/tools/kali/install.sh

# Verify
.claude/tools/check-all.sh

3. Open Claude Code and run skills

claude    # Launch Claude Code from the projects/pentest directory

Then use slash commands inside the Claude session:

/coordination https://target.com     # Full penetration test
/hackthebox                          # HackTheBox challenge automation
/hackerone                           # Bug bounty workflow
/techstack-identification            # Passive tech stack recon
/reconnaissance target.com           # Attack surface mapping
/source-code-scanning ./app          # Static code analysis

Skills & Agents

All skills and agents live under projects/pentest/.claude/.

Agents (8)

Agent	Role
Pentester Orchestrator	Coordinates pentests — plans, dispatches parallel agent batches, analyzes results, adapts
Pentester Executor	Thin experiment runner — executes specific tests, returns raw results
Pentester Validator	Validates findings against raw evidence — all 5 checks must pass or finding is rejected
HackTheBox	Platform automation — login, challenge selection, VPN, delegates solving, logs proceedings
HackerOne Hunter	Bug bounty automation — scope parsing, parallel testing, PoC validation, submission reports
Script Generator	Generates optimized scripts for pentest agents — parallelization, syntax validation
PATT Fetcher	On-demand PayloadsAllTheThings retrieval when local payloads are insufficient
Skiller	Skill creation and management — scaffolding, validation, GitHub workflow

Skills by Category (23)

Vulnerability Testing (10)

Skill	Coverage
`/injection`	SQL, NoSQL, OS Command, SSTI, XXE, LDAP/XPath
`/client-side`	XSS (Reflected/Stored/DOM), CSRF, Clickjacking, CORS, Prototype Pollution
`/server-side`	SSRF, HTTP Smuggling, Path Traversal, File Upload, Deserialization, Host Header
`/authentication`	Auth Bypass, JWT, OAuth, Password Attacks, 2FA Bypass, CAPTCHA Bypass
`/api-security`	GraphQL, REST API, WebSockets, Web LLM
`/web-app-logic`	Business Logic, Race Conditions, Access Control, Cache Poisoning/Deception, IDOR
`/cloud-containers`	AWS, Azure, GCP, Docker, Kubernetes
`/system`	Active Directory, Privilege Escalation (Linux/Windows), Exploit Development
`/infrastructure`	Port Scanning, DNS, MITM, VLAN Hopping, IPv6, SMB/NetBIOS
`/social-engineering`	Phishing, Pretexting, Vishing, Physical Security

Reconnaissance (3)

Skill	Purpose
`/reconnaissance`	Subdomain discovery, port scanning, endpoint enumeration, API discovery, attack surface mapping
`/osint`	Repository enumeration, secret scanning, git history analysis, employee footprint
`/techstack-identification`	Passive tech stack inference across 17 intelligence domains

Specialized (3)

Skill	Purpose
`/ai-threat-testing`	OWASP LLM Top 10 — prompt injection, model extraction, data poisoning, supply chain
`/cve-poc-generator`	CVE research, NVD lookup, safe Python PoC generation, vulnerability reports
`/source-code-scanning`	SAST — OWASP Top 10, CWE Top 25, dependency CVEs, hardcoded secrets

Platform Integrations (2)

Skill	Purpose
`/hackerone`	Scope CSV parsing, parallel asset testing, PoC validation, platform-ready submissions
`/hackthebox`	Playwright-based login, challenge browsing, VPN management, automated solving

Orchestration & Tooling (5)

Skill	Purpose
`/coordination`	Engagement orchestration, test planning, output structure
`/essential-tools`	Burp Suite, Playwright automation, methodology, reporting standards
`/transilience-report-style`	Transilience-branded PDF report generation (ReportLab)
`/github-workflow`	Git branching, commits, PRs, issues, code review
`/skiller`	Skill scaffolding, validation, GitHub workflow automation

Tool Integrations (2)

Tool	Purpose
Playwright	Browser automation for client-side testing via MCP
Kali Linux Tools	nmap, masscan, nikto, gobuster, ffuf, sqlmap, testssl, and more

Architecture

The suite uses a hybrid AGENTS.md + Skills architecture based on Vercel research showing 100% pass rate vs 53-79% for skills alone:

AGENTS.md (root) — Passive knowledge base, always loaded. Compressed security payloads, methodologies (PTES, OWASP, MITRE), CVSS scoring, PoC standards.
Skills (.claude/skills/) — User-triggered workflows invoked with /skill-name. Multi-step orchestration, parallel agents, checkpointing.
Agents (.claude/agents/) — Autonomous workers spawned by skills and orchestrators.

Multi-Agent Execution Flow

sequenceDiagram
    participant User
    participant Skill as Skill Layer
    participant Orch as Orchestrator Agent
    participant Agents as Specialized Agents
    participant Output as Standardized Outputs

    User->>Skill: /pentest https://target.com
    Skill->>Orch: Initialize 7-phase workflow

    Orch->>Agents: Phase 1-2: Deploy recon agents
    Agents-->>Output: inventory/*.json + analysis/*.md

    Orch->>Agents: Phase 3-4: Deploy vuln agents in parallel
    Note over Agents: SQL/XSS/SSRF/JWT/OAuth/SSTI/XXE...
    Agents-->>Output: findings/*.json + evidence/*.png

    Orch->>Output: Phase 5: Generate reports
    Output-->>User: Executive + technical reports

Repository Structure

communitytools/
├── AGENTS.md                    # Passive security knowledge (always loaded)
├── CLAUDE.md                    # Project instructions
├── marketplace.json             # Machine-readable project manifest
├── papers/                      # Research papers
├── benchmarks/                  # XBOW benchmark runner
└── projects/pentest/            # Main project
    └── .claude/
        ├── agents/              # 8 agent definitions
        │   ├── pentester-orchestrator.md
        │   ├── pentester-executor.md
        │   ├── pentester-validator.md
        │   ├── hackthebox.md
        │   ├── hackerone.md
        │   ├── script-generator.md
        │   ├── patt-fetcher.md
        │   ├── skiller.md
        │   └── reference/       # Output structure, test plan format
        ├── skills/              # 23 skill directories
        │   ├── {skill-name}/
        │   │   ├── SKILL.md     # Skill definition
        │   │   └── reference/   # Attack techniques, cheat sheets, payloads
        │   └── ...
        └── tools/               # Tool integrations
            ├── playwright/
            └── kali/

Contributing

We welcome contributions from the security community!

Read the full guide: CONTRIBUTING.md

Quick path using the Skiller:

/skiller
# Select: CREATE → provide details → automated GitHub workflow
# Handles: issue creation, branch, skill generation, validation, commit, PR

Security & Legal

IMPORTANT: These tools are designed for authorized security testing ONLY.

Authorized & Legal Use:

Penetration testing with written authorization
Bug bounty programs within scope
Security research on your own systems
CTF competitions and training environments
Educational purposes with proper permissions

Prohibited & Illegal Use:

Unauthorized testing of any systems
Malicious exploitation of vulnerabilities
Data theft or system disruption
Any use that violates local or international laws

Users are solely responsible for compliance with all applicable laws and regulations.

Responsible Disclosure

If you discover a vulnerability using these tools:

Do not exploit beyond proof-of-concept
Report immediately to the vendor/organization
Follow responsible disclosure timelines (typically 90 days)
Document thoroughly for remediation

Community & Support

GitHub Discussions — Ask questions, share ideas
GitHub Issues — Report bugs, request features
Website — Commercial products
Email — Enterprise support

Project Stats

Category	Count
Skills	23
Agents	8
Tool Integrations	2
Attack Types	53
Reference Files	160+

Coverage:

OWASP Top 10 (2021) — 100%
OWASP LLM Top 10 (2025) — 100%
SANS Top 25 CWE — 90%+
MITRE ATT&CK TTPs — mapped for all findings

License

Contributors

Built by Transilience AI

Transilience AI specializes in autonomous security testing and AI security operations.

Website | Issues | Discussions

claude-code ai-security penetration-testing bug-bounty owasp llm-security ai-threat-testing security-automation ethical-hacking cybersecurity appsec web-security hackerone hackthebox multi-agent

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
.claude-plugin		.claude-plugin
.github		.github
benchmarks		benchmarks
papers		papers
projects/pentest/.claude		projects/pentest/.claude
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transilience AI Community Security Tools

Announcement

Overview

Why Choose Transilience Community Tools?

Quick Start

1. Clone and enter the project

2. Install tools (optional but recommended)

3. Open Claude Code and run skills

Skills & Agents

Agents (8)

Skills by Category (23)

Vulnerability Testing (10)

Reconnaissance (3)

Specialized (3)

Platform Integrations (2)

Orchestration & Tooling (5)

Tool Integrations (2)

Architecture

Multi-Agent Execution Flow

Repository Structure

Contributing

Security & Legal

Responsible Disclosure

Community & Support

Project Stats

License

Contributors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Transilience AI Community Security Tools

Announcement

Overview

Why Choose Transilience Community Tools?

Quick Start

1. Clone and enter the project

2. Install tools (optional but recommended)

3. Open Claude Code and run skills

Skills & Agents

Agents (8)

Skills by Category (23)

Vulnerability Testing (10)

Reconnaissance (3)

Specialized (3)

Platform Integrations (2)

Orchestration & Tooling (5)

Tool Integrations (2)

Architecture

Multi-Agent Execution Flow

Repository Structure

Contributing

Security & Legal

Responsible Disclosure

Community & Support

Project Stats

License

Contributors

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages