Build production-ready apps,
pipeline by pipeline.
CodeSleuth is a 7-agent software engineering pipeline powered by Python scripts and Go goroutines. From product idea to SHIP verdict โ with human gates at every critical transition, parallel task execution, staged write approval, continuous SAST, coverage enforcement, and session forking.
Directed by Agent 0 โ Orchestrator ยท human gates between every stage
What's New in v3.0
Eight new capabilities ship in v3.0, powered by 17 Python + Go pipeline scripts that integrate directly into agent workflows.
parallel_runner.go. Write-conflict detection automatically splits conflicting tasks into sequential sub-batches.staged_write.py. No file is modified without explicit approval.!fork. session_fork.py manages isolated copies with git branches. Use !merge to adopt the winner and clean up.!replay TASK-NNN. pipeline_replay.py resets the codebase git state and marks later tasks PENDING.semantic_search.py surfaces relevant existing code before new code is written.!budget. token_tracker.py runs after every agent turn to keep spending visible.migration_safety.py. HIGH severity risks block the pipeline until explicitly approved.Explore the Docs
What is CodeSleuth AI?
CodeSleuth AI is a structured, multi-agent software development system. Each agent has a precisely defined role, a set of absolute rules, and standardized artifact outputs that feed directly into the next stage.
- No agent auto-advances. Every stage transition requires a human gate command (
!plan,!build,!security,!verify). - RARV Cycle. Every agent follows Reason โ Act โ Reflect โ Verify before producing output.
- Full-contract Planning. Agent 2 produces TDD, INTERFACES, SCHEMA, TASK-GRAPH, Zod contracts, and the Parallelism Map in a single pass triggered by
!plan. - Pattern Library. Every non-obvious implementation decision is captured in
PATTERN_LIBRARY.mdand reused across projects automatically. - Machine-Readable Contracts. The Planner emits Zod schemas and OpenAPI stubs that the Verifier independently validates โ not just documentation.
- Spec Change Protocol. Mid-build scope changes go through a formal
!changeflow โ blast radius report, CHANGE_LOG.md entry, and STALE task marking โ before any implementation starts. - Performance Budget Gate. Bundle sizes and API response times are enforced by
scripts/perf.mjswith configurable budgets in.agent/PERF_BUDGET.json. - 12-Dimension Product Scorecard. The Critic scores every product on 12 dimensions (1โ5) before deep-dive critique. A score of 1 on any dimension is an automatic HOLD.
- Multi-tool compatible.
CLAUDE.mdis the canonical agent activation file. Runnode .cli/sync/sync-cli-instructions.mjsto propagate rules to Cline, Continue, Cursor, Copilot, Kiro, OpenCode, and 7 other tools. Each rule file embeds the Agent 0 Startup Protocol verbatim so all models produce a consistent banner.
Type "Start agent 0" in any supported AI CLI or IDE to activate the Orchestrator. It will display the startup banner, initialize pipeline state, and guide you through the first human gate. See the Quickstart Guide for full details.
Pipeline at a Glance
| Agent | Role | Gate Command | Primary Output |
|---|---|---|---|
| 0 โ Orchestrator | Director, never builder | Start agent 0 |
Pipeline state, gate enforcement, CHANGE_LOG.md |
| 1 โ Discovery | Spec author | Initial prompt | feature-spec.md, HANDOFF.json v1 |
| 2 โ Planning | Technical architect | !plan |
TDD.md, TASK-GRAPH.md, INTERFACES.md, contracts/ |
| 3 โ Builder | Implementation engine | !build |
Complete codebase, CHECKPOINT.md, DECISIONS.md |
| 4 โ Critic | Product accountability | !critic |
CRITICISM.md, 12-D scorecard, product verdict |
| 5 โ Security | 20-domain auditor | !security |
SECURITY_REPORT.md, HANDOFF.json v5 |
| 6 โ Verifier | QA & release gate | !verify |
VERIFICATION_REPORT.md, SHIP/NO-SHIP verdict |
v3.0 Upgrades at a Glance
| Feature | What It Adds | Agents |
|---|---|---|
| Full-Contract Planning | Single-pass: TDD, INTERFACES, SCHEMA, TASK-GRAPH, Zod contracts, Parallelism Map | 2 |
| Spec Change Protocol | !change โ blast radius โ STALE marking | 0, 3 |
| Pattern Library | Compounding PATTERN-NNN knowledge across projects | 2, 3 |
| Machine-Readable Contracts | Zod + OpenAPI stubs validated by Verifier | 2, 3, 5 |
| Performance Budget Gate | Bundle KB + API p50/p95 enforcement | 3, 5 |
| 12-Dimension Scorecard | D1โD12 product scores with auto-HOLD at 1 | 6 |
| Change Impact Tracking | Reverse CIA โ file ownership regression detection | 3 |
| Living Documentation | !docs generates human-readable changelogs | 0 |
v3.0 ships 17 Python + Go scripts wired into agent workflows and a tracker of 29 implemented enhancements. See Pipeline Scripts and Suggestion Tracker for the full details.
Quick Links
- โก Quickstart โ run your first session
- ๐๏ธ Architecture โ RARV cycle, artifacts, tiers
- โจ๏ธ Command Reference โ all commands in one place
- ๐งฉ Pattern Library โ reusable implementation patterns
- ๐ Platform Capabilities โ build targets, toolchains, and requirements
- ๐ HANDOFF.json Schema โ v1 through v6 specification
- ๐ Pipeline Scripts โ 17 Python + Go tools reference
- ๐ก Suggestion Tracker โ 29 implemented enhancements