Everything claw-forge offers

A complete picture of what makes claw-forge the right harness for serious autonomous coding pipelines.

7+
AI providers
18
built-in skills
427
tests passing
0
Node.js required
Feature Status Details
Language & Installation
Languageโœ…Pure Python โ€” no Node.js, no Bun, no npm
Package managerโœ…uv tool install claw-forge โ€” isolated, instant, no venv ceremony
PyPI packageโœ…Also installable via pip install claw-forge
AI Provider Support
API rotation poolโœ…Round-robin and weighted routing across 7+ providers with automatic failover
Anthropic (direct)โœ…Direct API key or OAuth token from claude login
AWS Bedrockโœ…IAM credentials or instance role โ€” no key management
Azure AI Foundryโœ…Azure OpenAI-compat endpoint with managed identity support
Google Vertex AIโœ…ADC credentials, auto model name conversion (@ format)
Groq (free tier)โœ…14,400 req/day free โ€” ideal for monitoring and lightweight tasks
Cerebras (free tier)โœ…1M tokens/day free โ€” Llama 3.3 70B at near-instant inference
Ollama (local)โœ…Any locally-running model via Ollama's OpenAI-compat API
Anthropic-compat proxiesโœ…Custom base_url with x-api-key auth โ€” works with any proxy
Circuit breaker per providerโœ…Closed โ†’ Half-open โ†’ Open state machine; auto-recovery after cooldown
Provider health dashboardโœ…Live health dots in Kanban UI โ€” RPM, latency, cost per provider
Per-provider cost trackingโœ…USD cost tracked per session and per provider
OAuth token supportโœ…Auto-reads ~/.claude/.credentials.json; re-reads on 401
Claude Agent SDK Integration
Bidirectional sessionsโœ…ClaudeSDKClient โ€” mid-session follow-ups, model switching, interrupt
In-process MCP serverโœ…Feature DB tools run in-process โ€” zero subprocess cold-start overhead
File checkpointing + rewindโœ…Rewind all files to any prior checkpoint without git
Pre-compact hookโœ…Custom compaction instructions preserve feature state across context limits
Structured JSON outputโœ…Schema-enforced output from reviewer and planning agents
Thinking configโœ…Deep thinking for planning, adaptive for coding, disabled for monitoring
Named sub-agentsโœ…AgentDefinition โ€” planner/coder/reviewer with separate prompts and tool sets
Cost cap per sessionโœ…max_budget_usd โ€” hard stop when budget is hit
Token-level streamingโœ…StreamEvent for typewriter effect in terminal UI
Architecture & Concurrency
Concurrency modelโœ…Pure asyncio.TaskGroup โ€” no subprocess+threading mix
State managementโœ…FastAPI REST + SQLAlchemy + WebSocket โ€” clean separation of concerns
Session hydrationโœ…session_manifest.json survives restarts and process crashes
Plugin systemโœ…pyproject.toml entry points โ€” third-party plugins without forking core
Dependency-aware schedulingโœ…Kahn's algorithm + DFS cycle detection โ€” features run in correct order
Security
Bash security hookโœ…Hierarchical allowlist: hardcoded blocklist โ†’ global defaults โ†’ project-specific
CanUseTool callbackโœ…Programmatic permission control with input mutation before execution
OS-level sandboxโœ…SandboxSettings โ€” filesystem + network isolation at OS level (macOS/Linux)
Write restriction to project dirโœ…File writes sandboxed to the project directory automatically
Agent lock fileโœ….claw-forge.lock prevents duplicate agents on the same project
Skills & Built-in Tooling
LSP skills (Python)โœ…Pyright โ€” type checking, autocomplete, go-to-definition
LSP skills (Go)โœ…gopls โ€” full Go language intelligence
LSP skills (Rust)โœ…rust-analyzer โ€” borrow checker integration, refactoring
LSP skills (TypeScript)โœ…ts-server โ€” JS/TS type checking and navigation
LSP skills (Solidity)โœ…Solidity LSP โ€” smart contract analysis
LSP skills (C/C++)โœ…clangd โ€” C/C++ intelligence and formatting
Systematic debug skillโœ…Structured root-cause analysis workflow
Verification gate skillโœ…Run checks before claiming task complete
Parallel dispatch skillโœ…Route subtasks to parallel agents automatically
Frontend design skillโœ…Production-grade UI design guidance for web agents
Playwright browser skillโœ…Browser automation for web testing agents
Workflow Features
YOLO modeโœ…--yolo โ€” max concurrency, auto-approve permissions, skip verification
Pause / resumeโœ…Drain mode: finish active features, then pause gracefully
Human input requestsโœ…Agent raises needs_human flag; claw-forge input CLI unblocks it
Batch feature modeโœ…Implement multiple features per session with --batch-size
Slash commands (.claude/)โœ…create-spec, expand-project, check-code, checkpoint, review-pr, pool-status
Session resumeโœ…Continue or fork any prior session by ID
Rate limit handlingโœ…Parse retry-after headers, exponential backoff, auto-resume after cooldown
UI & Monitoring
Kanban boardโœ…5-column board: Pending / In Progress / Passing / Failed / Blocked
Provider health dotsโœ…Green/amber/red per provider; click for RPM, latency, circuit state
Real-time WebSocket updatesโœ…Feature status, agent events, cost โ€” live, no polling
Documentation & Quality
Tutorial websiteโœ…This site โ€” quickstart, provider setup, plugin guide, skills reference
SDK API guideโœ…20 Claude Agent SDK APIs documented with claw-forge examples (docs/sdk-api-guide.md)
Test coverage โ‰ฅ 90%โœ…Enforced in CI โ€” 427 tests, all passing
Type annotations (strict)โœ…Full mypy strict โ€” no Any escapes in core modules
GitHub CI/CDโœ…Lint + typecheck + full test suite on every push and PR

What we're building next

Honest about what's still in progress.

๐ŸŒ Multi-machine Agents

Distribute agent waves across multiple machines. The pool manager handles routing โ€” we need the distributed work queue layer.

โ˜๏ธ Cloud State Backend

Currently SQLite โ€” great for local development. Adding PostgreSQL and a hosted cloud option for team use.

๐Ÿงฉ VS Code Extension

Embed the Kanban UI directly in VS Code. See agent progress without leaving your editor.

๐Ÿ“Š Cost Analytics Dashboard

Historical cost breakdown per project, per provider, per feature โ€” so you can optimize spend over time.

Get started โ†’    Star on GitHub โญ