AI coding agents for Claude Code and Codex — Sidekick lets the active host AI delegate implementation to Forge or Kay while the host stays focused on planning, review, mentoring, and communication.
| Sidekick | Activation surface | Agent | Status |
|---|---|---|---|
| Forge | /forge |
ForgeCode — #7 Terminal-Bench 2.0 (81.8%) | ✅ v0.5.6 |
| Kay | kay-delegate |
OSS Codex-lineage execution agent — Codex CLI #6 Terminal-Bench 2.0, kay exec, MiniMax M2.7, OpenCode Go compatibility |
✅ v0.5.6 |
More sidekicks planned.
If you want the docs in the right order, start here:
- Start Here — task-first navigation
- Audience — who each doc is for
- Glossary — canonical terminology
- Compatibility — Claude Code, Codex, Forge, and Kay runtime differences
- ADR index — durable docs-system decisions
For help-site workflows, use the Help Center.
Install through the active host's plugin surface:
# Claude Code
/plugin install alo-labs/sidekick
# Codex
codex plugin marketplace add alo-exp/sidekickOn the next host session, Sidekick performs the first-run bootstrap. Runtime readiness is checked when you start Forge or Kay delegation for the current session.
- Auto-installs ForgeCode on first session start and checks readiness when Forge delegation starts for the current session
- Guides Forge provider setup for MiniMax Coding without putting provider keys in the prompt
- Turns the host into an orchestrator: Claude Code or Codex plans and communicates, while Forge executes file changes, tests, and commits
- Fallback ladder: automatic 3-level recovery on failure — L1 Guide (reframe), L2 Handhold (decompose), L3 Take over (
sidekick forge-level3 start|stop, project-scoped direct work + DEBRIEF) - AGENTS.md mentoring: after every task, the host AI extracts learnings and writes them to a 3-tier instruction store (
~/forge/AGENTS.md,./AGENTS.md, session logs) — delegation gets smarter over time - Skill injection: 4 bootstrap skills (testing-strategy, code-review, security, quality-gates) auto-injected into task prompts based on task type
- Token optimization: task prompts capped at 2,000 tokens with validated
.forge.tomlcompaction defaults
- Auto-installs Kay from the pinned
alo-labs/kayinstaller on first session start and keeps legacycode,codex, andcoderaliases compatibility-only - Activates Kay work through
kay-delegate; active Kay mode launcheskay exec --full-autoso Kay remains the runtime identity - Uses Kay's native agents, skills, subagents, and
AGENTS.mdsupport instead of recreating Forge-style prompt injection - Supports Claude Code and Codex hosts by running Kay as a child execution process through the packaged
kay-delegateskill - Defaults to MiniMax
MiniMax-M2.7through Kay-local~/.kay/config.toml, with OpenCode Go available for multi-AI compatibility - Keeps a project-local audit index at
.kay/conversations.idx; the canonical Kay workflows live in the delegate and stop skills, with the legacy flat alias preserved only as a hidden compatibility entry atskills/codex-delegate.md.
The website setup shortcuts /forge:delegate and /kay:delegate are shipped alias skills. They route to the canonical /forge and kay-delegate workflows.
You → Claude Code or Codex (plan + communicate) → Kay (implement + commit) → host AI (review + report)
Host AI handles: architecture, explanations, research, code review Kay handles: writing files, features, tests, git commits
You → Claude Code or Codex (plan + communicate) → Forge (implement + commit) → host AI (review + report)
Host AI handles: architecture, explanations, research, code review Forge handles: writing files, features, tests, git commits
The host AI will guide you to:
- Create MiniMax.io API access at https://platform.minimax.io/subscribe/token-plan.
- Store the credential in Forge's native
~/forge/.credentials.jsonarray format. - Point Forge at MiniMax M2.7 through
~/forge/.forge.toml.
The host configures Forge automatically and delegates coding work from that point.
| Provider | Model | Notes |
|---|---|---|
| MiniMax Coding | MiniMax M2.7 MiniMax-M2.7 |
Direct API — get key at https://platform.minimax.io/subscribe/token-plan |
tests/run_release.bash chains the unit suites plus the live Forge/Kay install, smoke, E2E, and Kay marketplace-install gates.
| Tier | Script | Runs without Forge/Kay | Purpose |
|---|---|---|---|
| Unit + integration | tests/run_all.bash |
✅ | 30 suites — hook classifiers, idx audit, plugin integrity, docs contract, homepage/help-site navigation, social preview, post-release cleanup, clean reinstall bootstrap, and Forge/Kay coverage gaps. |
| Forge smoke | tests/smoke/run_smoke.bash |
skip | forge --version + trivial forge -p round-trip against the real binary. |
| Forge live E2E | tests/run_live_e2e.bash |
skip | Full host→Forge delegation on a seeded-buggy testapp (tests/testapp/) — proves the 5-field prompt shape, tool-use, and verification loop work end-to-end. |
| Kay marketplace install | tests/run_live_codex_marketplace_install.bash |
skip | Installs Sidekick through the Codex marketplace path and verifies the installed Kay and Forge surfaces. |
| Kay smoke | tests/smoke/run_codex_smoke.bash |
skip | kay --version + trivial kay exec round-trip against the real binary, with legacy names kept as compatibility aliases. |
| Kay live E2E | tests/run_live_codex_e2e.bash |
skip | Full host→Kay delegation on the same seeded-buggy testapp — proves the 5-field prompt shape, edit, and verification loop work end-to-end. |
The live stages are gated behind SIDEKICK_LIVE_FORGE=1 and SIDEKICK_LIVE_CODEX=1 so they never run in CI. Before tagging a new version:
SIDEKICK_LIVE_FORGE=1 SIDEKICK_LIVE_CODEX=1 bash tests/run_release.bashBefore any release, complete the 4-stage pre-release quality gate until it passes twice in a row, then run the full live Forge/Kay pyramid twice with both live env vars. Each full live run records a current-session quality-gate-live-pyramid marker; the release hook requires two markers before publishing.
After the release is published, run bash tests/post_release_cleanup.bash so the local repo returns to a clean post-release state.
This cleanup only removes transient build/cache artifacts; .planning/, docs/specs, and docs/design content stay in place.
Without those env vars the gate still runs stage 1 and cleanly skips the live stages (exit 0), so it's safe to wire into CI.
MIT — Ālo Labs