Skip to content

alo-exp/sidekick

Repository files navigation

Sidekick — AI Coding Agents for Claude Code and Codex

AI coding agents for Claude Code and Codex — Sidekick lets the active host AI delegate implementation to Forge or Kay while the host stays focused on planning, review, mentoring, and communication.

Sidekicks

Sidekick Activation surface Agent Status
Forge /forge ForgeCode — #7 Terminal-Bench 2.0 (81.8%) ✅ v0.5.6
Kay kay-delegate OSS Codex-lineage execution agent — Codex CLI #6 Terminal-Bench 2.0, kay exec, MiniMax M2.7, OpenCode Go compatibility ✅ v0.5.6

More sidekicks planned.


Docs

If you want the docs in the right order, start here:

For help-site workflows, use the Help Center.

Installation

Install through the active host's plugin surface:

# Claude Code
/plugin install alo-labs/sidekick

# Codex
codex plugin marketplace add alo-exp/sidekick

On the next host session, Sidekick performs the first-run bootstrap. Runtime readiness is checked when you start Forge or Kay delegation for the current session.


Forge — ForgeCode Sidekick

What it does

  • Auto-installs ForgeCode on first session start and checks readiness when Forge delegation starts for the current session
  • Guides Forge provider setup for MiniMax Coding without putting provider keys in the prompt
  • Turns the host into an orchestrator: Claude Code or Codex plans and communicates, while Forge executes file changes, tests, and commits
  • Fallback ladder: automatic 3-level recovery on failure — L1 Guide (reframe), L2 Handhold (decompose), L3 Take over (sidekick forge-level3 start|stop, project-scoped direct work + DEBRIEF)
  • AGENTS.md mentoring: after every task, the host AI extracts learnings and writes them to a 3-tier instruction store (~/forge/AGENTS.md, ./AGENTS.md, session logs) — delegation gets smarter over time
  • Skill injection: 4 bootstrap skills (testing-strategy, code-review, security, quality-gates) auto-injected into task prompts based on task type
  • Token optimization: task prompts capped at 2,000 tokens with validated .forge.toml compaction defaults

Kay — OSS Codex-Lineage Sidekick

What it does

  • Auto-installs Kay from the pinned alo-labs/kay installer on first session start and keeps legacy code, codex, and coder aliases compatibility-only
  • Activates Kay work through kay-delegate; active Kay mode launches kay exec --full-auto so Kay remains the runtime identity
  • Uses Kay's native agents, skills, subagents, and AGENTS.md support instead of recreating Forge-style prompt injection
  • Supports Claude Code and Codex hosts by running Kay as a child execution process through the packaged kay-delegate skill
  • Defaults to MiniMax MiniMax-M2.7 through Kay-local ~/.kay/config.toml, with OpenCode Go available for multi-AI compatibility
  • Keeps a project-local audit index at .kay/conversations.idx; the canonical Kay workflows live in the delegate and stop skills, with the legacy flat alias preserved only as a hidden compatibility entry at skills/codex-delegate.md.

The website setup shortcuts /forge:delegate and /kay:delegate are shipped alias skills. They route to the canonical /forge and kay-delegate workflows.

Kay flow

You → Claude Code or Codex (plan + communicate) → Kay (implement + commit) → host AI (review + report)

Host AI handles: architecture, explanations, research, code review Kay handles: writing files, features, tests, git commits

Forge flow

You → Claude Code or Codex (plan + communicate) → Forge (implement + commit) → host AI (review + report)

Host AI handles: architecture, explanations, research, code review Forge handles: writing files, features, tests, git commits

Forge provider setup

The host AI will guide you to:

  1. Create MiniMax.io API access at https://platform.minimax.io/subscribe/token-plan.
  2. Store the credential in Forge's native ~/forge/.credentials.json array format.
  3. Point Forge at MiniMax M2.7 through ~/forge/.forge.toml.

The host configures Forge automatically and delegates coding work from that point.

Providers & Models

Provider Model Notes
MiniMax Coding MiniMax M2.7 MiniMax-M2.7 Direct API — get key at https://platform.minimax.io/subscribe/token-plan

Testing

tests/run_release.bash chains the unit suites plus the live Forge/Kay install, smoke, E2E, and Kay marketplace-install gates.

Tier Script Runs without Forge/Kay Purpose
Unit + integration tests/run_all.bash 30 suites — hook classifiers, idx audit, plugin integrity, docs contract, homepage/help-site navigation, social preview, post-release cleanup, clean reinstall bootstrap, and Forge/Kay coverage gaps.
Forge smoke tests/smoke/run_smoke.bash skip forge --version + trivial forge -p round-trip against the real binary.
Forge live E2E tests/run_live_e2e.bash skip Full host→Forge delegation on a seeded-buggy testapp (tests/testapp/) — proves the 5-field prompt shape, tool-use, and verification loop work end-to-end.
Kay marketplace install tests/run_live_codex_marketplace_install.bash skip Installs Sidekick through the Codex marketplace path and verifies the installed Kay and Forge surfaces.
Kay smoke tests/smoke/run_codex_smoke.bash skip kay --version + trivial kay exec round-trip against the real binary, with legacy names kept as compatibility aliases.
Kay live E2E tests/run_live_codex_e2e.bash skip Full host→Kay delegation on the same seeded-buggy testapp — proves the 5-field prompt shape, edit, and verification loop work end-to-end.

The live stages are gated behind SIDEKICK_LIVE_FORGE=1 and SIDEKICK_LIVE_CODEX=1 so they never run in CI. Before tagging a new version:

SIDEKICK_LIVE_FORGE=1 SIDEKICK_LIVE_CODEX=1 bash tests/run_release.bash

Before any release, complete the 4-stage pre-release quality gate until it passes twice in a row, then run the full live Forge/Kay pyramid twice with both live env vars. Each full live run records a current-session quality-gate-live-pyramid marker; the release hook requires two markers before publishing.

After the release is published, run bash tests/post_release_cleanup.bash so the local repo returns to a clean post-release state. This cleanup only removes transient build/cache artifacts; .planning/, docs/specs, and docs/design content stay in place.

Without those env vars the gate still runs stage 1 and cleanly skips the live stages (exit 0), so it's safe to wire into CI.


License

MIT — Ālo Labs

About

Claude Code plugin — Forge and Kay delegation modes, 3-level fallback ladder, AGENTS.md mentoring, auto-install via OpenRouter

Topics

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors