Eli

Hook-first AI agent framework in Rust.
Single binary. CLI + gateway runtime. Governed self-evolution built in.

Website · Quick Start · Self-Evolution · Architecture · Issues

Eli is for people who want an agent runtime, not a notebook demo.

It is built around a fixed turn pipeline, explicit hooks, append-only tape history, and a provider-agnostic LLM layer. You can run it in a terminal, wire it into Telegram, or expose it through a gateway and sidecar-backed channels such as Feishu, WeChat, Slack, Discord, and DingTalk.

eli chat
eli run "summarize this repo"
eli gateway

Why Eli

LangChain, AutoGen, and crewAI are good at getting an agent demo running. Eli is optimized for people who want a cleaner runtime shape they can keep in production:

Rust workspace, not Python orchestration glue.
Single binary deployment for the core runtime.
Hook-first architecture instead of subclass piles and hidden control flow.
Tape-backed history that is inspectable, forkable, and replayable.
Governed self-evolution that does not let the model silently rewrite its own core prompt.

What You Get

Capability	What it means
Hook-first runtime	Override session resolution, prompt building, model execution, state persistence, or outbound rendering independently
Stable turn pipeline	`resolve_session → load_state → build_prompt → run_model → save_state → render_outbound → dispatch_outbound`
Single binary core	Core runtime ships as one Rust binary; no Python runtime required for the main agent
Provider-agnostic LLM layer	`nexil` handles streaming, tool schema, tape storage, OAuth, and provider routing
Tape history	Append-only session history with anchoring, forking, search, and viewer support
Skills	`SKILL.md` discovery with project/global precedence and markdown-native authoring
Gateway mode	Run as a listener for Telegram or via sidecar-backed channels
Governed self-evolution	Candidate capture, deterministic evaluation, canary promotion, rollback, and automation journal

Quick Start

git clone https://github.com/cklxx/eli.git
cd eli
cargo build --release
cp env.example .env

Then add your provider credentials and run:

eli chat
eli run "summarize this repo"
eli gateway

Useful project tasks:

just doctor
just check
just test-rust
just test-all

Self-Evolution

Eli now includes a governed self-evolution loop.

The model cannot silently rewrite its own core prompt. Experience is distilled from tape evidence into candidates, then pushed through a controlled lifecycle:

distill -> evaluate -> canary -> observe -> promote / rollback

Candidates share one lifecycle across four unified artifact kinds:

Kind	Materialized to
`prompt_rule`	`.agents/evolution/rules/` (bundle: `rules.bundle.md`)
`compiled_knowledge`	`.agents/evolution/knowledge/` (bundle: `knowledge.bundle.md`)
`runtime_policy`	`.agents/evolution/runtime-policies/` (bundle: `runtime_policy.bundle.json`)
`skill`	`.agents/skills/<name>/SKILL.md`

What is automated:

Tape evidence can be distilled into candidates of any artifact kind.
Candidates are deterministically evaluated before promotion.
Low-risk candidates can be auto-staged as canaries.
Repeated observations can auto-promote a canary.
Every action is written to an automation journal.

What stays governed:

Core persona prompt is not live-edited.
Every artifact is materialized as a managed fragment, not an opaque prompt mutation.
Rollback stays local to the promoted fragment.

Example commands:

eli evolution distill <tape> --persist
eli evolution evaluate <candidate_id>
eli evolution auto-run <tape>
eli evolution capture-knowledge <artifact> --summary "…" --content "…"
eli evolution capture-runtime-policy <artifact> --summary "…" --content '{…}'
eli evolution history --limit 20
eli evolution list

CLI Surface

Command	What it does
`eli chat`	Interactive REPL
`eli run "prompt"`	One-shot turn
`eli gateway`	Start channel listeners
`eli login`	Authenticate a provider
`eli use`	Switch profile
`eli model`	Show or switch model
`eli status`	Config and auth overview
`eli tape`	Tape viewer web UI
`eli decisions`	Persistent decision management
`eli evolution`	Distill, evaluate, auto-run, inspect history, promote, rollback

Gateway & Channels

Native:

ELI_TELEGRAM_TOKEN=xxx eli gateway

Sidecar-backed via OpenClaw:

Feishu
WeChat
Slack
Discord
DingTalk

The sidecar is used for channels and plugin-backed integrations. The core Eli runtime still stays a single Rust binary.

Architecture

Interactive diagram · Video walkthrough

Workspace layout:

Component	Role
`crates/nexil`	Provider-agnostic LLM toolkit: transport, streaming, tools, tape, OAuth
`crates/eli`	Agent runtime: hooks, channels, tools, skills, prompt builder, evolution
`sidecar/`	OpenClaw bridge for plugin-backed channels and MCP-style integrations

Turn pipeline:

resolve_session
-> load_state
-> build_prompt
-> run_model
-> save_state
-> render_outbound
-> dispatch_outbound

The design goal is explicit interception points with minimal hidden state.

Skills

Skills are markdown files with YAML frontmatter.

Discovery order:

.agents/skills/<name>/SKILL.md
~/.eli/skills/<name>/SKILL.md
Builtin or synthesized skill sources

That gives you local project override without inventing a new packaging format.

Configuration

Variable	Default	Description
`ELI_MODEL`	`openai:gpt-4o-mini`	`provider:model` identifier
`ELI_API_KEY`	—	Provider API key
`ELI_API_BASE`	—	Custom endpoint
`ELI_MAX_STEPS`	`50`	Max tool iterations per turn
`ELI_TELEGRAM_TOKEN`	—	Telegram bot token
`ELI_WEBHOOK_PORT`	`3100`	Webhook port
`ELI_HOME`	`~/.eli`	Config, tape, and runtime data directory
`ELI_TRACE`	—	Trace logging
`ELI_EVOLUTION_DISABLED`	—	Disable background auto-evolution loop when set to `1`/`true`

Profiles live in ~/.eli/config.toml.

Example profiles:

active_profile = "deepseek"

[profiles.deepseek]
provider = "deepseek"
model = "deepseek:deepseek-v4-pro"
api_base = "https://api.deepseek.com/beta"

[profiles.openai]
provider = "openai"
model = "openai:gpt-5.5"

OpenAI-family profiles, including openai:gpt-5.5, keep Eli's tape session_id local. The public OpenAI API rejects a top-level session_id, while local Eli sessions and provider-specific adapters still preserve the runtime session context.

Model A/B Benchmark

Run the live hard-tail comparison suite through Eli:

scripts/run_model_comparison_suite.py --keep-eli-home

The default pair is DSv4 (deepseek:deepseek-v4-pro at https://api.deepseek.com/beta) versus GPT-5.5 (openai:gpt-5.5). The suite uses the same Eli CLI path for both models and keeps only hard-tail cases that stress planning, programming, writing, analysis, exploration, frontier-science reasoning, Mac-only Metal-vs-MLX performance work, multi-turn memory, tool evidence replay, tape handoff, and subagent orchestration. The current cases avoid saturated keyword prompts; they use fixture evidence, numeric gates, strict timing constraints, stale subagent conflicts, and long multi-turn correction/redaction pressure. Results are written to tests/snapshots/model_comparison_latest.json.

The runner includes a benchmark-local output compatibility layer for common model issues such as fenced JSON, decimal percentage spellings, stdout truncation, empty/truncated structured output diagnostics, and one bounded same-session JSON repair turn. Disable it with ELI_AB_COMPAT=0 or --no-output-compat when auditing whether those compatibility fixes can be removed. The reported total score uses the raw first answer; repair is recorded separately under compat_total so format fixes cannot inflate the ability score. The latest checked snapshot scored DSv4 at 72/100 and GPT-5.5 at 81/100; repair telemetry was 74/100 and 83/100, respectively.

Metal performance cases are macOS/Darwin-only. They require a hand-written Metal path to beat the MLX baseline before merge; Linux or CUDA validation is not part of this benchmark.

Development

Repository layout:

Rust workspace in crates/
Python integration tests in tests/
TypeScript sidecar in sidecar/
Project docs in docs/

Common workflows:

just doctor
just check
just test-rust
just test-py
just test-sidecar
just test-all

Docs entrypoint: docs/index.md

License

Apache-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 338 Commits
.agents		.agents
.aiden		.aiden
.claude		.claude
.context		.context
.github/workflows		.github/workflows
.openmax/reports		.openmax/reports
assets		assets
crates		crates
docs		docs
scripts		scripts
sidecar		sidecar
site		site
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
TODOS.md		TODOS.md
demo.tape		demo.tape
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
env.example		env.example
justfile		justfile
rust-toolchain.toml		rust-toolchain.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Eli

Why Eli

What You Get

Quick Start

Self-Evolution

CLI Surface

Gateway & Channels

Architecture

Skills

Configuration

Model A/B Benchmark

Development

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Eli

Why Eli

What You Get

Quick Start

Self-Evolution

CLI Surface

Gateway & Channels

Architecture

Skills

Configuration

Model A/B Benchmark

Development

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages