Intercept and inspect Coding Agent API traffic from Claude Code, Codex CLI, Gemini CLI, Cursor CLI, OpenCode, Kimi, Pi, and Hermes in a local trace viewer.
The open-source MultiAgentOps evaluation and verification harness for business workflows in any industry.
🔍 AI observability skill for Claude Code. Debug LangChain/LangGraph agents by fetching execution traces from LangSmith Studio directly in your terminal.
Local open-source dev tool to debug, secure, and evaluate LLM agents. Provides static analysis, dynamic security checks, and runtime monitoring - integrates with Cursor and Claude Code.
Cut your OpenClaw / ZeroClaw token bill. Find which model earns its cost. Prove whether optimizations actually work. Local, no upload.
Local replay debugger for Browser Use failures with screenshots, model I/O, failed-step timelines, and public-safe HTML exports.
Diagnose your AI agents in production. Extract policies from prompts, evaluate traces, generate diagnostic reports.
Visual debugging, tracing, and replay for agent workflows.
Explain why your agent failed — root-cause debugging, memory attribution, and run divergence for LLM agents.
🔍 A beautiful web viewer for AI agent session files. Browse Claude Code & OpenClaw conversations with chat-style UI, timeline visualization, and zero setup.
A real-time observability and debugging layer for AI agents.
ChainWatch is a flight data recorder for multi-step AI systems. It's a CLI-based tool that records every step in an AI decision chain, links them together in order, prevents tampering, and allows you to verify the chain's integrity and replay the full decision flow.
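The "links them together in order, prevents tampering" behavior described above is the classic tamper-evident hash chain. A minimal sketch of that technique (this is an illustrative reimplementation, not ChainWatch's actual code; the function names are hypothetical):

```python
import hashlib
import json


def append_step(chain, payload):
    """Append a step whose hash covers the previous step's hash,
    so editing any earlier step breaks every later link."""
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    body = json.dumps({"payload": payload, "prev": prev_hash}, sort_keys=True)
    chain.append({
        "payload": payload,
        "prev": prev_hash,
        "hash": hashlib.sha256(body.encode()).hexdigest(),
    })
    return chain


def verify(chain):
    """Recompute every link in order; False means the chain was altered."""
    prev_hash = "0" * 64
    for step in chain:
        body = json.dumps({"payload": step["payload"], "prev": prev_hash},
                          sort_keys=True)
        if step["prev"] != prev_hash:
            return False
        if step["hash"] != hashlib.sha256(body.encode()).hexdigest():
            return False
        prev_hash = step["hash"]
    return True
```

Replaying the decision flow is then just iterating the verified chain in order; integrity checking and replay fall out of the same structure.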
Local recorder and replay verifier for AI-agent command runs.
Failure attribution for agent pipelines — find which span caused the failure and what kind of fix it needs.
RunLens helps teams compare and debug AI agent runs with step timelines, run diffs, and cost analysis.
Android Agent Reliability Runtime: a debugging and safety runtime for mobile GUI agents. Detect readiness, block unsafe actions, verify progress, diagnose failures, and save reproducible traces.
Enforce communication discipline & execution hygiene for agent teams. Detect loops, route violations, stale work, and missing ownership.
Verify that AI agents actually executed API/tool calls they claim.
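One simple form of that check is a multiset diff between the calls an agent reports and the calls an instrumented client actually recorded. A hedged sketch of the idea (the function name and trace format here are hypothetical, not this project's API):

```python
from collections import Counter


def unverified_claims(claimed, executed):
    """Return tool/API calls the agent claims to have made but that
    never appear in the recorded execution trace (multiset difference,
    so repeated calls must be backed by repeated trace entries)."""
    return list((Counter(claimed) - Counter(executed)).elements())
```

Anything returned is a claim with no matching trace entry and warrants investigation.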
LangGraph pipeline that diagnoses failed agent runs: classifies failures, grounds them in source code, pauses at a human approval gate, generates regression tests, writes the customer memo, and learns reusable patterns for future diagnoses.