Record and replay AI agent execution for debugging
Config is the same across clients — only the file and path differ.
{
"mcpServers": {
"io-github-mdfifty50-boop-agent-replay": {
"command": "<see-readme>",
"args": []
}
}
}Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
Record and replay AI agent execution for debugging
No automated test available for this server. Check the GitHub README for setup instructions.
Five weighted categories — click any category to see the underlying evidence.
No known CVEs.
No package registry to scan.
This server is missing a description. Tools and install config are also missing.If you've used it, help the community.
Add informationBe the first to review
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Others in entertainment / ai-ml
A Model Context Protocol (MCP) server and CLI that provides tools for agent use when working on iOS and macOS projects.
Dynamic problem-solving through sequential thought chains
A Model Context Protocol server for searching and analyzing arXiv papers
The official Python SDK for Model Context Protocol servers and clients
MCP Security Weekly
Get CVE alerts and security updates for io.github.mdfifty50-boop/agent-replay and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.
MCP server for agent session recording and replay — debug non-deterministic agent behavior with session comparison and divergence detection.
Record every action an agent takes, replay sessions step by step, diff two runs to find behavioral regressions, and pinpoint exactly where an agent diverged from expected output.
npx agent-replay-mcp
Add to claude_desktop_config.json:
{
"mcpServers": {
"agent-replay": {
"command": "npx",
"args": ["agent-replay-mcp"]
}
}
}
git clone https://github.com/mdfifty50-boop/agent-replay-mcp.git
cd agent-replay-mcp
npm install
node src/index.js
Start recording all actions for an agent session.
| Param | Type | Default | Description |
|---|---|---|---|
agent_id | string | required | Unique agent identifier |
metadata | object | {} | Optional metadata (task, model, environment) |
Returns a session_id for use with other tools.
Stop recording and return a session summary.
| Param | Type | Description |
|---|---|---|
session_id | string | Session ID from record_session |
Returns: action count, total duration, action type breakdown.
Log a single action during a recording session.
| Param | Type | Default | Description |
|---|---|---|---|
session_id | string | required | Active session ID |
action_type | string | required | Type (tool_call, llm_response, decision, error) |
input | any | required | Input to the action |
output | any | required | Output from the action |
reasoning | string | "" | Agent reasoning for this step |
duration_ms | number | 0 | Action duration in milliseconds |
Replay a recorded session step by step with full action detail.
| Param | Type | Description |
|---|---|---|
session_id | string | Session ID to replay |
Returns: complete action sequence with timing, reasoning, inputs, and outputs.
Behavioral diff between two sessions. Aligns actions by step index and highlights differences.
| Param | Type | Description |
|---|---|---|
session_id_1 | string | First session |
session_id_2 | string | Second session |
Returns: similarity ratio, identical/divergent step counts, first divergence step, and per-step diffs.
Find where an agent first deviated from expected output.
| Param | Type | Description |
|---|---|---|
session_id | string | Session to analyze |
expected_output | any | Expected final output, or array of per-step expected outputs |
If expected_output is an array, compares step by step. If a single value, finds the last matching output and flags the next step as the divergence point.
Export a session for sharing and offline analysis.
| Param | Type | Default | Description |
|---|---|---|---|
session_id | string | required | Session to export |
format | string | "json" | "json" or "markdown" |
Markdown format produces a readable transcript with step headers, reasoning, and code blocks.
| URI | Description |
|---|---|
agent-replay://sessions | All recorded sessions with status and action counts |
1. record_session — start recording at agent launch
2. For each agent action:
- log_action — capture input, output, reasoning, timing
3. stop_recording — finalize the session
4. Debug:
- replay_session — review what happened step by step
- compare_sessions — diff today's run vs yesterday's
- find_divergence_point — pinpoint where it went wrong
5. Share:
- export_session — JSON for tooling, markdown for humans
npm test
MIT