A standardized testing harness for MCP servers and agent workflows
A standardized testing harness for MCP servers and agent workflows. Define test cases as YAML fixtures (steps → expected tool calls → expected outputs), run regression suites directly from your MCP client, and get pass/fail results with diffs — without leaving Claude Code or Cursor.
Tool reference | Configuration | Fixture format | Contributing | Troubleshooting | Design principles
Fixtures can declare an expected_output and run without a server (simulation mode). Per-step assertions include output_contains, output_not_contains, output_equals, output_matches, schema_match, tool_called, and latency_under. Earlier step results can be referenced with {{steps.<step_id>.output}}.

Add the following config to your MCP client:
{
"mcpServers": {
"eval-runner": {
"command": "npx",
"args": ["-y", "mcp-eval-runner@latest"]
}
}
}
By default, eval fixtures are loaded from ./evals/ in the current working directory. To use a different path:
{
"mcpServers": {
"eval-runner": {
"command": "npx",
"args": ["-y", "mcp-eval-runner@latest", "--fixtures=~/my-project/evals"]
}
}
}
Amp · Claude Code · Cline · Cursor · VS Code · Windsurf · Zed
Create a file at evals/smoke.yaml. Use live mode (recommended) by including a server block:
name: smoke
description: "Verify eval runner itself is working"
server:
  command: node
  args: ["dist/index.js"]
steps:
  - id: list_check
    description: "List available test cases"
    tool: list_cases
    input: {}
    expect:
      output_contains: "smoke"
Then enter the following in your MCP client:
Run the eval suite.
Your client should return a pass/fail result for the smoke test.
Fixtures are YAML (or JSON) files placed in the fixtures directory. Each file defines one test case.
| Field | Required | Description |
| ------------- | -------- | ----------------------------------------------------------------------------------------- |
| name | Yes | Unique name for the test case |
| description | No | Human-readable description |
| server | No | Server config — if present, runs in live mode; if absent, runs in simulation mode |
| steps | Yes | Array of steps to execute |
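When the server block is omitted, the fixture runs in simulation mode: each step is checked against a canned expected_output instead of a live tool call. A minimal sketch — the tool name, input, and output below are hypothetical, not part of the runner's built-in tools:

```yaml
name: greeting-sim
description: "Simulation-mode example (no server block)"
steps:
  - id: greet
    tool: say_hello                  # hypothetical tool name
    input: { name: "Ada" }
    expected_output: "Hello, Ada!"   # canned output, used since no server is spawned
    expect:
      output_contains: "Ada"
```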
server block (live mode)

server:
  command: node           # executable to spawn
  args: ["dist/index.js"] # arguments
  env:                    # optional environment variables
    MY_VAR: "value"
When server is present the eval runner spawns the server as a child process, connects via MCP stdio transport, and calls each step's tool against the live server.
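Live-mode steps can also chain: the {{steps.<step_id>.output}} placeholder feeds one step's result into a later step's input. A hedged sketch, assuming a hypothetical run_case tool on the server under test:

```yaml
name: chained
description: "Pass one step's output into the next"
server:
  command: node
  args: ["dist/index.js"]
steps:
  - id: first
    tool: list_cases
    input: {}
  - id: second
    tool: run_case                        # hypothetical tool name
    input:
      case: "{{steps.first.output}}"      # templated from the first step's output
    expect:
      output_contains: "pass"             # assumed result text, for illustration
```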
steps array

Each step has the following fields:

| Field | Required | Description |
| ----- | -------- | ----------- |