MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP clients with 20+ normalized metadata attributes.
Config is the same across clients — only the file and path differ.
{
"mcpServers": {
"mcpcorpus": {
"command": "<see-readme>",
"args": []
}
}
}Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP clients with 20+ normalized metadata attributes.
No automated test available for this server. Check the GitHub README for setup instructions.
Five weighted categories — click any category to see the underlying evidence.
No known CVEs.
No package registry to scan.
Be the first to review
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Others in ai-ml / data
Dynamic problem-solving through sequential thought chains
Query and manage PostgreSQL databases directly from AI assistants
Persistent memory using a knowledge graph
Manage Supabase projects — databases, auth, storage, and edge functions
MCP Security Weekly
Get CVE alerts and security updates for MCPCorpus and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.
MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP clients with 20+ normalized metadata attributes.
MCPCorpus/
├── Crawler/ # Data collection tools
│ ├── Servers/ # Server data
│ ├── Clients/ # Client data
│ ├── github_info_collector.py # GitHub metadata collector
| ├── data_cleaner.py # Data normalization
│ └── tool_extractor.py # mcp tool extract
└── Website/ # Web search interface
├── server.py # Local web server
└── index.html # Search interface
└── mcpso_servers_cleaned.json
└── mcpso_clients_cleaned.json
cd Website
python server.py
# Open http://localhost:8000
import json
import pandas as pd
# Load datasets
with open('Crawler/Servers/mcpso_servers_cleaned.json', 'r') as f:
servers = json.load(f)
with open('Crawler/Clients/mcpso_clients_cleaned.json', 'r') as f:
clients = json.load(f)
# Convert to DataFrame
servers_df = pd.DataFrame(servers)
clients_df = pd.DataFrame(clients)
# Collect new data
cd Crawler/Servers && python Server_request.py
cd ../Clients && python Client_request.py
# Add GitHub metadata
cd .. && python github_info_collector.py --token YOUR_GITHUB_TOKEN
If you use MCPCorpus in your research, please cite it as:
@misc{lin2025largescaleevolvabledatasetmodel,
title={A Large-Scale Evolvable Dataset for Model Context Protocol Ecosystem and Security Analysis},
author={Zhiwei Lin and Bonan Ruan and Jiahao Liu and Weibo Zhao},
year={2025},
eprint={2506.23474},
archivePrefix={arXiv},
primaryClass={cs.CR},
url={https://arxiv.org/abs/2506.23474},
}