This project provides a toolset to crawl websites, wikis, and tool/library documentation, generate Markdown documentation from them, and make that documentation searchable via a Model Context Protocol (MCP) server, designed for integration with tools like Cursor.
The MCP server configuration is the same across clients; only the config file and its path differ.
{
  "mcpServers": {
    "doc-query-server": {
      "env": {},
      "args": [
        "--directory",
        "/path/to/your/MCPDocSearch",
        "run",
        "python",
        "-m",
        "mcp_server.main"
      ],
      "command": "uv"
    }
  }
}
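Since only the file location differs between clients, the entry above can also be written programmatically. The following is a minimal sketch, not part of the project: the helper names `build_server_entry` and `add_server` are hypothetical, and the target file path is whatever your MCP client actually reads.

```python
import json
from pathlib import Path


def build_server_entry(project_dir: str) -> dict:
    """Return the server entry shown above for a given checkout path."""
    return {
        "command": "uv",
        "args": ["--directory", project_dir, "run", "python", "-m", "mcp_server.main"],
        "env": {},
    }


def add_server(config_path: Path, name: str, entry: dict) -> dict:
    """Merge one entry into the "mcpServers" map, keeping other servers intact."""
    # Start from the existing config if there is one, otherwise from an empty object.
    config = json.loads(config_path.read_text()) if config_path.exists() else {}
    config.setdefault("mcpServers", {})[name] = entry
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For example, `add_server(Path("mcp.json"), "doc-query-server", build_server_entry("/path/to/your/MCPDocSearch"))` would add the entry to a file named `mcp.json` in the current directory (a placeholder name) without touching other configured servers.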
Web crawler (crawler_cli):
- Crawls websites using crawl4ai.
- Saves output to ./storage/ by default.

MCP server (mcp_server):
- Loads Markdown files from the ./storage/ directory.
- Generates embeddings using sentence-transformers (multi-qa-mpnet-base-dot-v1).
- Uses a cache file (storage/document_chunks_cache.pkl) to store processed chunks and embeddings.
  - If the .md files in ./storage/ haven't changed, the server loads directly from the cache, resulting in much faster startup times.
  - The cache is regenerated if any .md file in ./storage/ is modified, added, or removed since the cache was last created.
- Exposes tools via fastmcp for clients like Cursor:
  - list_documents: Lists available crawled documents.
  - get_document_headings: Retrieves the heading structure for a document.
  - search_documentation: Performs semantic search over document chunks using vector similarity.
- Runs over stdio transport for use within Cursor.

Typical workflow:
1. Use the crawler_cli tool to crawl a website and generate a .md file in ./storage/.
2. Run the mcp_server (typically managed by an MCP client like Cursor).
3. The server loads and processes the .md files in ./storage/.
4. Use the server's tools (list_documents, search_documentation, etc.) to query the crawled content.

This project uses uv for dependency management and execution.
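The cache behaviour described for the MCP server (reuse the cache while the .md files are unchanged, rebuild when one is modified, added, or removed) can be sketched as follows. This is an illustration under stated assumptions rather than the server's actual code: the function names, the `build` callback, and the layout of the pickled data are all hypothetical, and file changes are detected by comparing modification times.

```python
import pickle
from pathlib import Path


def snapshot(storage: Path) -> dict[str, float]:
    """Map each Markdown file name in storage to its modification time."""
    return {p.name: p.stat().st_mtime for p in sorted(storage.glob("*.md"))}


def load_or_rebuild(storage: Path, cache_file: Path, build):
    """Return (data, cache_hit). Rebuild when the set of .md files or any mtime changed."""
    current = snapshot(storage)
    if cache_file.exists():
        with cache_file.open("rb") as fh:
            cached = pickle.load(fh)
        # A matching snapshot means no file was modified, added, or removed.
        if cached.get("snapshot") == current:
            return cached["data"], True
    # Cache missing or stale: run the expensive step (chunking and embedding) again.
    data = build(storage)
    with cache_file.open("wb") as fh:
        pickle.dump({"snapshot": current, "data": data}, fh)
    return data, False
```

Here `build` stands in for whatever produces the chunks and embeddings. Storing the snapshot next to the data keeps the validity check to a single comparison at startup.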
Install uv: Follow the instructions on the uv website.
Clone the repository:
git clone https://github.com/alizdavoodi/MCPDocSearch.git
cd MCPDocSearch
Install dependencies:
uv sync
This command creates a virtual environment (usually .venv) and installs all dependencies listed in pyproject.toml.
Run the crawler using the crawl.py script or directly via uv run.
Basic Example:
uv run python crawl.py https://docs.example.com
This will crawl https://docs.example.com with default settings and save the output to ./storage/docs.example.com.md.
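The default file name in this example matches the host part of the URL. A small sketch of that mapping is shown below; it is an assumption based only on this one example, the function name `default_output_path` is hypothetical, and how crawl.py names output for URLs that include a path is not covered here.

```python
from pathlib import Path
from urllib.parse import urlparse


def default_output_path(url: str, storage: Path = Path("./storage")) -> Path:
    """Derive <storage>/<host>.md from a start URL (assumed naming rule)."""
    host = urlparse(url).hostname
    if host is None:
        raise ValueError(f"URL has no host: {url!r}")
    return storage / f"{host}.md"
```

If a different name is needed, the --output option sets the path explicitly.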
Example with Options:
uv run python crawl.py https://docs.another.site --output ./storage/custom_name.md --max-depth 2 --keyword "API" --keyword "Reference" --exclude-pattern "*blog*"
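To make the effect of a pattern such as "*blog*" concrete, here is a rough sketch of glob-style URL filtering. It assumes shell-style wildcards matched against the full URL, with exclusions taking priority; the function name `should_crawl` is hypothetical, and the matching rules actually applied by the crawler and crawl4ai may differ.

```python
from fnmatch import fnmatchcase


def should_crawl(url: str, include=(), exclude=()) -> bool:
    """Return True if the URL passes the include/exclude glob patterns."""
    # Any matching exclude pattern rejects the URL outright.
    if any(fnmatchcase(url, pattern) for pattern in exclude):
        return False
    # With no include patterns, everything not excluded is accepted.
    return not include or any(fnmatchcase(url, pattern) for pattern in include)
```

Under these assumptions, `should_crawl("https://docs.another.site/blog/post", exclude=["*blog*"])` is rejected, while a URL without "blog" in it is accepted.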
View all options:
uv run python crawl.py --help
Key options include:
- --output/-o: Specify output file path.
- --max-depth/-d: Set crawl depth (must be between 1 and 5).
- --include-pattern/--exclude-pattern: Filter URLs to crawl.
- --keyword/-k: Keywords for relevance scoring during crawl.
- --remove-links/--keep-links: Control HTML cleaning.
- --cache-mode: Control crawl4ai caching (`DEF