Read and analyze local documents (PDF, Excel, CSV, Word, PowerPoint, images). No API keys needed.
Config is the same across clients — only the file and path differ.
{
"mcpServers": {
"io-github-lunarperovskite-docalyze": {
"args": [
"-y",
"docalyze-mcp-server"
],
"command": "npx"
}
}
}Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.
Run this in your terminal to verify the server starts. Then let us know if it worked — your result helps other developers.
npx -y 'docalyze-mcp-server' 2>&1 | head -1 && echo "✓ Server started successfully"
After testing, let us know if it worked:
Five weighted categories — click any category to see the underlying evidence.
No known CVEs.
Checked docalyze-mcp-server against OSV.dev.
Click any tool to inspect its schema.
Be the first to review
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Others in data / writing
Query and manage PostgreSQL databases directly from AI assistants
An official Qdrant Model Context Protocol (MCP) server implementation
🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.
Manage Supabase projects — databases, auth, storage, and edge functions
MCP Security Weekly
Get CVE alerts and security updates for io.github.LunarPerovskite/docalyze and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.
An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.
No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.
| Format | Extensions | Read | Visual |
|---|---|---|---|
.pdf | ✅ | ✅ | |
| Excel | .xlsx, .xls | ✅ | ✅ |
| CSV / TSV | .csv, .tsv | ✅ | — |
| JSON | .json | ✅ | — |
| Word | .docx | ✅ | ✅ |
| PowerPoint | .pptx | ✅ | ✅ |
| Plain text | .txt, .md | ✅ | — |
| Images | .png, .jpg, .jpeg, .gif, .bmp, .tiff, .webp | — | ✅ |
| Tool | Description |
|---|---|
list_documents | List files under a directory, filtered by glob pattern |
document_info | Get metadata (size, modified date, sheets) for a file |
read_document | Extract text content from a document with pagination |
visual_evaluate_document | Return page images inline so the AI can analyze charts, tables, and diagrams |
Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.
pip install docalyze-mcp-server
npx docalyze-mcp-server
This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.
Add to your VS Code mcp.json (or settings.json):
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "python",
"args": ["-m", "docalyze_mcp_server"],
"env": {
"PYTHONIOENCODING": "utf-8"
}
}
}
}
Or, if you installed via pip and want to use the entry point:
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "docalyze-mcp-server"
}
}
}
The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:
# Word documents
pip install docalyze-mcp-server[docx]
# PowerPoint
pip install docalyze-mcp-server[pptx]
# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]
# Everything
pip install docalyze-mcp-server[all]
The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "docalyze-mcp-server",
"env": {
"DOCUMENTS_ROOT": "/path/to/your/documents"
}
}
}
}
If not set, it defaults to the directory containing the server script.
MIT