Is Vision.Mcp safe to use?

Vision.Mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments. Licensed under MIT.

How do I install Vision.Mcp?

Vision.Mcp can be installed by cloning its GitHub repository and following the setup instructions in the README.

What can Vision.Mcp do?

Vision.Mcp provides 2 tools: ingest_pdf, ingest_image. See the full tools list on the server page for descriptions and parameters.

What AI clients work with Vision.Mcp?

Vision.Mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.

Is Vision.Mcp actively maintained?

Vision.Mcp is recently maintained — last commit was 36 days ago. It has 4 GitHub stars.

Vision.Mcp

Name: Vision.Mcp
Author: br3akzero

by br3akzero

A standalone MCP server that provides on-device Vision Framework access for PDF and image text extraction.

4 2 tools GitHub

No known CVEs

MIT license

Maintained

Last commit 36d ago

Works with most clients

Transport: stdio

2 tools · ~182 tok

Grade A · 0.1% of 200K ctx

Edit this pageView history

Other

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "vision-mcp": {
      "command": "<see-readme>",
      "args": []
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/vision-mcp)](https://mcppedia.org/s/vision-mcp)

Read me

What Vision.Mcp does

A standalone MCP server that provides on-device Vision Framework access for PDF and image text extraction. Uses Apple's Vision OCR exclusively -- no cloud services, no API keys, no data leaves your machine.

Test This Server

No automated test available for this server. Check the GitHub README for setup instructions.

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

69/100across 5 weighted dimensions

How we score →

0255075100

−31

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

No package registry to scan.

Inventory

Tools (2)

Click any tool to inspect its schema.

~182 tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is Vision.Mcp safe to use?: Vision.Mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments. Licensed under MIT.
How do I install Vision.Mcp?: Vision.Mcp can be installed by cloning its GitHub repository and following the setup instructions in the README.
What can Vision.Mcp do?: Vision.Mcp provides 2 tools: ingest_pdf, ingest_image. See the full tools list on the server page for descriptions and parameters.
What AI clients work with Vision.Mcp?: Vision.Mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.
Is Vision.Mcp actively maintained?: Vision.Mcp is recently maintained — last commit was 36 days ago. It has 4 GitHub stars.

Similar servers

Others in other

View all →

Pi Lean Ctx95

Pi Coding Agent extension (CLI-first) — routes bash/read/grep/find/ls through lean-ctx CLI for strong token savings. Optional MCP bridge can register advanced tools.

3.0k 6

io.github.asklokesh/loki-mode95

Autonomous spec-to-product coding-agent CLI with an MCP server exposing 34 tools over stdio.

992 8

Sigmap94

97% token reduction for AI coding sessions — zero deps, 21 languages, MCP server

531 6

Sunpeak93

App framework, testing framework, and inspector for MCP Apps.

193 1

MCP Security Weekly

Get CVE alerts and security updates for Vision.Mcp and similar servers.

Community

Discussion

Start a conversation

Ask a question, share a tip, or report an issue.

Has anyone used this with Cursor?How do you handle auth?Any alternatives?

Edit this pageView history

Other

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "vision-mcp": {
      "command": "<see-readme>",
      "args": []
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/vision-mcp)](https://mcppedia.org/s/vision-mcp)

Read me

What Vision.Mcp does

Test This Server

No automated test available for this server. Check the GitHub README for setup instructions.

README

VisionMCP

Built with Swift 6.3, macOS 26, and the MCP Swift SDK.

How it works

Two independent parsers, each producing structured PageExtraction results:

PDF ingestion -- renders PDF pages to images via PDFKit, then runs RecognizeDocumentsRequest (macOS 26 Vision API) for structured document OCR. Extracts text, tables, lists, and paragraphs.
Image ingestion -- loads images via CGImageSource, then runs VNRecognizeTextRequest for text OCR. Supports PNG, JPEG, TIFF, BMP, GIF, HEIC, and WebP.

Both paths produce extracted text, confidence scores, and automatic text chunking with configurable overlap. The server is read-only -- it extracts and returns data with no persistence or database.

Requirements

macOS 26 (Tahoe) or later
Xcode 26 beta or later
Swift 6.3 or later

Build

git clone https://codeberg.org/<your-user>/VisionMCP.git
cd VisionMCP
swift build -c release

The release binary is at .build/release/VisionMCP.

Install

sudo ln -sf $(pwd)/.build/release/VisionMCP /usr/local/bin/visionmcp

Verify:

visionmcp --version

MCP Configuration

opencode

Add to your project's opencode.json:

{
  "mcp": {
    "visionmcp": {
      "type": "local",
      "command": ["/usr/local/bin/visionmcp"],
      "enabled": true
    }
  }
}

Or add to your global ~/.config/opencode/opencode.json to make it available across all projects.

Tools

`ingest_pdf`

Extracts text from a PDF document using Vision OCR. Returns extracted text, chunks, and metadata.

Parameters:

Name	Type	Required	Description
`file_path`	string	yes	Absolute path to the PDF file

Returns:

raw_text -- full extracted text
chunks -- text split into token-limited chunks with overlap
pages -- per-page extraction with text, confidence, tables, lists, paragraphs
file_hash -- SHA-256 hash of the file
page_count, chunk_count, status

`ingest_image`

Extracts text from an image file using Vision OCR. Returns extracted text and metadata.

Parameters:

Name	Type	Required	Description
`file_path`	string	yes	Absolute path to the image file

Supports: PNG, JPEG, TIFF, BMP, GIF, HEIC, WebP. Max file size: 250 MB.

Returns: Same structure as ingest_pdf.

Example response

{
  "file_name": "invoice-001.jpeg",
  "page_count": 1,
  "chunk_count": 2,
  "file_hash": "a258e31c...",
  "raw_text": "Invoice text here...",
  "chunks": "[{\"chunk_index\":0,\"content\":\"...\",\"token_count\":558}]",
  "pages": "[{\"page_number\":1,\"text\":\"...\",\"confidence\":0.97}]",
  "status": "extracted"
}

Architecture

VisionMCP
├── PDFParser              # Renders pages, runs RecognizeDocumentsRequest
├── PDFDocumentActor       # Thread-safe PDFDocument wrapper (Sendable)
├── ImageParser            # Loads images, runs VNRecognizeTextRequest
├── TextChunker            # Splits text into overlapping token-limited chunks
├── IngestService          # Orchestrates parsing + chunking
├── IngestTools            # MCP tool definitions + handlers
├── ToolRegistry           # Wires MCP server to tools
└── main.swift             # Entry point, stdio transport

No shared protocol, no factory, no reconciliation. Each tool routes directly to its parser.

Development

Build

swift build

Test

swift test

Tests use Swift Testing (import Testing, @Test, #expect).

Run locally

swift run VisionMCP

... View full README on GitHub

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

69/100across 5 weighted dimensions

How we score →

0255075100

−31

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

No package registry to scan.

Inventory

Tools (2)

Click any tool to inspect its schema.

~182 tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is Vision.Mcp safe to use?: Vision.Mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments. Licensed under MIT.
How do I install Vision.Mcp?: Vision.Mcp can be installed by cloning its GitHub repository and following the setup instructions in the README.
What can Vision.Mcp do?: Vision.Mcp provides 2 tools: ingest_pdf, ingest_image. See the full tools list on the server page for descriptions and parameters.
What AI clients work with Vision.Mcp?: Vision.Mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.
Is Vision.Mcp actively maintained?: Vision.Mcp is recently maintained — last commit was 36 days ago. It has 4 GitHub stars.

Similar servers

Others in other

View all →

Pi Lean Ctx95

Pi Coding Agent extension (CLI-first) — routes bash/read/grep/find/ls through lean-ctx CLI for strong token savings. Optional MCP bridge can register advanced tools.

3.0k 6

io.github.asklokesh/loki-mode95

Autonomous spec-to-product coding-agent CLI with an MCP server exposing 34 tools over stdio.

992 8

Sigmap94

97% token reduction for AI coding sessions — zero deps, 21 languages, MCP server

531 6

Sunpeak93

App framework, testing framework, and inspector for MCP Apps.

193 1

MCP Security Weekly

Get CVE alerts and security updates for Vision.Mcp and similar servers.