Is Mcp Ocr Server safe to use?

Mcp Ocr Server has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect to it; keep this in mind in shared environments.

How do I install Mcp Ocr Server?

Mcp Ocr Server can be installed by cloning its GitHub repository and following the setup instructions in the README.

What AI clients work with Mcp Ocr Server?

Mcp Ocr Server uses stdio transport, so it works with Claude Desktop, Cursor, Claude Code, and most other MCP clients that support stdio.

Is Mcp Ocr Server actively maintained?

Mcp Ocr Server is not very actively maintained: the last commit was 128 days ago. It has 1 GitHub star.

MCP OCR Server

A production-grade OCR MCP server built on Tesseract OCR and GoCV, providing intelligent image preprocessing and high-performance text recognition.

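Features

Core features

✅ Multilingual support: English, Simplified Chinese, Traditional Chinese, Japanese
✅ Intelligent preprocessing: automatic image quality analysis and an adaptive preprocessing pipeline
✅ High performance: worker pool + resource pool + result cache
✅ MCP integration: full Model Context Protocol support
✅ Production ready: thorough error handling, logging, and configuration management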

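Image preprocessing

Automatic quality analysis (sharpness, contrast, brightness)
Grayscale conversion
Denoising (Fast Non-Local Means Denoising)
Binarization (Otsu / adaptive thresholding)
Skew correction (based on the Hough transform)
Contrast enhancement (CLAHE)
Brightness adjustment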
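To make the binarization step more concrete, here is a minimal pure-Go sketch of Otsu's method on a flat slice of grayscale pixels. It is an illustration only: the function names `otsuThreshold` and `binarize` are hypothetical, and the server itself presumably delegates this work to OpenCV through GoCV rather than implementing it by hand.

```go
package main

import "fmt"

// otsuThreshold returns the gray level t that maximizes the between-class
// variance when pixels <= t form one class and pixels > t form the other.
// Hypothetical helper for illustration, not taken from the project.
func otsuThreshold(pixels []uint8) uint8 {
	var hist [256]int
	for _, p := range pixels {
		hist[p]++
	}
	total := len(pixels)
	var sumAll float64
	for level, count := range hist {
		sumAll += float64(level) * float64(count)
	}

	var sumBack float64 // sum of gray levels in the lower class
	var weightBack int  // number of pixels in the lower class
	var bestVar float64
	var best uint8
	for level := 0; level < 256; level++ {
		weightBack += hist[level]
		if weightBack == 0 {
			continue
		}
		weightFore := total - weightBack
		if weightFore == 0 {
			break
		}
		sumBack += float64(level) * float64(hist[level])
		meanBack := sumBack / float64(weightBack)
		meanFore := (sumAll - sumBack) / float64(weightFore)
		diff := meanBack - meanFore
		between := float64(weightBack) * float64(weightFore) * diff * diff
		if between > bestVar {
			bestVar = between
			best = uint8(level)
		}
	}
	return best
}

// binarize maps pixels above the threshold to 255 and all others to 0.
func binarize(pixels []uint8, threshold uint8) []uint8 {
	out := make([]uint8, len(pixels))
	for i, p := range pixels {
		if p > threshold {
			out[i] = 255
		}
	}
	return out
}

func main() {
	// A toy "image": dark text pixels (10) mixed with bright paper pixels (200).
	pixels := []uint8{10, 200, 10, 200, 200, 200, 10, 200}
	t := otsuThreshold(pixels)
	fmt.Println("threshold:", t)
	fmt.Println("binarized:", binarize(pixels, t))
}
```

Otsu's method suits evenly lit scans with a clearly bimodal histogram; adaptive thresholding, the other option listed above, is usually the better fit when lighting varies across the page.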

Performance optimization

Concurrent processing with a worker pool
Tesseract client pool
SHA256-based result cache
Configurable resource limits
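The SHA256-based result cache can be sketched as follows. This is a minimal illustration under assumptions: the key here hashes the language together with the image bytes, and eviction is plain first-in-first-out. The names `cacheKey` and `resultCache` are hypothetical, and the project's actual key layout and eviction policy may differ.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
	"sync"
)

// cacheKey derives a hex SHA-256 key from the language and the image bytes,
// so the same image recognized with a different language gets its own entry.
func cacheKey(image []byte, language string) string {
	h := sha256.New()
	h.Write([]byte(language))
	h.Write([]byte{0}) // separator between the language and the image bytes
	h.Write(image)
	return hex.EncodeToString(h.Sum(nil))
}

// resultCache is a small, concurrency-safe cache with FIFO eviction
// (an assumption made to keep the sketch short).
type resultCache struct {
	mu      sync.Mutex
	entries map[string]string
	order   []string
	maxSize int
}

func newResultCache(maxSize int) *resultCache {
	if maxSize < 1 {
		maxSize = 1
	}
	return &resultCache{entries: make(map[string]string), maxSize: maxSize}
}

func (c *resultCache) get(key string) (string, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	text, ok := c.entries[key]
	return text, ok
}

func (c *resultCache) put(key, text string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if _, exists := c.entries[key]; !exists {
		if len(c.order) >= c.maxSize {
			oldest := c.order[0]
			c.order = c.order[1:]
			delete(c.entries, oldest)
		}
		c.order = append(c.order, key)
	}
	c.entries[key] = text
}

func main() {
	cache := newResultCache(100)
	image := []byte("raw image bytes would go here")
	key := cacheKey(image, "eng")

	if _, ok := cache.get(key); !ok {
		cache.put(key, "recognized text") // a real server would run OCR here
	}
	text, ok := cache.get(key)
	fmt.Println(text, ok)
}
```

Hashing the content rather than the file path means a renamed or re-uploaded copy of the same image still hits the cache.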

System requirements

Go 1.21+
Tesseract OCR 4.0+
OpenCV 4.5+
macOS / Linux (Ubuntu, CentOS)

Quick start

1. Install system dependencies

# macOS
./scripts/install-deps.sh

# Or install manually
brew install tesseract tesseract-lang opencv

2. Install Go dependencies

make deps

3. Configure

Edit configs/config.yaml:

server:
  name: mcp-ocr-server
  version: 1.0.0

ocr:
  language: eng+chi_sim+chi_tra+jpn
  data_path: /usr/local/share/tessdata
  max_image_size: 10485760  # 10MB
  timeout: 30

preprocessing:
  enabled: true
  auto_mode: true  # intelligent preprocessing

performance:
  worker_pool_size: 4
  cache_enabled: true
  cache_size: 100
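As a rough picture of what `auto_mode` might do, the sketch below measures brightness and contrast on grayscale pixels and picks preprocessing steps from them. Everything here is an assumption for illustration: the `quality` type, the `chooseSteps` function, the step names, and the numeric thresholds are hypothetical, and sharpness analysis is left out because it needs 2D neighborhood information.

```go
package main

import (
	"fmt"
	"math"
)

// quality holds two simple image statistics.
type quality struct {
	Brightness float64 // mean gray level, 0-255
	Contrast   float64 // standard deviation of gray levels
}

// analyze computes the mean and standard deviation of the gray levels.
func analyze(pixels []uint8) quality {
	if len(pixels) == 0 {
		return quality{}
	}
	n := float64(len(pixels))
	var sum float64
	for _, p := range pixels {
		sum += float64(p)
	}
	mean := sum / n
	var sq float64
	for _, p := range pixels {
		d := float64(p) - mean
		sq += d * d
	}
	return quality{Brightness: mean, Contrast: math.Sqrt(sq / n)}
}

// chooseSteps selects preprocessing steps from the measured quality.
// The thresholds (40, 80, 180) are illustrative guesses, not project values.
func chooseSteps(q quality) []string {
	steps := []string{"grayscale"}
	if q.Contrast < 40 {
		steps = append(steps, "clahe")
	}
	if q.Brightness < 80 || q.Brightness > 180 {
		steps = append(steps, "brightness")
	}
	return append(steps, "binarize")
}

func main() {
	dim := []uint8{30, 35, 28, 32, 31, 29} // a dark, low-contrast patch
	q := analyze(dim)
	fmt.Printf("brightness=%.1f contrast=%.1f steps=%v\n",
		q.Brightness, q.Contrast, chooseSteps(q))
}
```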

4. Build and run

# Build
make build

# Run
make run

# Or run the binary directly
./bin/mcp-ocr-server -config configs/config.yaml

MCP Tools

1. ocr_recognize_text

Recognizes text in an image file.

Parameters:

{
  "image_path": "/path/to/image.png",
  "language": "eng",
  "preprocess": true,
  "auto_mode": true
}

Returns:

{
  "text": "recognized text content",
  "confidence": 95.5,
  "language": "eng",
  "duration": 1.23
}
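For Go callers, the parameters and result above map naturally onto two structs. The sketch below is one possible shape, not the project's own types: the names `RecognizeRequest`, `RecognizeResult`, and `parseResult` are hypothetical, and treating `duration` as seconds is an assumption based on the example conversation in this README.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// RecognizeRequest mirrors the ocr_recognize_text parameters.
type RecognizeRequest struct {
	ImagePath  string `json:"image_path"`
	Language   string `json:"language"`
	Preprocess bool   `json:"preprocess"`
	AutoMode   bool   `json:"auto_mode"`
}

// RecognizeResult mirrors the documented return value.
type RecognizeResult struct {
	Text       string  `json:"text"`
	Confidence float64 `json:"confidence"`
	Language   string  `json:"language"`
	Duration   float64 `json:"duration"` // presumably seconds
}

// parseResult decodes a JSON result payload into a RecognizeResult.
func parseResult(data []byte) (RecognizeResult, error) {
	var res RecognizeResult
	err := json.Unmarshal(data, &res)
	return res, err
}

func main() {
	req := RecognizeRequest{
		ImagePath:  "/path/to/image.png",
		Language:   "eng",
		Preprocess: true,
		AutoMode:   true,
	}
	args, err := json.Marshal(req)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(args))

	sample := []byte(`{"text":"hello","confidence":95.5,"language":"eng","duration":1.23}`)
	res, err := parseResult(sample)
	if err != nil {
		panic(err)
	}
	fmt.Printf("%q at %.1f%% confidence\n", res.Text, res.Confidence)
}
```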

2. ocr_recognize_text_base64

Recognizes text in a Base64-encoded image.

Parameters:

{
  "image_base64": "iVBORw0KGgoAAAANSUhEUgA...",
  "language": "chi_sim",
  "preprocess": true,
  "auto_mode": true
}
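A handler for Base64 input would typically decode the payload, enforce a size limit, and check the image format before running OCR. The following sketch shows that sequence under assumptions: the 10 MB limit is taken from `max_image_size` in the sample config, accepting only PNG and JPEG is a guess based on the file extensions used in this README, and `decodeImage` is a hypothetical helper name.

```go
package main

import (
	"bytes"
	"encoding/base64"
	"errors"
	"fmt"
)

// maxImageSize matches max_image_size in the sample config (10 MB).
const maxImageSize = 10 * 1024 * 1024

var (
	pngMagic  = []byte{0x89, 'P', 'N', 'G', '\r', '\n', 0x1a, '\n'}
	jpegMagic = []byte{0xff, 0xd8, 0xff}
)

// decodeImage decodes standard Base64, enforces the size limit, and
// accepts only payloads that start with a PNG or JPEG signature.
func decodeImage(encoded string) ([]byte, error) {
	data, err := base64.StdEncoding.DecodeString(encoded)
	if err != nil {
		return nil, fmt.Errorf("invalid base64: %w", err)
	}
	if len(data) > maxImageSize {
		return nil, errors.New("image exceeds max_image_size")
	}
	if bytes.HasPrefix(data, pngMagic) || bytes.HasPrefix(data, jpegMagic) {
		return data, nil
	}
	return nil, errors.New("unsupported image format")
}

func main() {
	// Encode just the PNG signature; a real payload would be a whole file.
	encoded := base64.StdEncoding.EncodeToString(pngMagic)
	data, err := decodeImage(encoded)
	fmt.Println(encoded, len(data), err)

	_, err = decodeImage("not base64!")
	fmt.Println(err != nil)
}
```

The `iVBORw0KGgo` prefix in the example parameters is simply the Base64 form of the 8-byte PNG signature, which is why PNG payloads always start that way.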

3. ocr_batch_recognize

Recognizes text in multiple images in one batch.

Parameters:

{
  "image_paths": [
    "/path/to/image1.png",
    "/path/to/image2.jpg"
  ],
  "language": "eng+chi_sim",
  "preprocess": true
}

Returns:

{
  "results": [
    {
      "path": "/path/to/image1.png",
      "text": "...",
      "confidence": 95.5
    },
    {
      "path": "/path/to/image2.jpg",
      "text": "...",
      "confidence": 92.3
    }
  ],
  "count": 2
}
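Batch recognition is a natural fit for the worker pool mentioned under performance optimization. The sketch below fans a list of paths out to a fixed number of goroutines and returns results in input order. It is an illustration only: `recognizeBatch` and `batchItem` are hypothetical names, and the `recognize` callback stands in for the real Tesseract call.

```go
package main

import (
	"fmt"
	"sync"
)

// batchItem is the outcome for one image path.
type batchItem struct {
	Path string
	Text string
	Err  error
}

// recognizeBatch runs recognize over all paths using a fixed number of
// workers. Each worker writes only to its own index, so results keep the
// input order without extra locking.
func recognizeBatch(paths []string, workers int, recognize func(path string) (string, error)) []batchItem {
	if workers < 1 {
		workers = 1
	}
	results := make([]batchItem, len(paths))
	jobs := make(chan int)

	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for i := range jobs {
				text, err := recognize(paths[i])
				results[i] = batchItem{Path: paths[i], Text: text, Err: err}
			}
		}()
	}
	for i := range paths {
		jobs <- i
	}
	close(jobs)
	wg.Wait()
	return results
}

func main() {
	paths := []string{"/path/to/image1.png", "/path/to/image2.jpg"}
	// fakeOCR stands in for a real OCR engine call.
	fakeOCR := func(path string) (string, error) {
		return "text from " + path, nil
	}
	for _, item := range recognizeBatch(paths, 4, fakeOCR) {
		fmt.Printf("%s -> %q (err=%v)\n", item.Path, item.Text, item.Err)
	}
}
```

Keeping a per-item error lets one unreadable image fail without discarding the rest of the batch.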

4. ocr_get_supported_languages

Returns the list of supported languages.

Returns:

{
  "languages": ["eng", "chi_sim", "chi_tra", "jpn"]
}
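MCP clients invoke tools like the four above by sending a JSON-RPC 2.0 `tools/call` request, which for this server travels over stdio. The sketch below builds such a request in Go. The type and function names (`rpcRequest`, `toolCallParams`, `newToolCall`) are hypothetical, and a real client must also complete the MCP `initialize` handshake before calling tools, which is omitted here.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// toolCallParams is the params object of an MCP tools/call request.
type toolCallParams struct {
	Name      string         `json:"name"`
	Arguments map[string]any `json:"arguments,omitempty"`
}

// rpcRequest is a JSON-RPC 2.0 request envelope.
type rpcRequest struct {
	JSONRPC string         `json:"jsonrpc"`
	ID      int            `json:"id"`
	Method  string         `json:"method"`
	Params  toolCallParams `json:"params"`
}

// newToolCall serializes a tools/call request for the given tool.
func newToolCall(id int, tool string, args map[string]any) ([]byte, error) {
	return json.Marshal(rpcRequest{
		JSONRPC: "2.0",
		ID:      id,
		Method:  "tools/call",
		Params:  toolCallParams{Name: tool, Arguments: args},
	})
}

func main() {
	// A tool without arguments.
	req, err := newToolCall(1, "ocr_get_supported_languages", nil)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(req))

	// A tool with arguments, using the parameter names documented above.
	req, err = newToolCall(2, "ocr_recognize_text", map[string]any{
		"image_path": "/path/to/image.png",
		"language":   "eng",
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(string(req))
}
```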

Usage examples

Claude Desktop integration

Add the following to claude_desktop_config.json:

{
  "mcpServers": {
    "ocr": {
      "command": "/path/to/mcp-ocr-server",
      "args": ["-config", "/path/to/config.yaml"]
    }
  }
}

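Example conversation

User: Please recognize the text in this image: /path/to/document.png

Claude: I'll use the OCR tool to recognize this image...

[Calls ocr_recognize_text]

Recognition result:
- Text: "This is an important document..."
- Confidence: 96.5%
- Language: Simplified Chinese
- Processing time: 1.2 s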

Project structure

mcp-ocr-server/
├── cmd/
│   └── server/
│       └── main.go              # Server entry point
├── internal/
│   ├── config/
│   │   └── config.go            # Configuration management
│   ├── ocr/
│   │   ├── engine.go            # OCR engine interface
│   │   └── tesseract.go         # Tesseract implementation
│   ├── preprocessing/
│   │   ├── analyzer.go          # Image quality analysis
│   │   └── preprocessor.go      # Image preprocessing
│   ├── pool/
│   │   └── worker_pool.go       # Worker pool
│   ├── cache/
│   │   └── cache.go             # Result cache
│   ├── tools/
│   │   ├── schemas.go           # MCP tool schemas
│   │   └── handler.go           # Tool handlers
│   └── server/
│       └── server.go            # MCP server
├── pkg/
│   ├── errors/
│   │   └── errors.go            # Error handling
│   └── logger/
│       └── logger.go            # Logging wrapper
├── configs/
│   └── config.yaml              # Configuration file
├── scripts/
│   └── install-deps.sh          # Dependency installation script
├── Makefile                     # Build management
├── Dockerfile                   # Docker support
└── README.md                    # Project documentation

Configuration reference

OCR configuration

ocr:
  language: eng+chi_sim           # language combination
  data_path: /path/to/tessdata    # path to tessdata
  page_seg_mode: 3                # page segmentation mode
  max_image_size: 10485760        # maximum image size (bytes)
  timeout: 30                     # timeout
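The `language` value joins Tesseract language codes with `+`. A server might validate such a string before handing it to the engine, roughly as sketched below. The `parseLanguages` function is hypothetical, and the allowed set is taken from the `ocr_get_supported_languages` example earlier in this README rather than from the project's code.

```go
package main

import (
	"fmt"
	"strings"
)

// supported lists the codes shown in the ocr_get_supported_languages example.
var supported = map[string]bool{
	"eng":     true,
	"chi_sim": true,
	"chi_tra": true,
	"jpn":     true,
}

// parseLanguages splits a spec such as "eng+chi_sim" and rejects
// empty specs and unknown codes.
func parseLanguages(spec string) ([]string, error) {
	if spec == "" {
		return nil, fmt.Errorf("language must not be empty")
	}
	parts := strings.Split(spec, "+")
	for _, code := range parts {
		if !supported[code] {
			return nil, fmt.Errorf("unsupported language %q", code)
		}
	}
	return parts, nil
}

func main() {
	langs, err := parseLanguages("eng+chi_sim")
	fmt.Println(langs, err)

	_, err = parseLanguages("eng+fra")
	fmt.Println(err)
}
```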

Preprocessing configuration

preprocessing:
  enabled: true

... [View full README on GitHub](https://github.com/Ricardo-M-L/mcp-ocr-server#readme)
