Is io.github.AnnasMazhar/pyspark-mcp safe to use?

io.github.AnnasMazhar/pyspark-mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.

How do I install io.github.AnnasMazhar/pyspark-mcp?

io.github.AnnasMazhar/pyspark-mcp supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.

What AI clients work with io.github.AnnasMazhar/pyspark-mcp?

io.github.AnnasMazhar/pyspark-mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.

Is io.github.AnnasMazhar/pyspark-mcp actively maintained?

io.github.AnnasMazhar/pyspark-mcp is actively maintained — last commit was 8 days ago.

io.github.AnnasMazhar/pyspark-mcp

SQL to PySpark conversion, AWS Glue job generation, and Spark code optimization.

0 tools GitHub

No known CVEs

No license

Actively maintained

Last commit 8d ago

Works with most clients

Transport: stdio

0 tools

Grade F

Edit this pageView history

Cloud Marketing

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "pyspark": {
      "args": [],
      "command": "pyspark-mcp"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-annasmazhar-pyspark-mcp)](https://mcppedia.org/s/io-github-annasmazhar-pyspark-mcp)

Read me

What io.github.AnnasMazhar/pyspark-mcp does

SQL to PySpark conversion, AWS Glue job generation, and Spark code optimization.

Test This Server

No automated test available for this server. Check the GitHub README for setup instructions.

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

37/100across 5 weighted dimensions

How we score →

0255075100

−63

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

No package registry to scan.

Help improve this page

This server is missing a description. Tools and install config are also missing.If you've used it, help the community.

Add information

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.AnnasMazhar/pyspark-mcp safe to use?: io.github.AnnasMazhar/pyspark-mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.AnnasMazhar/pyspark-mcp?: io.github.AnnasMazhar/pyspark-mcp supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What AI clients work with io.github.AnnasMazhar/pyspark-mcp?: io.github.AnnasMazhar/pyspark-mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.
Is io.github.AnnasMazhar/pyspark-mcp actively maintained?: io.github.AnnasMazhar/pyspark-mcp is actively maintained — last commit was 8 days ago.

Similar servers

Others in cloud / marketing

View all →

Observability Mcp95

MCP Server for GCP environment for interacting with various Observability APIs.

790 10

Mcp Server Typescript94

DataForSEO API modelcontextprotocol server

197 9

Alibabacloud Devops Mcp Server92

Yunxiao MCP Server provides AI assistants with the ability to interact with the Yunxiao platform. It provides a set of tools that interact with Yunxiao's API, allowing AI assistants to manage Codeup repository, Project, Pipeline, Packages etc.

110 7

io.github.wyre-technology/datto-saas-protection-mcp92

MCP server for Datto SaaS Protection — M365/GWS backups, restores, seats.

MCP Security Weekly

Get CVE alerts and security updates for io.github.AnnasMazhar/pyspark-mcp and similar servers.

Community

Discussion

Start a conversation

Ask a question, share a tip, or report an issue.

Has anyone used this with Cursor?How do you handle auth?Any alternatives?

Edit this pageView history

Cloud Marketing

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "pyspark": {
      "args": [],
      "command": "pyspark-mcp"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-annasmazhar-pyspark-mcp)](https://mcppedia.org/s/io-github-annasmazhar-pyspark-mcp)

Read me

What io.github.AnnasMazhar/pyspark-mcp does

SQL to PySpark conversion, AWS Glue job generation, and Spark code optimization.

Test This Server

No automated test available for this server. Check the GitHub README for setup instructions.

README

PySpark MCP Server

SQL migration assistance, AWS Glue job generation, and Spark code optimization — as an MCP server.

What It Does

SQL Dialect Transpilation — Convert between PostgreSQL, Oracle, Redshift, MySQL, Snowflake, and Spark SQL using SQLGlot
PySpark DataFrame API Generation — Generate DataFrame API code from SQL with optimization hints
AWS Glue Integration — Job templates, DynamicFrame conversions, Data Catalog definitions, S3 optimization strategies
Batch Processing — Process hundreds of SQL files concurrently
Code Review & Optimization — Analyze existing PySpark code for performance improvements
Pattern Detection — Find code duplication and suggest refactoring

What It Doesn't Do

Recursive CTEs → provides Spark SQL equivalent + guidance (PySpark has no native recursive CTE support)
MERGE/PIVOT/CONNECT BY → transpiles to Spark SQL, provides DataFrame API guidance
Perfect 1:1 DataFrame API transpilation for all SQL — complex queries get Spark SQL + optimization recommendations

Quick Start

pip install -e .
pyspark-mcp  # starts the MCP server

MCP Configuration

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "pyspark": {
      "command": "pyspark-mcp",
      "args": []
    }
  }
}

Hermes Agent

Add to ~/.hermes/config.yaml:

mcp:
  servers:
    pyspark:
      command: pyspark-mcp
      enabled_tools: all

Docker

docker compose up -d

Tools

SQL Conversion

convert_sql_to_pyspark — Convert SQL to PySpark with dialect detection
analyze_sql_context — Analyze SQL complexity and suggest approach

AWS Glue

generate_aws_glue_job_template — Generate complete Glue job scripts
convert_dataframe_to_dynamic_frame — DataFrame ↔ DynamicFrame conversion
generate_data_catalog_table_definition — Data Catalog table definitions
generate_incremental_processing_job — Incremental/CDC job generation
analyze_s3_optimization_opportunities — S3 layout and partitioning analysis

Optimization

review_pyspark_code — Code review with performance recommendations
optimize_pyspark_code — Suggest optimizations for existing code
recommend_join_strategy — Broadcast vs shuffle join recommendations
suggest_partitioning_strategy — Partitioning recommendations

Batch Processing

batch_process_files — Process multiple SQL files concurrently
batch_process_directory — Convert entire directories

Development

python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"

# Test
pytest tests/ -v --cov=pyspark_tools

# Format
black pyspark_tools tests
isort pyspark_tools tests

# Lint
flake8 pyspark_tools tests

Architecture

pyspark_tools/
├── server.py              # FastMCP server + tool definitions
├── sql_converter.py       # SQLGlot-based transpilation + DataFrame API generation
├── aws_glue_integration.py # Glue job templates, DynamicFrame, Data Catalog
├── advanced_optimizer.py  # Performance analysis + optimization suggestions
├── batch_processor.py     # Concurrent file processing
├── code_reviewer.py       # PySpark code review patterns
├── duplicate_detector.py  # Code deduplication
├── data_source_analyzer.py # Data source analysis
└── file_utils.py          # File I/O utilities

CI/CD

✅ 256 tests passing
✅ 71% code coverage
✅ Code quality checks (black, isort, flake8)
✅ Python 3.11 tested

License

MIT — see LICENSE.

... View full README on GitHub

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

37/100across 5 weighted dimensions

How we score →

0255075100

−63

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

No package registry to scan.

Help improve this page

This server is missing a description. Tools and install config are also missing.If you've used it, help the community.

Add information

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.AnnasMazhar/pyspark-mcp safe to use?: io.github.AnnasMazhar/pyspark-mcp has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.AnnasMazhar/pyspark-mcp?: io.github.AnnasMazhar/pyspark-mcp supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What AI clients work with io.github.AnnasMazhar/pyspark-mcp?: io.github.AnnasMazhar/pyspark-mcp is compatible with claude-desktop, cursor, claude-code. It uses stdio transport.
Is io.github.AnnasMazhar/pyspark-mcp actively maintained?: io.github.AnnasMazhar/pyspark-mcp is actively maintained — last commit was 8 days ago.