Is io.github.rsmdt/multimodal safe to use?

io.github.rsmdt/multimodal has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.

How do I install io.github.rsmdt/multimodal?

io.github.rsmdt/multimodal supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.

What can io.github.rsmdt/multimodal do?

io.github.rsmdt/multimodal provides 5 tools: generate_image, generate_video, generate_audio, transcribe_audio, list_providers. See the full tools list on the server page for descriptions and parameters.

What AI clients work with io.github.rsmdt/multimodal?

io.github.rsmdt/multimodal is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.

Is io.github.rsmdt/multimodal actively maintained?

io.github.rsmdt/multimodal is less actively maintained — last commit was 117 days ago. It has 1 GitHub stars.

io.github.rsmdt/multimodal

@r16t/multimodal-mcp

Multi-provider media generation — images, video, audio, and transcription via a unified interface

1 74/wk 5 tools GitHub npm

No known CVEs

No license

Maintained

Last commit 117d ago

Works with most clients

Transport: stdio, sse, http

5 tools · ~1.0k tok

Grade B · 0.5% of 200K ctx

Edit this pageView history

AI / ML Entertainment

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio, sse, http · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "multimodal-mcp": {
      "env": {
        "OPENAI_API_KEY": "sk-..."
      },
      "args": [
        "@r16t/multimodal-mcp@latest"
      ],
      "command": "npx"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-rsmdt-multimodal)](https://mcppedia.org/s/io-github-rsmdt-multimodal)

Read me

What io.github.rsmdt/multimodal does

Multi-provider media generation MCP server. Generate images, videos, audio, and transcriptions from text prompts using OpenAI, xAI, Gemini, ElevenLabs, and BFL (FLUX) through a single unified interface.

Test This Server

Run this in your terminal to verify the server starts. Then let us know if it worked — your result helps other developers.

npx -y '@r16t/multimodal-mcp' 2>&1 | head -1 && echo "✓ Server started successfully"

After testing, let us know if it worked:

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

75/100across 5 weighted dimensions

How we score →

0255075100

−25

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

Checked @r16t/multimodal-mcp against OSV.dev.

Inventory

Tools (5)

Click any tool to inspect its schema.

~1.0k tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.rsmdt/multimodal safe to use?: io.github.rsmdt/multimodal has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.rsmdt/multimodal?: io.github.rsmdt/multimodal supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What can io.github.rsmdt/multimodal do?: io.github.rsmdt/multimodal provides 5 tools: generate_image, generate_video, generate_audio, transcribe_audio, list_providers. See the full tools list on the server page for descriptions and parameters.
What AI clients work with io.github.rsmdt/multimodal?: io.github.rsmdt/multimodal is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.
Is io.github.rsmdt/multimodal actively maintained?: io.github.rsmdt/multimodal is less actively maintained — last commit was 117 days ago. It has 1 GitHub stars.

Similar servers

Others in ai-ml / entertainment

View all →

Sequential Thinking MCP Server98

Dynamic problem-solving through sequential thought chains

87.9k 1

Memory MCP Server98

Persistent memory using a knowledge graph

87.9k 5

Gpt Researcher95

An autonomous agent that conducts deep research on any data using any LLM providers

28.0k 5

Ruflo95

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration

62.2k 4

MCP Security Weekly

Get CVE alerts and security updates for io.github.rsmdt/multimodal and similar servers.

Community

Discussion

Start a conversation

Ask a question, share a tip, or report an issue.

Has anyone used this with Cursor?How do you handle auth?Any alternatives?

Edit this pageView history

AI / ML Entertainment

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio, sse, http · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "multimodal-mcp": {
      "env": {
        "OPENAI_API_KEY": "sk-..."
      },
      "args": [
        "@r16t/multimodal-mcp@latest"
      ],
      "command": "npx"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-rsmdt-multimodal)](https://mcppedia.org/s/io-github-rsmdt-multimodal)

Read me

What io.github.rsmdt/multimodal does

Test This Server

Run this in your terminal to verify the server starts. Then let us know if it worked — your result helps other developers.

npx -y '@r16t/multimodal-mcp' 2>&1 | head -1 && echo "✓ Server started successfully"

After testing, let us know if it worked:

README

multimodal-mcp

Features

🎨 Image Generation — Generate images via OpenAI (gpt-image-1), xAI (grok-imagine-image), Gemini (imagen-4), or BFL (FLUX Pro 1.1)
✏️ Image Editing — Edit images via OpenAI, xAI, Gemini, or BFL (FLUX Kontext)
🎬 Video Generation — Generate videos via OpenAI (sora-2), xAI (grok-imagine-video), or Gemini (veo-3.1)
🔊 Audio Generation — Text-to-speech via OpenAI (tts-1), Gemini, or ElevenLabs (Flash v2.5). Sound effects via ElevenLabs
🎙️ Audio Transcription — Speech-to-text via OpenAI (Whisper) or ElevenLabs (Scribe)
🔄 Auto-Discovery — Automatically detects configured providers from environment variables
🎯 Provider Selection — Auto-selects or explicitly choose a provider per request
📁 File Output — Saves all generated media to disk with descriptive filenames

Quick Start

Set the API key for at least one provider. Most users only need one — add more to access additional providers.

# Using OpenAI
claude mcp add multimodal-mcp -e OPENAI_API_KEY=sk-... -- npx -y @r16t/multimodal-mcp@latest

# Or using xAI
# claude mcp add multimodal-mcp -e XAI_API_KEY=xai-... -- npx -y @r16t/multimodal-mcp@latest

# Or using Gemini
# claude mcp add multimodal-mcp -e GEMINI_API_KEY=AIza... -- npx -y @r16t/multimodal-mcp@latest

# Or using ElevenLabs (audio + transcription)
# claude mcp add multimodal-mcp -e ELEVENLABS_API_KEY=xi-... -- npx -y @r16t/multimodal-mcp@latest

# Or using BFL/FLUX (images)
# claude mcp add multimodal-mcp -e BFL_API_KEY=... -- npx -y @r16t/multimodal-mcp@latest

Using a different editor? See setup instructions for Claude Desktop, Cursor, VS Code, Windsurf, and Cline.

Environment Variables

Variable	Required	Description
`OPENAI_API_KEY`	At least one provider key	OpenAI API key — enables image, video, audio generation, and transcription via gpt-image-1, sora-2, tts-1, and whisper-1
`XAI_API_KEY`	At least one provider key	xAI API key — enables image and video generation via grok-imagine-image and grok-imagine-video
`GEMINI_API_KEY`	At least one provider key	Gemini API key — enables image, video, and audio generation via imagen-4, veo-3.1, and gemini-2.5-flash-preview-tts
`GOOGLE_API_KEY`	—	Alias for `GEMINI_API_KEY`; either name is accepted
`ELEVENLABS_API_KEY`	At least one provider key	ElevenLabs API key — enables audio generation (TTS, sound effects) and transcription via Flash v2.5 and Scribe v1
`BFL_API_KEY`	At least one provider key	BFL API key — enables image generation and editing via FLUX Pro 1.1 and FLUX Kontext
`MEDIA_OUTPUT_DIR`	No	Directory for saved media files. Defaults to the current working directory

Available Tools

`generate_image`

Generate an image from a text prompt.

Parameter	Type	Required	Description
`prompt`	string	Yes	Text description of the image to generate
`provider`	string	No	Provider to use: `openai`, `xai`, `google`, `bfl`. Auto-selects if omitted
`aspectRatio`	string	No	Aspect ratio: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`
`quality`	string	No	Quality level: `low`, `standard`, `high`
`outputDirectory`	string	No	Directory to save the generated file. Absolute or relative path. Defaults to `MEDIA_OUTPUT_DIR` or cwd
`providerOptions`	object	No	Provider-specific parameters passed through directly

`generate_video`

Generate a video from a text prompt. Video generation is asynchronous and may take several minutes.

Parameter	Type	Required	Description
`prompt`	string	Yes	Text description of the video

... View full README on GitHub

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

75/100across 5 weighted dimensions

How we score →

0255075100

−25

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

No known CVEs.

Checked @r16t/multimodal-mcp against OSV.dev.

Inventory

Tools (5)

Click any tool to inspect its schema.

~1.0k tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.rsmdt/multimodal safe to use?: io.github.rsmdt/multimodal has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.rsmdt/multimodal?: io.github.rsmdt/multimodal supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What can io.github.rsmdt/multimodal do?: io.github.rsmdt/multimodal provides 5 tools: generate_image, generate_video, generate_audio, transcribe_audio, list_providers. See the full tools list on the server page for descriptions and parameters.
What AI clients work with io.github.rsmdt/multimodal?: io.github.rsmdt/multimodal is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.
Is io.github.rsmdt/multimodal actively maintained?: io.github.rsmdt/multimodal is less actively maintained — last commit was 117 days ago. It has 1 GitHub stars.