High-performance audio, music, and voice generation MCP server for Gemini 2.5 and Lyria 3.
Config is the same across clients — only the file and path differ.
{
"mcpServers": {
"gemini-audio": {
"env": {
"GEMINI_API_KEY": "YOUR_SECURE_API_KEY"
},
"args": [
"-y",
"gemini-audio-mcp"
],
"command": "npx"
}
}
}Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
Gemini Audio MCP is a high-performance Model Context Protocol (MCP) server engineered for professional-grade audio synthesis. It leverages the Gemini 2.5 Multimodal Live API and Google DeepMind's Lyria 3 models to deliver high-fidelity environmental soundscapes, musical compositions, and expressive narration on-demand.
No automated test available for this server. Check the GitHub README for setup instructions.
Five weighted categories — click any category to see the underlying evidence.
No known CVEs.
No package registry to scan.
Click any tool to inspect its schema.
Be the first to review
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Others in ai-ml / entertainment
Workspace template + MCP server for Claude Code, Codex CLI, Cursor & Windsurf. Multi-agent knowledge engine (ag-refresh / ag-ask) that turns any codebase into a queryable AI assistant.
Persistent memory using a knowledge graph
Dynamic problem-solving through sequential thought chains
A markdown editor — and the bridge to your LLM. Local-first, MIT, ~15 MB. Bundled MCP server lets Claude Code / Codex / Cursor drive your vault directly. 14 AI providers BYOK.
MCP Security Weekly
Get CVE alerts and security updates for io.github.jxoesneon/gemini-audio-mcp and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.
Gemini Audio MCP is a high-performance Model Context Protocol (MCP) server engineered for professional-grade audio synthesis. It leverages the Gemini 2.5 Multimodal Live API and Google DeepMind's Lyria 3 models to deliver high-fidelity environmental soundscapes, musical compositions, and expressive narration on-demand.
Before deploying the server, ensure your environment meets the following technical requirements:
Required for high-performance audio encoding, decoding, and transcoding.
brew install ffmpegwinget install ffmpeg or download from ffmpeg.org.sudo apt update && sudo apt install ffmpegRequired to build the server from source.
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | shRequired if using the pre-compiled NPM package.
node >= 18.0.0The fastest way to integrate the server into your MCP client (e.g., Claude Desktop).
{
"mcpServers": {
"gemini-audio": {
"command": "npx",
"args": ["-y", "gemini-audio-mcp"],
"env": {
"GEMINI_API_KEY": "YOUR_SECURE_API_KEY"
}
}
}
}
For maximum performance, build the Rust binary locally:
git clone https://github.com/mcp-servers/gemini-audio-mcp.git
cd gemini-audio-mcp
cargo build --release
./target/release/gemini-audio-mcp.The server requires a valid Google AI Studio API key.
GEMINI_API_KEY environment variable.generate_soundscape)Synthesizes immersive, vocal-free ambient textures.
{
"name": "generate_soundscape",
"arguments": {
"prompt": "Deep underwater abyss, low-frequency whale songs, rhythmic air bubbles rising, muffled aquatic pressure.",
"duration": 60,
"quality": "high",
"auto_play": true
}
}
generate_music)Generates structural compositions with optional vocal control.
{
"name": "generate_music",
"arguments": {
"prompt": "Melancholic solo cello in a vast cathedral with 5-second decay reverb.",
"bpm": 72,
"song_key": "D minor",
"intensity": 4
}
}
generate_voice)Narration and character dialogue using Gemini 2.5 Native Audio.
{
"name": "generate_voice",
"arguments": {
"text": "The artifacts are stable, but the rift remains open.",
"voice_direction": "Gravelly, urgent, whispered"
}
}
transition_soundscape)Crossfades two distinct environments for seamless scene transitions.
{
"name": "transition_soundscape",
"arguments": {
"from_prompt": "Quiet library silence.",
"to_prompt": "Sudden heavy rain on a tin roof.",
"transition_duration": 8
}
}
| Parameter | Type | Description |
|---|---|---|