{
"mcpServers": {
"mcptube": {
"command": "<see-readme>",
"args": []
}
}
}No install config available. Check the server's README for setup instructions.
Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
Is it safe?
No package registry to scan.
No authentication — any process on your machine can connect.
License not specified.
Is it maintained?
Last commit 2 days ago. 68 stars.
Will it work with my client?
Transport: stdio. Works with Claude Desktop, Cursor, Claude Code, and most MCP clients.
YouTube video knowledge engine — transcripts, vision, and persistent wiki.
mcptube-vision transforms YouTube videos into a persistent, structured knowledge base using both transcripts and visual frame analysis. Built on the Karpathy LLM Wiki pattern: knowledge compounds with every video you add.
Evolved from mcptube v0.1 — mcptube-vision replaces semantic chunk search with a persistent wiki that gets smarter with every video ingested.
Traditional video tools re-discover knowledge from scratch on every query. mcptube-vision is different:
mcptube v0.1 mcptube-vision
┌─────────────────────┐ ┌─────────────────────────┐
│ Query → vector search│ │ Video ingested → LLM │
│ → raw chunks → LLM │ │ extracts knowledge → │
│ → answer (from scratch│ │ wiki pages created → │
│ every time) │ │ cross-references built │
└─────────────────────┘ │ │
│ Query → FTS5 + agent │
│ → reasons over compiled │
│ knowledge → answer │
└─────────────────────────┘
| v0.1 (Video Search Engine) | vision (Video Knowledge Engine) | |
|---|---|---|
| On ingest | Chunk transcript, embed in vector DB | LLM watches + reads, writes wiki pages |
| On query | Find similar chunks | Agent reasons over compiled knowledge |
| Frames | Timestamp or keyword extraction | Scene-change detection + vision model |
| Cross-video | Re-search all chunks each time | Connections already in the wiki |
| Over time | Library of isolated videos | Compounding knowledge base |
mcptube-vision is built around a core insight: video knowledge should compound, not be re-discovered. Every architectural decision flows from this principle.
flowchart TD
YT[YouTube URL] --> EXT[YouTubeExtractor\ntranscript + metadata]
EXT --> FRAMES[SceneFrameExtractor\nffmpeg scene-change detection]
FRAMES --> VISION[VisionDescriber\nLLM vision model]
VISION --> WIKI_EXT[WikiExtractor\nLLM knowledge extraction]
EXT --> WIKI_EXT
WIKI_EXT --> WIKI_ENG[WikiEngine\nmerge + update]
WIKI_ENG --> FILE[FileWikiRepository\nJSON pages on disk]
WIKI_ENG --> FTS[SQLite FTS5\nsearch index]
FILE --> AGENT[Ask Agent\nFTS5 → LLM reasoning]
FTS --> AGENT
FILE --> CLI[CLI / MCP Server]
FTS --> CLI
subgraph Ingestion Pipeline
EXT
FRAMES
VISION
WIKI_EXT
end
subgraph Knowledge Store
WIKI_ENG
FILE
FTS
end
subgraph Retrieval
AGENT
end
The system overview shows three distinct subsystems connected by a unidirectional data flow. The Ingestion Pipeline (left) transforms a raw YouTube URL into structured knowledge through four stages: transcript extraction, scene-change frame detection, vision-model description, and LLM-powered knowledge extraction. Each stage enriches the signal — raw video becomes text, text becomes typed knowledge objects.
The Knowledge Store (center) is the persistent layer. The WikiEngine applies merge semantics — deciding whether to create new pages or append to existing ones — then writes JSON files to disk and updates the FTS5 search index in parallel. These two stores serve different access patterns: files for full-page reads
No automated test available for this server. Check the GitHub README for setup instructions.
This server is missing a description. Tools and install config are also missing.If you've used it, help the community.
Add informationNo known vulnerabilities.
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Persistent memory using a knowledge graph
Hash-verified file editing MCP server with token efficiency hook. 11 tools for AI coding agents.
Privacy-first. MCP is the protocol for tool access. We're the virtualization layer for context.
Monitor browser logs directly from Cursor and other MCP compatible IDEs.
MCP Security Weekly
Get CVE alerts and security updates for Mcptube and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.