Is io.github.base76-research-lab/token-compressor safe to use?

io.github.base76-research-lab/token-compressor has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.

How do I install io.github.base76-research-lab/token-compressor?

io.github.base76-research-lab/token-compressor supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.

What can io.github.base76-research-lab/token-compressor do?

io.github.base76-research-lab/token-compressor provides 1 tool: compress_prompt. See the full tools list on the server page for descriptions and parameters.

What AI clients work with io.github.base76-research-lab/token-compressor?

io.github.base76-research-lab/token-compressor is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.

Is io.github.base76-research-lab/token-compressor actively maintained?

io.github.base76-research-lab/token-compressor is less actively maintained — last commit was 115 days ago. It has 9 GitHub stars.

io.github.base76-research-lab/token-compressor

ollama

Compress prompts 40-60% using local LLM + embedding validation. Preserves all conditionals.

9 1 tool GitHub PyPI

6 open CVEs

No license

Maintained

Last commit 115d ago

Works with most clients

Transport: stdio, sse, http

1 tool · ~87 tok

Grade A · 0.0% of 200K ctx

Edit this pageView history

AI / ML

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio, sse, http · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "token-compressor": {
      "cwd": "/path/to/token-compressor",
      "args": [
        "-m",
        "token_compressor_mcp"
      ],
      "command": "python3"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-base76-research-lab-token-compressor)](https://mcppedia.org/s/io-github-base76-research-lab-token-compressor)

Read me

What io.github.base76-research-lab/token-compressor does

mcp-name: io.github.base76-research-lab/token-compressor

Test This Server

Run this in your terminal to verify the server starts. Then let us know if it worked — your result helps other developers.

uvx 'ollama' 2>&1 | head -1 && echo "✓ Server started successfully"

After testing, let us know if it worked:

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

79/100across 5 weighted dimensions

How we score →

0255075100

−21

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

6 open

CVE-2025-66960lowCVSS 3.1

PYSEC-2026-102

An issue in ollama v.0.12.10 allows a remote attacker to cause a denial of service via the fs/ggml/gguf.go, function readGGUFV1String reads a string length from untrusted GGUF metadata

Affected: >= 0source →

CVE-2025-66959lowCVSS 3.1

PYSEC-2026-101

An issue in ollama v.0.12.10 allows a remote attacker to cause a denial of service via the GGUF decoder

Affected: >= 0source →

CVE-2025-44779lowCVSS 3.1

PYSEC-2025-146

An issue in Ollama v0.1.33 allows attackers to delete arbitrary files via sending a crafted packet to the endpoint /api/pull.

Affected: >= 0source →

CVE-2025-51471lowCVSS 3.1

PYSEC-2025-147

Cross-Domain Token Exposure in server.auth.getAuthorizationToken in Ollama 0.6.7 allows remote attackers to steal authentication tokens and bypass access controls via a malicious realm value in a WWW-Authenticate header returned by the /api/pull endpoint.

Affected: >= 0source →

CVE-2025-1975lowCVSS 3

PYSEC-2025-145

A vulnerability in the Ollama server version 0.5.11 allows a malicious user to cause a Denial of Service (DoS) attack by customizing the manifest content and spoofing a service. This is due to improper validation of array index access when downloading a model via the /api/pull endpoint, which can lead to a server crash.

Affected: >= 0source →

Inventory

Tools (1)

Click any tool to inspect its schema.

~87 tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.base76-research-lab/token-compressor safe to use?: io.github.base76-research-lab/token-compressor has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.base76-research-lab/token-compressor?: io.github.base76-research-lab/token-compressor supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What can io.github.base76-research-lab/token-compressor do?: io.github.base76-research-lab/token-compressor provides 1 tool: compress_prompt. See the full tools list on the server page for descriptions and parameters.
What AI clients work with io.github.base76-research-lab/token-compressor?: io.github.base76-research-lab/token-compressor is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.
Is io.github.base76-research-lab/token-compressor actively maintained?: io.github.base76-research-lab/token-compressor is less actively maintained — last commit was 115 days ago. It has 9 GitHub stars.

Similar servers

Others in ai-ml

View all →

Sequential Thinking MCP Server98

Dynamic problem-solving through sequential thought chains

87.9k 1

Memory MCP Server98

Persistent memory using a knowledge graph

87.9k 5

Gpt Researcher95

An autonomous agent that conducts deep research on any data using any LLM providers

28.0k 5

Ruflo95

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code / Codex Integration

62.3k 4

MCP Security Weekly

Get CVE alerts and security updates for io.github.base76-research-lab/token-compressor and similar servers.

Community

Discussion

Start a conversation

Ask a question, share a tip, or report an issue.

Has anyone used this with Cursor?How do you handle auth?Any alternatives?

Edit this pageView history

AI / ML

Step 1

Install in your client

Config is the same across clients — only the file and path differ.

Supported in Claude Desktopstdio, sse, http · Node 18+

Paste into ~/Library/Application Support/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "token-compressor": {
      "cwd": "/path/to/token-compressor",
      "args": [
        "-m",
        "token_compressor_mcp"
      ],
      "command": "python3"
    }
  }
}

Are you the author?

Add this badge to your README to show your security score and help users find safe servers.

Embed in your READMEAbout badges →

[![MCPpedia Score](https://mcppedia.org/api/badge/io-github-base76-research-lab-token-compressor)](https://mcppedia.org/s/io-github-base76-research-lab-token-compressor)

Read me

What io.github.base76-research-lab/token-compressor does

mcp-name: io.github.base76-research-lab/token-compressor

Test This Server

Run this in your terminal to verify the server starts. Then let us know if it worked — your result helps other developers.

uvx 'ollama' 2>&1 | head -1 && echo "✓ Server started successfully"

After testing, let us know if it worked:

README

token-compressor

Reduce LLM prompt tokens by 30–70% while preserving semantic meaning.

mcp-name: io.github.base76-research-lab/token-compressor

Semantic prompt compression for LLM workflows. Reduce token usage by 40–60% without losing meaning.

Built by Base76 Research Lab — research into epistemic AI architecture.

Live demo

Intent Compiler MVP is now live and uses this project as part of the idea -> spec -> compressed output flow:

Live: https://intent-compiler-mvp.pages.dev
Product repo: https://github.com/base76-research-lab/token-compressor

What it does

token-compressor is a two-stage pipeline that compresses prompts before they reach an LLM:

LLM compression — a local model (llama3.2:1b via Ollama) rewrites the prompt to its semantic minimum, preserving all conditionals and negations
Embedding validation — cosine similarity between original and compressed embeddings must exceed a threshold (default: 0.85) — if not, the original is sent unchanged

The result: shorter prompts, lower costs, same intent.

Input prompt (300 tokens)
        ↓
  LLM compresses
        ↓
  Embedding validates (cosine ≥ 0.85?)
        ↓
  Pass → compressed (120 tokens)   Fail → original (300 tokens)

Key design principle: conditionality is never sacrificed. If your prompt says "only do X if Y", that constraint survives compression.

Requirements

Python 3.10+
Ollama running locally
Two models pulled:

ollama pull llama3.2:1b
ollama pull nomic-embed-text

Python dependencies:

pip install ollama numpy

Quick start

from compressor import LLMCompressEmbedValidate

pipeline = LLMCompressEmbedValidate()
result = pipeline.process("Your prompt text here...")

print(result.output_text)   # compressed (or original if validation failed)
print(result.report())      # MODE / COVERAGE / TOKENS saved

Result object:

Field	Description
`output_text`	Text to send to your LLM
`mode`	`compressed` / `raw_fallback` / `skipped`
`coverage`	Cosine similarity (0.0–1.0)
`tokens_in`	Estimated input tokens
`tokens_out`	Estimated output tokens
`tokens_saved`	Difference

CLI usage

echo "Your long prompt here..." | python3 cli.py

Output: compressed text on stdout, stats on stderr.

Claude Code hook (recommended setup)

Add to your ~/.claude/settings.json under hooks → UserPromptSubmit:

{
  "type": "command",
  "command": "echo \"${CLAUDE_USER_PROMPT:-}\" | python3 /path/to/token-compressor/cli.py > /tmp/compressed_prompt.txt 2>/tmp/compress.log || true"
}

This runs on every prompt submission and writes the compressed version to a temp file, which can be injected back into context via a second hook or MCP server.

MCP server

The MCP server exposes compression as a tool callable from Claude Code and any MCP-compatible client.

Install:

pip install token-compressor-mcp

Tool: compress_prompt

Input: text (string)
Output: compressed text + stats footer

Claude Code MCP config (~/.claude/settings.json):

{
  "mcpServers": {
    "token-compressor": {
      "command": "uvx",
      "args": ["token-compressor-mcp"]
    }
  }
}

Or from source:

{
  "mcpServers": {
    "token-compressor": {
      "command": "python3",
      "args": ["-m", "token_compressor_mcp"],
      "cwd": "/path/to/token-compressor"
    }
  }
}

Configuration

pipeline = LLMCompressEmbedValidate(
    threshold=0.85,          # cosine similarity floor (lower = more aggressive)
  

... [View full README on GitHub](https://github.com/base76-research-lab/token-compressor#readme)

Loading README…

Scored, not listed

Why this score

Five weighted categories — click any category to see the underlying evidence.

Score breakdown

79/100across 5 weighted dimensions

How we score →

0255075100

−21

Security

Maintenance

Efficiency

Documentation

Compatibility

Categoriesclick a row to see evidence

Security

OSV.dev

6 open

CVE-2025-66960lowCVSS 3.1

PYSEC-2026-102

An issue in ollama v.0.12.10 allows a remote attacker to cause a denial of service via the fs/ggml/gguf.go, function readGGUFV1String reads a string length from untrusted GGUF metadata

Affected: >= 0source →

CVE-2025-66959lowCVSS 3.1

PYSEC-2026-101

An issue in ollama v.0.12.10 allows a remote attacker to cause a denial of service via the GGUF decoder

Affected: >= 0source →

CVE-2025-44779lowCVSS 3.1

PYSEC-2025-146

An issue in Ollama v0.1.33 allows attackers to delete arbitrary files via sending a crafted packet to the endpoint /api/pull.

Affected: >= 0source →

CVE-2025-51471lowCVSS 3.1

PYSEC-2025-147

Affected: >= 0source →

CVE-2025-1975lowCVSS 3

PYSEC-2025-145

Affected: >= 0source →

Inventory

Tools (1)

Click any tool to inspect its schema.

~87 tokens total

Community

Reviews

Be the first to review

Have you used this server?

Share your experience — it helps other developers decide.

How easy was setup?Did it work reliably?How was the documentation?

Frequently Asked Questions

Is io.github.base76-research-lab/token-compressor safe to use?: io.github.base76-research-lab/token-compressor has no known CVEs as of the latest MCPpedia security scan. It does not require authentication, so any local process can connect — keep this in mind in shared environments.
How do I install io.github.base76-research-lab/token-compressor?: io.github.base76-research-lab/token-compressor supports copy-paste install configs on its MCPpedia page for Claude Desktop, Cursor, and Claude Code. Scroll to the Quick Install section and select your client.
What can io.github.base76-research-lab/token-compressor do?: io.github.base76-research-lab/token-compressor provides 1 tool: compress_prompt. See the full tools list on the server page for descriptions and parameters.
What AI clients work with io.github.base76-research-lab/token-compressor?: io.github.base76-research-lab/token-compressor is compatible with claude-desktop, cursor, claude-code. It uses stdio and sse and http transport.
Is io.github.base76-research-lab/token-compressor actively maintained?: io.github.base76-research-lab/token-compressor is less actively maintained — last commit was 115 days ago. It has 9 GitHub stars.