Fetch any URL and get clean Markdown. Web scraping for AI agents.
Config is the same across clients — only the file and path differ.
{
"mcpServers": {
"webscrape": {
"url": "https://webscrape.mcpize.run",
"headers": {
"Authorization": "Bearer your-api-key"
}
}
}
}Are you the author?
Add this badge to your README to show your security score and help users find safe servers.
MCP server that lets AI agents search the web and extract clean Markdown content — no ads, no clutter, just the text your LLM needs.
No automated test available for this server. Check the GitHub README for setup instructions.
Five weighted categories — click any category to see the underlying evidence.
No known CVEs.
No package registry to scan.
Be the first to review
Have you used this server?
Share your experience — it helps other developers decide.
Sign in to write a review.
Others in browser / ai-ml
Dynamic problem-solving through sequential thought chains
Persistent memory using a knowledge graph
Browser automation with Puppeteer for web scraping and testing
An autonomous agent that conducts deep research on any data using any LLM providers
MCP Security Weekly
Get CVE alerts and security updates for io.github.carrasquelalex1/webscrape-mcp and similar servers.
Start a conversation
Ask a question, share a tip, or report an issue.
Sign in to join the discussion.
English · Español
MCP server that lets AI agents search the web and extract clean Markdown content — no ads, no clutter, just the text your LLM needs.
New features:
extract_schema=Truewebscrape_screenshot tool captures page screenshots with configurable viewport, full-page mode, and PNG/JPEG formatImprovements:
| Tool | Description |
|---|---|
webscrape_fetch_url | Fetch a single URL and return clean Markdown. Supports use_readability, js_render, extract_schema, and auto-detects PDFs |
webscrape_batch_fetch | Fetch up to 5 URLs in parallel. Supports PDF auto-detection, JS rendering, and structured data |
webscrape_search | Search the web (DuckDuckGo → Google → Bing fallback) and scrape results into Markdown |
webscrape_screenshot | Capture a screenshot of any web page with headless Chromium. Supports PNG/JPEG, viewport sizing, and full-page capture |
.pdf or with application/pdf content-type are auto-detected and text is extracted page by pageuse_readability=True to webscrape_fetch_url for cleaner article extraction using Mozilla Readability (removes nav, sidebars, ads, comments)js_render=True to render JavaScript-heavy sites with Playwright (headless Chromium). Auto-fallback when httpx gets 403 or empty contentextract_schema=True to extract JSON-LD, Open Graph, Twitter Cards, meta tags, canonical URLs, and hreflang links{
"mcpServers": {
"webscrape": {
"url": "https://webscrape.mcpize.run",
"headers": {
"Authorization": "Bearer your-api-key"
}
}
}
}
{
"mcpServers": {
"webscrape": {
"url": "https://webscrape-mcp.onrender.com"
}
}
}
git clone https://github.com/carrasquelalex1/webscrape-mcp.git
cd webscrape-mcp
pip install -r requirements.txt
playwright install chromium
python webscrape_mcp.py
io.github.carrasquelalex1/webscrape-mcp
mcp, httpx, beautifulsoup4, markdownify, pydantic, ddgs, readability-lxml, PyMuPDF, playwright
MIT
Servidor MCP que permite a agentes de IA buscar en la web y extraer contenido limpio en Markdown — sin anuncios, sin navegación, solo el texto que tu LLM necesita.
Nuevas características: