From 59a7c96da692ea8d11759e1e9094100752ad633c Mon Sep 17 00:00:00 2001
From: yoobin
Date: Tue, 19 May 2026 04:37:32 +0900
Subject: [PATCH] Add yubinkim444/ai-first-scraper-mcp to Search & Data Extraction
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Self-contained MCP server with Glama listing (badge included). Three tools
for AI agents needing ad-free Markdown web content:

- fetch_page: URL → clean Markdown (HTML + PDF)
- fetch_pages_batch: up to 25 URLs in parallel
- search_web: query → top-k result pages already as Markdown
---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index a58a33b32..12d2513f5 100644
--- a/README.md
+++ b/README.md
@@ -2093,6 +2093,7 @@ Tools for conducting research, surveys, interviews, and data collection.
 - [webscraping-ai/webscraping-ai-mcp-server](https://github.com/webscraping-ai/webscraping-ai-mcp-server) 🎖️ 📇 ☁️ - Interact with [WebScraping.ai](https://webscraping.ai) for web data extraction and scraping.
 - [webpeel/webpeel](https://github.com/webpeel/webpeel) 📇 ☁️ 🏠 - Smart web fetcher for AI agents with auto-escalation from HTTP to headless browser to stealth mode. Includes 9 MCP tools: fetch, search, crawl, map, extract, batch, screenshot, jobs, and agent. Achieved 100% success rate on a 30-URL benchmark.
 - [yamanoku/baseline-mcp-server](https://github.com/yamanoku/baseline-mcp-server) 📇 🏠 - MCP server that searches Baseline status using Web Platform API
+- [yubinkim444/ai-first-scraper-mcp](https://github.com/yubinkim444/ai-first-scraper-mcp) [![yubinkim444/ai-first-scraper-mcp MCP server](https://glama.ai/mcp/servers/yubinkim444/ai-first-scraper-mcp/badges/score.svg)](https://glama.ai/mcp/servers/yubinkim444/ai-first-scraper-mcp) 🐍 🏠 ☁️ 🍎 🪟 🐧 - Three MCP tools for ad-free Markdown web scraping and search. `fetch_page` (URL → clean Markdown), `fetch_pages_batch` (up to 25 URLs in parallel), `search_web` (web search → top-k pages as Markdown). Works with Claude Desktop / Cursor / Cline. Install: `uvx ai-first-scraper-mcp`.
 - [zhsama/duckduckgo-mcp-server](https://github.com/zhsama/duckduckgo-mpc-server/) 📇 🏠 ☁️ - This is a TypeScript-based MCP server that provides DuckDuckGo search functionality.
 - [zoharbabin/google-researcher-mcp](https://github.com/zoharbabin/google-researcher-mcp) [![google-researcher-mcp MCP server](https://glama.ai/mcp/servers/@zoharbabin/google-researcher-mcp/badges/score.svg)](https://glama.ai/mcp/servers/@zoharbabin/google-researcher-mcp) 📇 ☁️ 🏠 - Comprehensive research tools including Google Search (web, news, images), web scraping with JavaScript rendering, academic paper search (arXiv, PubMed, IEEE), patent search, and YouTube transcript extraction.
 - [zlatkoc/youtube-summarize](https://github.com/zlatkoc/youtube-summarize) 🐍 ☁️ - MCP server that fetches YouTube video transcripts and optionally summarizes them. Supports multiple transcript formats (text, JSON, SRT, WebVTT), multi-language retrieval, and flexible YouTube URL parsing.