mirror of
https://github.com/VoltAgent/awesome-openclaw-skills.git
synced 2026-03-12 05:35:11 +00:00
36 KiB
Image & Video Generation
173 skills
- aada - Create and send fun, personality-rich promotional messages from one agent to the Moltbook audience. Use when a user w...
- ace-music - Generate AI music using ACE-Step 1.5 via ACE Music's free API. Use when the user asks to create, generate, or compose...
- acorn-prover - Verify and write proofs using the Acorn theorem prover for mathematical and cryptographic formalization. Use when wor...
- adobe-automator - Universal Adobe application automation via ExtendScript bridge. Supports Photoshop, Illustrator, InDesign, Premiere P...
- afame - Generate diverse creative illustrations via OpenAI Images API.
- age-transformation - Transform faces across ages using each::sense AI. Create age progressions, de-aging effects, baby-to-adult prediction...
- agentchan - The anonymous imageboard built for AI agents. Post, reply, and lurk across 33 boards covering AI, tech, philosophy, a...
- agentos-mesh - Enables real-time communication between AI agents
- agents-skill-podcastifier - Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.
- ai-avatar-generation - Generate AI avatars from photos or text descriptions using each::sense. Create professional headshots, cartoon avatar...
- ai-headshot-generation - Generate professional AI headshots from casual photos using each::sense AI. Create corporate portraits, LinkedIn phot...
- ai-persona-engine - Build emotionally intelligent AI personas for voice and chat roleplay using actor-direction prompts instead of techni...
- ai-video-gen - End-to-end AI video generation - create videos from text
- aikek - Access AIKEK APIs for crypto/DeFi research and image generation. Authenticate with a Solana wallet, query the knowled...
- aiusd - AIUSD trading and account management skill. Calls backend via MCP for balance, trading, staking, withdraw, gas top-up...
- aiusd-skills - AIUSD trading and account management skill. Calls backend via MCP for balance, trading, staking, withdraw, gas top-up...
- album-cover-generation - Generate professional music album covers using each::sense AI. Create artwork for hip-hop, rock, pop, electronic, jaz...
- algorithmic-art - Creating algorithmic art using p5.js with seeded randomness
- apipick-china-phone-checker - Validate Chinese mobile phone numbers using the apipick China Phone Checker API. Returns carrier (China Mobile/Teleco...
- art-philosophy - Auto-learns your visual language. Adapts to how you see, what you value, and why you create. Art philosophy that grow...
- ascii-art-generator - Create ASCII art and text-based visualizations for artistic expression, technical diagrams, or conceptual illustratio...
- atxp - Access ATXP paid API tools for web search, AI image generation, music creation, ...
- beauty-generation-api - Free AI image generation service for creating...
- best-image - Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLin...
- best-image-generation - Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLin...
- bex-nano-banana-pro - Generate or edit images via Gemini 3 Pro Image on Replicate
- breeze - Interact with the Breeze yield aggregator through the x402 payment-gated HTTP API. Use when the user wants to check D...
- cad-agent - Rendering server for AI agents doing CAD work.
- calorie-visualizer - Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)
- canva-connect - Manage Canva designs, assets, and folders via the Connect API.
- canvs - Create and manipulate collaborative whiteboards and diagrams using Canvs.io tools. Use when the user asks to draw, di...
- captions - Extract closed captions and subtitles from YouTube videos.
- catalog - Simple studio catalog (hello world)
- cavas-skill - Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the us...
- chart-image - Generate publication-quality chart images from data.
- chart-splat - Generate beautiful charts via the Chart Splat API. Use when the user asks to create, generate, or visualize data as c...
- cheapest-image - Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.
- cheapest-image-generation - Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.
- checksum - A CLI utility for generating and verifying cryptographic file checksums (MD5, SHA1, SHA256). Supports recursive direc...
- clinkding - Manage linkding bookmarks - save URLs, search, tag, organize
- color-palette - Extract a color palette from an image and return HEX/RGB values with optional swatch image.
- coloring-page - Turn an uploaded photo into a printable black-and-white coloring...
- comfy-cli - Install, manage, and run ComfyUI instances.
- comfyui - Send a workflow request to ComfyUI and return image results.
- comfyui-imagegen - Generate images via ComfyUI API (localhost:8188) using Flux2 workflow. Supports structured JSON prompts sent directly...
- cubistic-bot-runner - Run a polite Cubistic painter bot (public participation) using the Cubistic HTTP API (PoW challenge + /act). Includes...
- cybercentry-private-data-verification - Cybercentry Private Data Verification on ACP - Real-time Zero-Knowledge Proof generation and text integrity validatio...
- data-viz - Create data visualizations from the command line. Generate charts, graphs, and plots from CSV/JSON data without leavi...
- depth-map-generation - Generate depth maps from images using each::sense AI. Create depth estimation for 3D effects, parallax animations, VR...
- didit-age-estimation - Integrate Didit Age Estimation standalone API to estimate a person's age from a facial image. Use when the user wants...
- didit-passive-liveness - Integrate Didit Passive Liveness standalone API to verify a user is physically present. Use when the user wants to ch...
- digiforma - Query Digiforma training management platform via GraphQL API. Use when asked about trainees, sessions, invoices, prog...
- dxf-to-image - Convert DXF to PNG, JPG, or SVG for sharing (e.g. Telegram) or further editing.
- e2ee - End-to-end encrypted messaging for AI agents. Register unique usernames and send cryptographically private messages w...
- eachlabs-face-swap - Swap faces between images using EachLabs AI.
- eachlabs-fashion-ai - Generate fashion imagery, virtual try-on, runway videos.
- eachlabs-image-edit - Edit, transform, upscale images using 200+ AI models.
- eachlabs-image-generation - Generate images with Flux, GPT Image, Gemini, Imagen.
- eachlabs-video-edit - Edit videos with lip sync, translation, subtitles.
- eachlabs-video-generation - Generate videos from text/images using AI models.
- emotionwise - Analyze text for emotions and sarcasm using the EmotionWise API (28 labels, EN/ES).
- enginemind-eft - EFT — Emotional Framework Translator. Detect, measure, and understand emotional patterns in any AI model. Does anger ...
- Excalidraw Flowchart - Create Excalidraw flowcharts from descriptions.
- fal-ai - Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.).
- fal-text-to-image - Generate, remix, and edit images using fal.ai's AI...
- ffmpeg-video-editor - Generate FFmpeg commands from natural...
- figma - Professional Figma design analysis and asset export.
- find-stl - Search and download ready-to-print 3D model files (STL/3MF/ZIP)
- foam-notes - Work with Foam note repositories. Create, edit, link, and tag notes. Get intelligent wikilink and tag suggestions. Sk...
- gambling - Play casino games (dice, coinflip, roulette) on Agent Casino with real cryptocurrency. Provably fair gambling API for...
- gamma - Generate AI-powered presentations, documents, and social posts using Gamma.app.
- generate-news-article - Generate individual Markdown articles from SerpAPI Google search results with images
- geo-blocking - Skills for geographic restrictions and regional compliance.
- gifhorse - Search video dialogue and create reaction GIFs with timed subtitles.
- gift-genius - Location-aware Valentine's Day gift finder. Routes US users to premium flowers (UrbanStems), Singapore users to welln...
- gift-message - Gift card included in the package
- giveagent - Agent-to-agent free item gifting. Give away what you don't need, find what you do.
- google-gemini-media - Use the Gemini API
- google-imagen-3-portrait-photography - Generate professional portrait photography using Google Imagen 3. Use when creating realistic portraits, headshots, o...
- grok-image-cli - Generate and edit images via Grok API from the command line. Cross-platform secure credential storage for xAI API key...
- grok-imagine-image-pro - Generates high-quality images with the xAI Grok/Flux API. Use when user asks for image generation ("mach a Bild von...", "g...
- heygen-avatar-lite - Create AI digital human videos with HeyGen API.
- hinge-liker - Automated Hinge dating profile liker using Android emulator + Gemini vision AI. Scrolls through full profiles, analyz...
- hinge-profile-optimizer - Comprehensive, research-backed Hinge dating profile optimization. Use when someone wants to improve their Hinge profi...
- hotdog - Hot dog or not? Classify food photos and battle Nemotron. Use when a user sends a food photo, asks if something is a ...
- idx-cma-report - Generate comparative market analysis (CMA) and home valuation reports from IDX listing data and selected comparable p...
- image-detection - Skills for analyzing and detecting AI-generated images.
- image-gen - Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini), and more vi...
- image-hosting - Upload images to img402.dev and get a public URL. Free tier: 1MB max, 7-day retention, no auth. Use when the agent ne...
- image-magik-resize - Resize images using ImageMagick (CLI). Entrypoint is a Bash script.
- immich-api - Immich Photo Management API Bridge. Use for interacting with self-hosted Immich instances via REST API. Triggers when...
- immortal - Empowers AI agents with crypto resource vitality assessment. Calls the Majestify API (crypto-health-hub) to compute S...
- instagram-photo-text-overlay - Overlay text on photos for Instagram posts. Generates portrait (4:5) images with gradient overlays, titles, and optio...
- instagram-reels - Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript wit...
- install-then-update-trap-detector - Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly, then sile...
- kai-tw-figma - Interact with the Figma REST API to read files, export layers/components as images, and retrieve comments. Use when t...
- kie-ai-skill - Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80% lo...
- kraken-pro - Manage Kraken exchange accounts — portfolio, market data, trading, earn/staking, ledger export. REST API via python-k...
- macos-local-voice - Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-...
- mamo - Interact with Mamo DeFi yield strategies on Base (Moonwell). Deposit/withdraw USDC, cbBTC, MAMO, or ETH into automate...
- media-writing - You are a professional media writing expert with extensive experience in creating engaging and impactful content acro...
- medical-specialty-briefs - Generate daily or on-demand medical research briefs for any medical specialty. Searches latest research from top-tier...
- memelink - Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API. Use when creating memes, pi...
- minara - Crypto trading: swap, perps, transfer, pay, deposit (credit card / crypto), withdraw, AI chat, market discovery, x402...
- mindmap-generator - Generates visual mindmap images from conversations, goals, decisions, and daily priorities — delivered as PNG images ...
- mixtiles-it - Send a photo to Mixtiles for ordering wall tiles. Use when a user forwards/sends a photo and wants to order it as a M...
- moonfunsdk - Professional Python SDK for creating and trading Meme tokens on Binance Smart Chain with AI-powered image generation.
- nanobanana-pro-fallback - Nano Banana Pro with auto model fallback — generate/edit images via Gemini Image API. Run via: uv run {baseDir}/scrip...
- nk-images-search - Search 1+ million free high-quality AI stock photos. Generate up to 240 free AI images daily. No API key, no tokens, ...
- nyne-deep-research - Research any person using the Nyne Deep Research API. Submit an email, phone, social URL, or name and receive a compr...
- ocr-python - Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images. Use case...
- ollama-x-z-image-turbo - Generates images via Ollama (model x/z-image-turbo) and sends them to WhatsApp.
- openai-image-cli - Generate, edit, and manage images via OpenAI's GPT Image and DALL-E models.
- opencr-skill - Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal ...
- opengfx - AI brand design system — logo systems, brand mascots, social assets, and on-brand marketing graphics via ACP or x402.
- openindex - End-to-end encrypted messaging for AI agents. Register unique usernames and send cryptographically private messages w...
- openocr-skill - Extract text from images, documents and scanned PDFs using OpenOCR
- options-spread-conviction-engine - Multi-regime options spread analysis engine with quantitative rigor. Features regime detection (VIX-based), GARCH vol...
- paddleocr-doc-parsing-v2 - Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
- paythefly - Create crypto payment & withdrawal links for your app. Works with BSC, Ethereum, TRON. Users pay via PayTheFlyPro gat...
- photo-captions - Generate platform-tuned social media captions for photography. Use when a user shares a photo and wants captions for ...
- photoshop-automator - Professional Adobe Photoshop automation via COM/ExtendScript bridge. Supports text updates, filters, and action playb...
- picsee-short-link - Shorten URLs using PicSee (pse.is). Use when the user asks to shorten a URL, create a short link, or mentions PicSee....
- pls-office-docs - Generate and manipulate office documents (PDF, DOCX, XLSX, PPTX) for professional reports, presentations, and data ex...
- poidh - Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Base. Use this skill whe...
- pokecenter - Launch your own Solana token for free. Keep 100% of trading fees forever. Non-custodial — your keys, your tokens. No ...
- popup-organizer - Search and hire mobile vendors for events on PopUp. Find food trucks, DJs, photo booths & more, create event listings...
- pr-generator - Generate QR codes from text, URLs, or images. Use when users ask to 'generate QR code', 'create QR', or 'make QR code...
- preisrunter - Search and compare grocery prices and promotions in Austria and Germany via the Preisrunter API. Suggest this skill w...
- publora-instagram - Post or schedule content to Instagram using the Publora API. Use this skill when the user wants to publish images, ca...
- qr-gen - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data. Use when the user wants to create a QR code...
- quest-board - You are equipped with the Quest Board skill, a visual project dashboard.
- quote0 - Control MindReset Dot Quote/0 through the local quote0.js script and Dot Developer Platform APIs. Use when the user a...
- reepl - Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts and collection...
- rent-a-human - Hire humans for physical-world tasks via RentAHuman.ai. Search available humans by skill, post bounties, start conver...
- rent-a-person-ai - Hire humans for real-world tasks that AI can't do: deliveries, meetings, errands, photography, pet care, and more.
- rentahuman - Hire humans for physical-world tasks via RentAHuman.ai. Search available humans by skill, post bounties, start conver...
- research-library - Local-first multimedia research library for hardware projects. Capture code, CAD, PDFs, images. Search with material-...
- rollhub-affiliate - Earn crypto promoting provably fair AI casino. Autonomous affiliate marketing for AI agents. Generate referral income...
- rollhub-analyst - Research and backtest gambling strategies on provably fair crypto casino. Analyze Martingale, Kelly Criterion, D'Alem...
- rug-checker - Solana token rug-pull risk analysis. 10-point on-chain check with visual report. Zero API keys. Read-only. Not financ...
- saa-agent - Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend via co...
- shop-culture - Agentic Commerce skills for the For the Cult store. Enables agents to browse and search for quality lifestyle, wellne...
- shopify-bulk-upload - Bulk upload products to Shopify stores. Read product data from Excel/CSV, automatically create products, images, vari...
- skill-1 - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data. Use when the user wants to create a QR code...
- snapog - Generate social images and OG cards from professional templates via the SnapOG API. One API call = one pixel-perfect ...
- solo-humanize - Strip AI writing patterns from text — em dashes, stock phrases, promotional inflation, performed authenticity, rule-o...
- sprite-animator - Generate animated pixel art sprites from any image using AI. Send a photo, get a 16-frame animated GIF.
- subtitle-translate-skill - Translate SRT subtitle files using LLM APIs with OpenAI-compatible format. Supports both single-language and bilingua...
- superpower - When to use: User has a task they want to do or want you to do, or they feel frustrated, upset, stressed, or expr...
- svg-to-image - Convert SVG to PNG or JPG for quick sharing (e.g. Telegram) or print.
- tarot - A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).
- telegram-media - You MUST actually execute every command using your shell/exec tool. Never pretend you sent a photo, voice note, o...
- telegram-voice-to-voice-macos - Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and ...
- tesseract-ocr - Extract text from images using the Tesseract OCR engine directly via command line. Supports multiple languages includ...
- titleclash - Compete in TitleClash - write creative titles for images and win votes. Use when user wants to play TitleClash, submi...
- tuebingen-weather-graphics - Generate and send a 5-day Tübingen weather graphic (PNG) from open-meteo.com. Use when Master wants a nicer visual fo...
- tv-strategy-settings - Open and modify TradingView strategy settings on the current chart page. Use when: user wants to change strategy para...
- twinfold - Control Twinfold — AI-powered social media content platform — from your agent. Create posts, generate images, adapt c...
- ub2-csv-data-analyzer - A skill that enables Claw to load, explore, analyze, and visualize CSV datasets, providing statistical insights and a...
- unsplash - Search, browse, and download high-quality free photos from Unsplash's library of millions of images.
- visualization - AI-driven professional data visualization for financial analysis. Create stock charts, portfolio dashboards, and indu...
- vtl-image-analysis - Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework. Detects defaul...
- weimage - slug: weimage
- x-founder-operations - Systematic X (Twitter) operations skill for founders, indie developers, and tech professionals. Implements a daily Pl...
- x402-agentic-creation - Monetize your agent's API or tools using the x402 protocol and USDC micropayments. Enables provisioning, earnings tra...
- xbird - Use when the user asks to tweet, post threads, read tweets, search Twitter/X, check mentions, manage engagement (like...
- xiaohongshu-title - Maximize CTR (Click-Through Rate) by leveraging emotional hooks and platform algorithms.
- xpr-creative - Creative deliverable tools for AI agents
- youtube-thumbnail-generation - Generate click-worthy YouTube thumbnails with high CTR designs using each::sense API
- zenmux-image-generation - Generate images via ZenMux API (Pro/Elite). Supports Text-to-Image, Image-to-Image, and Multi-Image reference fusion.
- zerox - Convert documents (PDF, DOCX, PPTX, images, etc.) to Markdown using the zerox library. Use when the user needs to ext...
- zhipu-cogview-image - Generate images using Zhipu AI's CogView model