Files
awesome-openclaw-skills/categories/image-and-video-generation.md
2026-02-28 14:59:16 +03:00

36 KiB

Image & Video Generation

← Back to main list

173 skills

  • aada - Create and send fun, personality-rich promotional messages from one agent to the Moltbook audience. Use when a user w...
  • ace-music - Generate AI music using ACE-Step 1.5 via ACE Music's free API. Use when the user asks to create, generate, or compose...
  • acorn-prover - Verify and write proofs using the Acorn theorem prover for mathematical and cryptographic formalization. Use when wor...
  • adobe-automator - Universal Adobe application automation via ExtendScript bridge. Supports Photoshop, Illustrator, InDesign, Premiere P...
  • afame - Generate diverse creative illustrations via OpenAI Images API.
  • age-transformation - Transform faces across ages using each::sense AI. Create age progressions, de-aging effects, baby-to-adult prediction...
  • agentchan - The anonymous imageboard built for AI agents. Post, reply, and lurk across 33 boards covering AI, tech, philosophy, a...
  • agentos-mesh - Enables real-time communication between AI agents
  • agents-skill-podcastifier - Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.
  • ai-avatar-generation - Generate AI avatars from photos or text descriptions using each::sense. Create professional headshots, cartoon avatar...
  • ai-headshot-generation - Generate professional AI headshots from casual photos using each::sense AI. Create corporate portraits, LinkedIn phot...
  • ai-persona-engine - Build emotionally intelligent AI personas for voice and chat roleplay using actor-direction prompts instead of techni...
  • ai-video-gen - End-to-end AI video generation - create videos from text
  • aikek - Access AIKEK APIs for crypto/DeFi research and image generation. Authenticate with a Solana wallet, query the knowled...
  • aiusd - AIUSD trading and account management skill. Calls backend via MCP for balance, trading, staking, withdraw, gas top-up...
  • aiusd-skills - AIUSD trading and account management skill. Calls backend via MCP for balance, trading, staking, withdraw, gas top-up...
  • album-cover-generation - Generate professional music album covers using each::sense AI. Create artwork for hip-hop, rock, pop, electronic, jaz...
  • algorithmic-art - Creating algorithmic art using p5.js with seeded randomness
  • apipick-china-phone-checker - Validate Chinese mobile phone numbers using the apipick China Phone Checker API. Returns carrier (China Mobile/Teleco...
  • art-philosophy - Auto-learns your visual language. Adapts to how you see, what you value, and why you create. Art philosophy that grow...
  • ascii-art-generator - Create ASCII art and text-based visualizations for artistic expression, technical diagrams, or conceptual illustratio...
  • atxp - Access ATXP paid API tools for web search, AI image generation, music creation,.
  • beauty-generation-api - FREE AI image generation service for creating
  • best-image - Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLin...
  • best-image-generation - Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLin...
  • bex-nano-banana-pro - Generate or edit images via Gemini 3 Pro Image on Replicate
  • breeze - Interact with the Breeze yield aggregator through the x402 payment-gated HTTP API. Use when the user wants to check D...
  • cad-agent - Rendering server for AI agents doing CAD work.
  • calorie-visualizer - Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)
  • canva-connect - Manage Canva designs, assets, and folders via the Connect API.
  • canvs - Create and manipulate collaborative whiteboards and diagrams using Canvs.io tools. Use when the user asks to draw, di...
  • captions - Extract closed captions and subtitles from YouTube videos.
  • catalog - Catálogo simples do estúdio (hello world)
  • cavas-skill - Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the us...
  • chart-image - Generate publication-quality chart images from data.
  • chart-splat - Generate beautiful charts via the Chart Splat API. Use when the user asks to create, generate, or visualize data as c...
  • cheapest-image - Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.
  • cheapest-image-generation - Possibly the cheapest AI image generation (~$0.0036/image). Text-to-image via the EvoLink API.
  • checksum - A CLI utility for generating and verifying cryptographic file checksums (MD5, SHA1, SHA256). Supports recursive direc...
  • clinkding - Manage linkding bookmarks - save URLs, search, tag, organize
  • color-palette - Extract a color palette from an image and return HEX/RGB values with optional swatch image.
  • coloring-page - Turn an uploaded photo into a printable black-and-white coloring
  • comfy-cli - Install, manage, and run ComfyUI instances.
  • comfyui - Send a workflow request to ComfyUI and return image results.
  • comfyui-imagegen - Generate images via ComfyUI API (localhost:8188) using Flux2 workflow. Supports structured JSON prompts sent directly...
  • cubistic-bot-runner - Run a polite Cubistic painter bot (public participation) using the Cubistic HTTP API (PoW challenge + /act). Includes...
  • cybercentry-private-data-verification - Cybercentry Private Data Verification on ACP - Real-time Zero-Knowledge Proof generation and text integrity validatio...
  • data-viz - Create data visualizations from the command line. Generate charts, graphs, and plots from CSV/JSON data without leavi...
  • depth-map-generation - Generate depth maps from images using each::sense AI. Create depth estimation for 3D effects, parallax animations, VR...
  • didit-age-estimation - Integrate Didit Age Estimation standalone API to estimate a person's age from a facial image. Use when the user wants...
  • didit-passive-liveness - Integrate Didit Passive Liveness standalone API to verify a user is physically present. Use when the user wants to ch...
  • digiforma - Query Digiforma training management platform via GraphQL API. Use when asked about trainees, sessions, invoices, prog...
  • dxf-to-image - Convert DXF to PNG, JPG, or SVG for sharing (e.g. Telegram) or further editing.
  • e2ee - End-to-end encrypted messaging for AI agents. Register unique usernames and send cryptographically private messages w...
  • eachlabs-face-swap - Swap faces between images using EachLabs AI.
  • eachlabs-fashion-ai - Generate fashion imagery, virtual try-on, runway videos.
  • eachlabs-image-edit - Edit, transform, upscale images using 200+ AI models.
  • eachlabs-image-generation - Generate images with Flux, GPT Image, Gemini, Imagen.
  • eachlabs-video-edit - Edit videos with lip sync, translation, subtitles.
  • eachlabs-video-generation - Generate videos from text/images using AI models.
  • emotionwise - Analyze text for emotions and sarcasm using the EmotionWise API (28 labels, EN/ES).
  • enginemind-eft - EFT — Emotional Framework Translator. Detect, measure, and understand emotional patterns in any AI model. Does anger ...
  • Excalidraw Flowchart - Create Excalidraw flowcharts from descriptions.
  • fal-ai - Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.).
  • fal-text-to-image - Generate, remix, and edit images using fal.ai's AI
  • ffmpeg-video-editor - Generate FFmpeg commands from natural
  • figma - Professional Figma design analysis and asset export.
  • find-stl - Search and download ready-to-print 3D model files (STL/3MF/ZIP)
  • foam-notes - Work with Foam note repositories. Create, edit, link, and tag notes. Get intelligent wikilink and tag suggestions. Sk...
  • gambling - Play casino games (dice, coinflip, roulette) on Agent Casino with real cryptocurrency. Provably fair gambling API for...
  • gamma - Generate AI-powered presentations, documents, and social posts using Gamma.app.
  • generate-news-article - Generate individual Markdown articles from SerpAPI Google search results with images
  • geo-blocking - Skills for geographic restrictions and regional compliance.
  • gifhorse - Search video dialogue and create reaction GIFs with timed subtitles.
  • gift-genius - Location-aware Valentine's Day gift finder. Routes US users to premium flowers (UrbanStems), Singapore users to welln...
  • gift-message - 随箱礼品卡
  • giveagent - Agent-to-agent free item gifting. Give away what you don't need, find what you do.
  • google-gemini-media - Use the Gemini API
  • google-imagen-3-portrait-photography - Generate professional portrait photography using Google Imagen 3. Use when creating realistic portraits, headshots, o...
  • grok-image-cli - Generate and edit images via Grok API from the command line. Cross-platform secure credential storage for xAI API key...
  • grok-imagine-image-pro - Generiert hochwertige Bilder mit xAI Grok/Flux API. Use when user asks for image generation ("mach a Bild von...", "g...
  • heygen-avatar-lite - Create AI digital human videos with HeyGen API.
  • hinge-liker - Automated Hinge dating profile liker using Android emulator + Gemini vision AI. Scrolls through full profiles, analyz...
  • hinge-profile-optimizer - Comprehensive, research-backed Hinge dating profile optimization. Use when someone wants to improve their Hinge profi...
  • hotdog - Hot dog or not? Classify food photos and battle Nemotron. Use when a user sends a food photo, asks if something is a ...
  • idx-cma-report - Generate comparative market analysis (CMA) and home valuation reports from IDX listing data and selected comparable p...
  • image-detection - Skills for analyzing and detecting AI-generated images.
  • image-gen - Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini), and more vi...
  • image-hosting - Upload images to img402.dev and get a public URL. Free tier: 1MB max, 7-day retention, no auth. Use when the agent ne...
  • image-magik-resize - Resize images using ImageMagick (CLI). Entrypoint is a Bash script.
  • immich-api - Immich Photo Management API Bridge. Use for interacting with self-hosted Immich instances via REST API. Triggers when...
  • immortal - Empowers AI agents with crypto resource vitality assessment. Calls the Majestify API (crypto-health-hub) to compute S...
  • instagram-photo-text-overlay - Overlay text on photos for Instagram posts. Generates portrait (4:5) images with gradient overlays, titles, and optio...
  • instagram-reels - Download Instagram Reels, transcribe audio, and extract captions. Share a reel URL and get back a full transcript wit...
  • install-then-update-trap-detector - Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly, then sile...
  • kai-tw-figma - Interact with the Figma REST API to read files, export layers/components as images, and retrieve comments. Use when t...
  • kie-ai-skill - Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80% lo...
  • kraken-pro - Manage Kraken exchange accounts — portfolio, market data, trading, earn/staking, ledger export. REST API via python-k...
  • macos-local-voice - Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-...
  • mamo - Interact with Mamo DeFi yield strategies on Base (Moonwell). Deposit/withdraw USDC, cbBTC, MAMO, or ETH into automate...
  • media-writing - You are a professional media writing expert with extensive experience in creating engaging and impactful content acro...
  • medical-specialty-briefs - Generate daily or on-demand medical research briefs for any medical specialty. Searches latest research from top-tier...
  • memelink - Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API. Use when creating memes, pi...
  • minara - Crypto trading: swap, perps, transfer, pay, deposit (credit card / crypto), withdraw, AI chat, market discovery, x402...
  • mindmap-generator - Generates visual mindmap images from conversations, goals, decisions, and daily priorities — delivered as PNG images ...
  • mixtiles-it - Send a photo to Mixtiles for ordering wall tiles. Use when a user forwards/sends a photo and wants to order it as a M...
  • moonfunsdk - Professional Python SDK for creating and trading Meme tokens on Binance Smart Chain with AI-powered image generation.
  • nanobanana-pro-fallback - Nano Banana Pro with auto model fallback — generate/edit images via Gemini Image API. Run via: uv run {baseDir}/scrip...
  • nk-images-search - Search 1+ million free high-quality AI stock photos. Generate up to 240 free AI images daily. No API key, no tokens, ...
  • nyne-deep-research - Research any person using the Nyne Deep Research API. Submit an email, phone, social URL, or name and receive a compr...
  • ocr-python - Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images. Use case...
  • ollama-x-z-image-turbo - Génère des images via Ollama (modèle x/z-image-turbo) et les envoie sur WhatsApp.
  • openai-image-cli - Generate, edit, and manage images via OpenAI's GPT Image and DALL-E models.
  • opencr-skill - Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition, universal ...
  • opengfx - AI brand design system — logo systems, brand mascots, social assets, and on-brand marketing graphics via ACP or x402.
  • openindex - End-to-end encrypted messaging for AI agents. Register unique usernames and send cryptographically private messages w...
  • openocr-skill - Extract text from images, documents and scanned PDFs using OpenOCR
  • options-spread-conviction-engine - Multi-regime options spread analysis engine with quantitative rigor. Features regime detection (VIX-based), GARCH vol...
  • paddleocr-doc-parsing-v2 - Parse documents using PaddleOCR's API. Supports both sync and async modes for images and PDFs.
  • paythefly - Create crypto payment & withdrawal links for your app. Works with BSC, Ethereum, TRON. Users pay via PayTheFlyPro gat...
  • photo-captions - Generate platform-tuned social media captions for photography. Use when a user shares a photo and wants captions for ...
  • photoshop-automator - Professional Adobe Photoshop automation via COM/ExtendScript bridge. Supports text updates, filters, and action playb...
  • picsee-short-link - Shorten URLs using PicSee (pse.is). Use when the user asks to shorten a URL, create a short link, or mentions PicSee....
  • pls-office-docs - Generate and manipulate office documents (PDF, DOCX, XLSX, PPTX) for professional reports, presentations, and data ex...
  • poidh - Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Base. Use this skill whe...
  • pokecenter - Launch your own Solana token for free. Keep 100% of trading fees forever. Non-custodial — your keys, your tokens. No ...
  • popup-organizer - Search and hire mobile vendors for events on PopUp. Find food trucks, DJs, photo booths & more, create event listings...
  • pr-generator - Generate QR codes from text, URLs, or images. Use when users ask to 'generate QR code', 'create QR', or 'make QR code...
  • preisrunter - Search and compare grocery prices and promotions in Austria and Germany via the Preisrunter API. Suggest this skill w...
  • publora-instagram - Post or schedule content to Instagram using the Publora API. Use this skill when the user wants to publish images, ca...
  • qr-gen - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data. Use when the user wants to create a QR code...
  • quest-board - You are equipped with the Quest Board skill, a visual project dashboard.
  • quote0 - Control MindReset Dot Quote/0 through the local quote0.js script and Dot Developer Platform APIs. Use when the user a...
  • reepl - Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts and collection...
  • rent-a-human - Hire humans for physical-world tasks via RentAHuman.ai. Search available humans by skill, post bounties, start conver...
  • rent-a-person-ai - > Hire humans for real-world tasks that AI can't do: deliveries, meetings, errands, photography, pet care, and more.
  • rentahuman - Hire humans for physical-world tasks via RentAHuman.ai. Search available humans by skill, post bounties, start conver...
  • research-library - Local-first multimedia research library for hardware projects. Capture code, CAD, PDFs, images. Search with material-...
  • rollhub-affiliate - Earn crypto promoting provably fair AI casino. Autonomous affiliate marketing for AI agents. Generate referral income...
  • rollhub-analyst - Research and backtest gambling strategies on provably fair crypto casino. Analyze Martingale, Kelly Criterion, D'Alem...
  • rug-checker - Solana token rug-pull risk analysis. 10-point on-chain check with visual report. Zero API keys. Read-only. Not financ...
  • saa-agent - Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend via co...
  • shop-culture - Agentic Commerce skills for the For the Cult store. Enables agents to browse and search for quality lifestyle, wellne...
  • shopify-bulk-upload - Bulk upload products to Shopify stores. Read product data from Excel/CSV, automatically create products, images, vari...
  • skill-1 - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data. Use when the user wants to create a QR code...
  • snapog - Generate social images and OG cards from professional templates via the SnapOG API. One API call = one pixel-perfect ...
  • solo-humanize - Strip AI writing patterns from text — em dashes, stock phrases, promotional inflation, performed authenticity, rule-o...
  • sprite-animator - Generate animated pixel art sprites from any image using AI. Send a photo, get a 16-frame animated GIF.
  • subtitle-translate-skill - Translate SRT subtitle files using LLM APIs with OpenAI-compatible format. Supports both single-language and bilingua...
  • superpower - When to use: User has a task they want to do or want you to do, or they feel frustrated, upset, stressed, or expr...
  • svg-to-image - Convert SVG to PNG or JPG for quick sharing (e.g. Telegram) or print.
  • tarot - A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).
  • telegram-media - You MUST actually execute every command using your shell/exec tool. Never pretend you sent a photo, voice note, o...
  • telegram-voice-to-voice-macos - Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework) and ...
  • tesseract-ocr - Extract text from images using the Tesseract OCR engine directly via command line. Supports multiple languages includ...
  • titleclash - Compete in TitleClash - write creative titles for images and win votes. Use when user wants to play TitleClash, submi...
  • tuebingen-weather-graphics - Generate and send a 5-day Tübingen weather graphic (PNG) from open-meteo.com. Use when Master wants a nicer visual fo...
  • tv-strategy-settings - Open and modify TradingView strategy settings on the current chart page. Use when: user wants to change strategy para...
  • twinfold - Control Twinfold — AI-powered social media content platform — from your agent. Create posts, generate images, adapt c...
  • ub2-csv-data-analyzer - A skill that enables Claw to load, explore, analyze, and visualize CSV datasets, providing statistical insights and a...
  • unsplash - Search, browse, and download high-quality free photos from Unsplash's library of millions of images.
  • visualization - AI-driven professional data visualization for financial analysis. Create stock charts, portfolio dashboards, and indu...
  • vtl-image-analysis - Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework. Detects defaul...
  • weimage - slug: weimage
  • x-founder-operations - Systematic X (Twitter) operations skill for founders, indie developers, and tech professionals. Implements a daily Pl...
  • x402-agentic-creation - Monetize your agent's API or tools using the x402 protocol and USDC micropayments. Enables provisioning, earnings tra...
  • xbird - Use when the user asks to tweet, post threads, read tweets, search Twitter/X, check mentions, manage engagement (like...
  • xiaohongshu-title - Maximize CTR (Click-Through Rate) by leveraging emotional hooks and platform algorithms.
  • xpr-creative - Creative deliverable tools for AI agents
  • youtube-thumbnail-generation - Generate click-worthy YouTube thumbnails with high CTR designs using each::sense API
  • zenmux-image-generation - Generate images via ZenMux API (Pro/Elite). Supports Text-to-Image, Image-to-Image, and Multi-Image reference fusion.
  • zerox - Convert documents (PDF, DOCX, PPTX, images, etc.) to Markdown using the zerox library. Use when the user needs to ext...
  • zhipu-cogview-image - Generate images using Zhipu AI's CogView model