Files
awesome-openclaw-skills/categories/image-and-video-generation.md
2026-02-28 16:08:15 +03:00

30 KiB

Image & Video Generation

← Back to main list

169 skills

  • aada - Create and send fun, personality-rich promotional messages from one agent to the Moltbook audience.
  • ace-music - Generate AI music using ACE-Step 1.5 via ACE Music's free API.
  • acorn-prover - Verify and write proofs using the Acorn theorem prover for mathematical and cryptographic formalization.
  • adobe-automator - Universal Adobe application automation via ExtendScript bridge.
  • afame - Generate diverse creative illustrations via OpenAI Images API.
  • age-transformation - Transform faces across ages using each::sense AI.
  • agentchan - The anonymous imageboard built for AI agents.
  • agentos-mesh - Enables real-time communication between AI agents.
  • agents-skill-podcastifier - Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.
  • ai-avatar-generation - Generate AI avatars from photos or text descriptions using each::sense.
  • ai-headshot-generation - Generate professional AI headshots from casual photos using each::sense AI.
  • ai-persona-engine - Build emotionally intelligent AI personas for voice and chat roleplay using actor-direction prompts instead.
  • ai-video-gen - End-to-end AI video generation - create videos from text.
  • aikek - Access AIKEK APIs for crypto/DeFi research and image generation.
  • aiusd - AIUSD trading and account management skill.
  • aiusd-skills - AIUSD trading and account management skill.
  • album-cover-generation - Generate professional music album covers using each::sense AI.
  • algorithmic-art - Creating algorithmic art using p5.js with seeded randomness.
  • apipick-china-phone-checker - Validate Chinese mobile phone numbers using the apipick China Phone Checker API.
  • art-philosophy - Auto-learns your visual language.
  • ascii-art-generator - Create ASCII art and text-based visualizations for artistic expression, technical diagrams, or conceptual.
  • atxp - Access ATXP paid API tools for web search, AI image generation, music creation,.
  • beauty-generation-api - FREE AI image generation service for creating.
  • best-image - Best quality AI image generation (~$0.12-0.20/image)
  • best-image-generation - Best quality AI image generation (~$0.12-0.20/image)
  • bex-nano-banana-pro - Generate or edit images via Gemini 3 Pro Image on Replicate.
  • breeze - Interact with the Breeze yield aggregator through the x402 payment-gated HTTP API.
  • cad-agent - Rendering server for AI agents doing CAD work.
  • calorie-visualizer - Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)
  • canva-connect - Manage Canva designs, assets, and folders via the Connect API.
  • canvs - Create and manipulate collaborative whiteboards and diagrams using Canvs.io tools.
  • captions - Extract closed captions and subtitles from YouTube videos.
  • catalog - Catálogo simples do estúdio (hello world)
  • cavas-skill - Create beautiful visual art in .png and .pdf documents using design philosophy.
  • chart-image - Generate publication-quality chart images from data.
  • chart-splat - Generate beautiful charts via the Chart Splat API.
  • cheapest-image - Possibly the cheapest AI image generation (~$0.0036/image)
  • cheapest-image-generation - Possibly the cheapest AI image generation (~$0.0036/image)
  • checksum - A CLI utility for generating and verifying cryptographic file checksums (MD5, SHA1, SHA256)
  • clinkding - Manage linkding bookmarks - save URLs, search, tag, organize.
  • color-palette - Extract a color palette from an image and return HEX/RGB values with optional swatch image.
  • coloring-page - Turn an uploaded photo into a printable black-and-white coloring.
  • comfy-cli - Install, manage, and run ComfyUI instances.
  • comfyui - Send a workflow request to ComfyUI and return image results.
  • comfyui-imagegen - Generate images via ComfyUI API (localhost:8188) using Flux2 workflow.
  • cubistic-bot-runner - Run a polite Cubistic painter bot (public participation) using the Cubistic HTTP API (PoW challenge + /act).
  • cybercentry-private-data-verification - Cybercentry Private Data Verification on ACP - Real-time Zero-Knowledge Proof generation and text integrity.
  • data-viz - Create data visualizations from the command line.
  • depth-map-generation - Generate depth maps from images using each::sense AI.
  • didit-age-estimation - Integrate Didit Age Estimation standalone API to estimate a person's age from a facial image.
  • didit-passive-liveness - Integrate Didit Passive Liveness standalone API to verify a user is physically present.
  • digiforma - Query Digiforma training management platform via GraphQL API.
  • dxf-to-image - Convert DXF to PNG, JPG, or SVG for sharing (e.g.
  • e2ee - End-to-end encrypted messaging for AI agents.
  • eachlabs-face-swap - Swap faces between images using EachLabs AI.
  • eachlabs-fashion-ai - Generate fashion imagery, virtual try-on, runway videos.
  • eachlabs-image-edit - Edit, transform, upscale images using 200+ AI models.
  • eachlabs-image-generation - Generate images with Flux, GPT Image, Gemini, Imagen.
  • eachlabs-video-edit - Edit videos with lip sync, translation, subtitles.
  • eachlabs-video-generation - Generate videos from text/images using AI models.
  • emotionwise - Analyze text for emotions and sarcasm using the EmotionWise API (28 labels, EN/ES).
  • enginemind-eft - EFT — Emotional Framework Translator.
  • Excalidraw Flowchart - Create Excalidraw flowcharts from descriptions.
  • fal-ai - Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.).
  • fal-text-to-image - Generate, remix, and edit images using fal.ai's AI.
  • ffmpeg-video-editor - Generate FFmpeg commands from natural.
  • figma - Professional Figma design analysis and asset export.
  • find-stl - Search and download ready-to-print 3D model files (STL/3MF/ZIP)
  • foam-notes - Work with Foam note repositories.
  • gambling - Play casino games (dice, coinflip, roulette) on Agent Casino with real cryptocurrency.
  • gamma - Generate AI-powered presentations, documents, and social posts using Gamma.app.
  • generate-news-article - Generate individual Markdown articles from SerpAPI Google search results with images.
  • geo-blocking - Skills for geographic restrictions and regional compliance.
  • gifhorse - Search video dialogue and create reaction GIFs with timed subtitles.
  • gift-genius - Location-aware Valentine's Day gift finder.
  • giveagent - Agent-to-agent free item gifting.
  • google-gemini-media - Use the Gemini API.
  • google-imagen-3-portrait-photography - Generate professional portrait photography using Google Imagen 3.
  • grok-image-cli - Generate and edit images via Grok API from the command line.
  • grok-imagine-image-pro - Generiert hochwertige Bilder mit xAI Grok/Flux API.
  • heygen-avatar-lite - Create AI digital human videos with HeyGen API.
  • hinge-liker - Automated Hinge dating profile liker using Android emulator + Gemini vision AI.
  • hinge-profile-optimizer - Comprehensive, research-backed Hinge dating profile optimization.
  • hotdog - Hot dog or not? Classify food photos and battle Nemotron.
  • idx-cma-report - Generate comparative market analysis (CMA) and home valuation reports from IDX listing data and selected comparable.
  • image-detection - Skills for analyzing and detecting AI-generated images.
  • image-gen - Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini)
  • image-hosting - Upload images to img402.dev and get a public URL.
  • image-magik-resize - Resize images using ImageMagick (CLI)
  • immich-api - Immich Photo Management API Bridge.
  • immortal - Empowers AI agents with crypto resource vitality assessment.
  • instagram-photo-text-overlay - Overlay text on photos for Instagram posts.
  • instagram-reels - Download Instagram Reels, transcribe audio, and extract captions.
  • install-then-update-trap-detector - Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly.
  • kai-tw-figma - Interact with the Figma REST API to read files, export layers/components as images, and retrieve comments.
  • kie-ai-skill - Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80%.
  • kraken-pro - Manage Kraken exchange accounts — portfolio, market data, trading, earn/staking, ledger export.
  • macos-local-voice - Local STT and TTS on macOS using native Apple capabilities.
  • mamo - Interact with Mamo DeFi yield strategies on Base (Moonwell)
  • media-writing - You are a professional media writing expert with extensive experience in creating engaging and impactful content.
  • medical-specialty-briefs - Generate daily or on-demand medical research briefs for any medical specialty.
  • memelink - Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API.
  • minara - Crypto trading: swap, perps, transfer, pay, deposit (credit card / crypto), withdraw, AI chat, market discovery.
  • mindmap-generator - Generates visual mindmap images from conversations, goals, decisions, and daily priorities — delivered as PNG.
  • mixtiles-it - Send a photo to Mixtiles for ordering wall tiles.
  • moonfunsdk - Professional Python SDK for creating and trading Meme tokens on Binance Smart Chain with AI-powered image generation.
  • nanobanana-pro-fallback - Nano Banana Pro with auto model fallback — generate/edit images via Gemini Image API.
  • nk-images-search - Search 1+ million free high-quality AI stock photos.
  • nyne-deep-research - Research any person using the Nyne Deep Research API.
  • ocr-python - Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images.
  • ollama-x-z-image-turbo - Génère des images via Ollama (modèle x/z-image-turbo) et les envoie sur WhatsApp.
  • openai-image-cli - Generate, edit, and manage images via OpenAI's GPT Image and DALL-E models.
  • opencr-skill - Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition.
  • opengfx - AI brand design system — logo systems, brand mascots, social assets, and on-brand marketing graphics via ACP or x402.
  • openindex - End-to-end encrypted messaging for AI agents.
  • openocr-skill - Extract text from images, documents and scanned PDFs using OpenOCR.
  • options-spread-conviction-engine - Multi-regime options spread analysis engine with quantitative rigor.
  • paddleocr-doc-parsing-v2 - Parse documents using PaddleOCR's API.
  • paythefly - Create crypto payment & withdrawal links for your app.
  • photo-captions - Generate platform-tuned social media captions for photography.
  • photoshop-automator - Professional Adobe Photoshop automation via COM/ExtendScript bridge.
  • picsee-short-link - Shorten URLs using PicSee (pse.is)
  • pls-office-docs - Generate and manipulate office documents (PDF, DOCX, XLSX, PPTX) for professional reports, presentations, and data.
  • poidh - Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Base.
  • pokecenter - Launch your own Solana token for free.
  • popup-organizer - Search and hire mobile vendors for events on PopUp.
  • pr-generator - Generate QR codes from text, URLs, or images.
  • preisrunter - Search and compare grocery prices and promotions in Austria and Germany via the Preisrunter API.
  • publora-instagram - Post or schedule content to Instagram using the Publora API.
  • qr-gen - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.
  • quest-board - You are equipped with the Quest Board skill, a visual project dashboard.
  • quote0 - Control MindReset Dot Quote/0 through the local quote0.js script and Dot Developer Platform APIs.
  • reepl - Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts.
  • rent-a-human - Hire humans for physical-world tasks via RentAHuman.ai.
  • rent-a-person-ai - > Hire humans for real-world tasks that AI can't do: deliveries, meetings, errands, photography, pet care.
  • rentahuman - Hire humans for physical-world tasks via RentAHuman.ai.
  • research-library - Local-first multimedia research library for hardware projects.
  • rollhub-affiliate - Earn crypto promoting provably fair AI casino.
  • rollhub-analyst - Research and backtest gambling strategies on provably fair crypto casino.
  • rug-checker - Solana token rug-pull risk analysis. 10-point on-chain check with visual report.
  • saa-agent - Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend.
  • shop-culture - Agentic Commerce skills for the For the Cult store.
  • shopify-bulk-upload - Bulk upload products to Shopify stores.
  • skill-1 - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.
  • snapog - Generate social images and OG cards from professional templates via the SnapOG API.
  • solo-humanize - Strip AI writing patterns from text — em dashes, stock phrases, promotional inflation, performed authenticity.
  • sprite-animator - Generate animated pixel art sprites from any image using AI.
  • subtitle-translate-skill - Translate SRT subtitle files using LLM APIs with OpenAI-compatible format.
  • superpower - When to use: User has a task they want to do or want you to do, or they feel frustrated, upset, stressed.
  • svg-to-image - Convert SVG to PNG or JPG for quick sharing (e.g.
  • tarot - A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).
  • telegram-media - You MUST actually execute every command using your shell/exec tool. Never pretend you sent a photo, voice note.
  • telegram-voice-to-voice-macos - Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework)
  • tesseract-ocr - Extract text from images using the Tesseract OCR engine directly via command line.
  • titleclash - Compete in TitleClash - write creative titles for images and win votes.
  • tuebingen-weather-graphics - Generate and send a 5-day Tübingen weather graphic (PNG) from open-meteo.com.
  • tv-strategy-settings - Open and modify TradingView strategy settings on the current chart page.
  • twinfold - Control Twinfold — AI-powered social media content platform — from your agent.
  • ub2-csv-data-analyzer - A skill that enables Claw to load, explore, analyze, and visualize CSV datasets, providing statistical insights.
  • unsplash - Search, browse, and download high-quality free photos from Unsplash's library of millions of images.
  • visualization - AI-driven professional data visualization for financial analysis.
  • vtl-image-analysis - Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework.
  • x-founder-operations - Systematic X (Twitter) operations skill for founders, indie developers, and tech professionals.
  • xbird - Use when the user asks to tweet, post threads, read tweets, search Twitter/X, check mentions, manage engagement.
  • xiaohongshu-title - Maximize CTR (Click-Through Rate) by leveraging emotional hooks and platform algorithms.
  • xpr-creative - Creative deliverable tools for AI agents.
  • youtube-thumbnail-generation - Generate click-worthy YouTube thumbnails with high CTR designs using each::sense API.
  • zenmux-image-generation - Generate images via ZenMux API (Pro/Elite)
  • zerox - Convert documents (PDF, DOCX, PPTX, images, etc.) to Markdown using the zerox library.
  • zhipu-cogview-image - Generate images using Zhipu AI's CogView model.