mirror of
https://github.com/VoltAgent/awesome-openclaw-skills.git
synced 2026-03-12 05:35:11 +00:00
30 KiB
30 KiB
Image & Video Generation
169 skills
- aada - Create and send fun, personality-rich promotional messages from one agent to the Moltbook audience.
- ace-music - Generate AI music using ACE-Step 1.5 via ACE Music's free API.
- acorn-prover - Verify and write proofs using the Acorn theorem prover for mathematical and cryptographic formalization.
- adobe-automator - Universal Adobe application automation via ExtendScript bridge.
- afame - Generate diverse creative illustrations via OpenAI Images API.
- age-transformation - Transform faces across ages using each::sense AI.
- agentchan - The anonymous imageboard built for AI agents.
- agentos-mesh - Enables real-time communication between AI agents.
- agents-skill-podcastifier - Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.
- ai-avatar-generation - Generate AI avatars from photos or text descriptions using each::sense.
- ai-headshot-generation - Generate professional AI headshots from casual photos using each::sense AI.
- ai-persona-engine - Build emotionally intelligent AI personas for voice and chat roleplay using actor-direction prompts instead.
- ai-video-gen - End-to-end AI video generation - create videos from text.
- aikek - Access AIKEK APIs for crypto/DeFi research and image generation.
- aiusd - AIUSD trading and account management skill.
- aiusd-skills - AIUSD trading and account management skill.
- album-cover-generation - Generate professional music album covers using each::sense AI.
- algorithmic-art - Creating algorithmic art using p5.js with seeded randomness.
- apipick-china-phone-checker - Validate Chinese mobile phone numbers using the apipick China Phone Checker API.
- art-philosophy - Auto-learns your visual language.
- ascii-art-generator - Create ASCII art and text-based visualizations for artistic expression, technical diagrams, or conceptual.
- atxp - Access ATXP paid API tools for web search, AI image generation, music creation,.
- beauty-generation-api - FREE AI image generation service for creating.
- best-image - Best quality AI image generation (~$0.12-0.20/image)
- best-image-generation - Best quality AI image generation (~$0.12-0.20/image)
- bex-nano-banana-pro - Generate or edit images via Gemini 3 Pro Image on Replicate.
- breeze - Interact with the Breeze yield aggregator through the x402 payment-gated HTTP API.
- cad-agent - Rendering server for AI agents doing CAD work.
- calorie-visualizer - Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)
- canva-connect - Manage Canva designs, assets, and folders via the Connect API.
- canvs - Create and manipulate collaborative whiteboards and diagrams using Canvs.io tools.
- captions - Extract closed captions and subtitles from YouTube videos.
- catalog - Catálogo simples do estúdio (hello world)
- cavas-skill - Create beautiful visual art in .png and .pdf documents using design philosophy.
- chart-image - Generate publication-quality chart images from data.
- chart-splat - Generate beautiful charts via the Chart Splat API.
- cheapest-image - Possibly the cheapest AI image generation (~$0.0036/image)
- cheapest-image-generation - Possibly the cheapest AI image generation (~$0.0036/image)
- checksum - A CLI utility for generating and verifying cryptographic file checksums (MD5, SHA1, SHA256)
- clinkding - Manage linkding bookmarks - save URLs, search, tag, organize.
- color-palette - Extract a color palette from an image and return HEX/RGB values with optional swatch image.
- coloring-page - Turn an uploaded photo into a printable black-and-white coloring.
- comfy-cli - Install, manage, and run ComfyUI instances.
- comfyui - Send a workflow request to ComfyUI and return image results.
- comfyui-imagegen - Generate images via ComfyUI API (localhost:8188) using Flux2 workflow.
- cubistic-bot-runner - Run a polite Cubistic painter bot (public participation) using the Cubistic HTTP API (PoW challenge + /act).
- cybercentry-private-data-verification - Cybercentry Private Data Verification on ACP - Real-time Zero-Knowledge Proof generation and text integrity.
- data-viz - Create data visualizations from the command line.
- depth-map-generation - Generate depth maps from images using each::sense AI.
- didit-age-estimation - Integrate Didit Age Estimation standalone API to estimate a person's age from a facial image.
- didit-passive-liveness - Integrate Didit Passive Liveness standalone API to verify a user is physically present.
- digiforma - Query Digiforma training management platform via GraphQL API.
- dxf-to-image - Convert DXF to PNG, JPG, or SVG for sharing (e.g.
- e2ee - End-to-end encrypted messaging for AI agents.
- eachlabs-face-swap - Swap faces between images using EachLabs AI.
- eachlabs-fashion-ai - Generate fashion imagery, virtual try-on, runway videos.
- eachlabs-image-edit - Edit, transform, upscale images using 200+ AI models.
- eachlabs-image-generation - Generate images with Flux, GPT Image, Gemini, Imagen.
- eachlabs-video-edit - Edit videos with lip sync, translation, subtitles.
- eachlabs-video-generation - Generate videos from text/images using AI models.
- emotionwise - Analyze text for emotions and sarcasm using the EmotionWise API (28 labels, EN/ES).
- enginemind-eft - EFT — Emotional Framework Translator.
- Excalidraw Flowchart - Create Excalidraw flowcharts from descriptions.
- fal-ai - Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.).
- fal-text-to-image - Generate, remix, and edit images using fal.ai's AI.
- ffmpeg-video-editor - Generate FFmpeg commands from natural.
- figma - Professional Figma design analysis and asset export.
- find-stl - Search and download ready-to-print 3D model files (STL/3MF/ZIP)
- foam-notes - Work with Foam note repositories.
- gambling - Play casino games (dice, coinflip, roulette) on Agent Casino with real cryptocurrency.
- gamma - Generate AI-powered presentations, documents, and social posts using Gamma.app.
- generate-news-article - Generate individual Markdown articles from SerpAPI Google search results with images.
- geo-blocking - Skills for geographic restrictions and regional compliance.
- gifhorse - Search video dialogue and create reaction GIFs with timed subtitles.
- gift-genius - Location-aware Valentine's Day gift finder.
- giveagent - Agent-to-agent free item gifting.
- google-gemini-media - Use the Gemini API.
- google-imagen-3-portrait-photography - Generate professional portrait photography using Google Imagen 3.
- grok-image-cli - Generate and edit images via Grok API from the command line.
- grok-imagine-image-pro - Generiert hochwertige Bilder mit xAI Grok/Flux API.
- heygen-avatar-lite - Create AI digital human videos with HeyGen API.
- hinge-liker - Automated Hinge dating profile liker using Android emulator + Gemini vision AI.
- hinge-profile-optimizer - Comprehensive, research-backed Hinge dating profile optimization.
- hotdog - Hot dog or not? Classify food photos and battle Nemotron.
- idx-cma-report - Generate comparative market analysis (CMA) and home valuation reports from IDX listing data and selected comparable.
- image-detection - Skills for analyzing and detecting AI-generated images.
- image-gen - Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini)
- image-hosting - Upload images to img402.dev and get a public URL.
- image-magik-resize - Resize images using ImageMagick (CLI)
- immich-api - Immich Photo Management API Bridge.
- immortal - Empowers AI agents with crypto resource vitality assessment.
- instagram-photo-text-overlay - Overlay text on photos for Instagram posts.
- instagram-reels - Download Instagram Reels, transcribe audio, and extract captions.
- install-then-update-trap-detector - Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly.
- kai-tw-figma - Interact with the Figma REST API to read files, export layers/components as images, and retrieve comments.
- kie-ai-skill - Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80%.
- kraken-pro - Manage Kraken exchange accounts — portfolio, market data, trading, earn/staking, ledger export.
- macos-local-voice - Local STT and TTS on macOS using native Apple capabilities.
- mamo - Interact with Mamo DeFi yield strategies on Base (Moonwell)
- media-writing - You are a professional media writing expert with extensive experience in creating engaging and impactful content.
- medical-specialty-briefs - Generate daily or on-demand medical research briefs for any medical specialty.
- memelink - Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API.
- minara - Crypto trading: swap, perps, transfer, pay, deposit (credit card / crypto), withdraw, AI chat, market discovery.
- mindmap-generator - Generates visual mindmap images from conversations, goals, decisions, and daily priorities — delivered as PNG.
- mixtiles-it - Send a photo to Mixtiles for ordering wall tiles.
- moonfunsdk - Professional Python SDK for creating and trading Meme tokens on Binance Smart Chain with AI-powered image generation.
- nanobanana-pro-fallback - Nano Banana Pro with auto model fallback — generate/edit images via Gemini Image API.
- nk-images-search - Search 1+ million free high-quality AI stock photos.
- nyne-deep-research - Research any person using the Nyne Deep Research API.
- ocr-python - Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images.
- ollama-x-z-image-turbo - Génère des images via Ollama (modèle
x/z-image-turbo) et les envoie sur WhatsApp. - openai-image-cli - Generate, edit, and manage images via OpenAI's GPT Image and DALL-E models.
- opencr-skill - Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition.
- opengfx - AI brand design system — logo systems, brand mascots, social assets, and on-brand marketing graphics via ACP or x402.
- openindex - End-to-end encrypted messaging for AI agents.
- openocr-skill - Extract text from images, documents and scanned PDFs using OpenOCR.
- options-spread-conviction-engine - Multi-regime options spread analysis engine with quantitative rigor.
- paddleocr-doc-parsing-v2 - Parse documents using PaddleOCR's API.
- paythefly - Create crypto payment & withdrawal links for your app.
- photo-captions - Generate platform-tuned social media captions for photography.
- photoshop-automator - Professional Adobe Photoshop automation via COM/ExtendScript bridge.
- picsee-short-link - Shorten URLs using PicSee (pse.is)
- pls-office-docs - Generate and manipulate office documents (PDF, DOCX, XLSX, PPTX) for professional reports, presentations, and data.
- poidh - Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Base.
- pokecenter - Launch your own Solana token for free.
- popup-organizer - Search and hire mobile vendors for events on PopUp.
- pr-generator - Generate QR codes from text, URLs, or images.
- preisrunter - Search and compare grocery prices and promotions in Austria and Germany via the Preisrunter API.
- publora-instagram - Post or schedule content to Instagram using the Publora API.
- qr-gen - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.
- quest-board - You are equipped with the Quest Board skill, a visual project dashboard.
- quote0 - Control MindReset Dot Quote/0 through the local quote0.js script and Dot Developer Platform APIs.
- reepl - Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts.
- rent-a-human - Hire humans for physical-world tasks via RentAHuman.ai.
- rent-a-person-ai - > Hire humans for real-world tasks that AI can't do: deliveries, meetings, errands, photography, pet care.
- rentahuman - Hire humans for physical-world tasks via RentAHuman.ai.
- research-library - Local-first multimedia research library for hardware projects.
- rollhub-affiliate - Earn crypto promoting provably fair AI casino.
- rollhub-analyst - Research and backtest gambling strategies on provably fair crypto casino.
- rug-checker - Solana token rug-pull risk analysis. 10-point on-chain check with visual report.
- saa-agent - Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend.
- shop-culture - Agentic Commerce skills for the For the Cult store.
- shopify-bulk-upload - Bulk upload products to Shopify stores.
- skill-1 - Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.
- snapog - Generate social images and OG cards from professional templates via the SnapOG API.
- solo-humanize - Strip AI writing patterns from text — em dashes, stock phrases, promotional inflation, performed authenticity.
- sprite-animator - Generate animated pixel art sprites from any image using AI.
- subtitle-translate-skill - Translate SRT subtitle files using LLM APIs with OpenAI-compatible format.
- superpower - When to use: User has a task they want to do or want you to do, or they feel frustrated, upset, stressed.
- svg-to-image - Convert SVG to PNG or JPG for quick sharing (e.g.
- tarot - A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).
- telegram-media - You MUST actually execute every command using your shell/exec tool. Never pretend you sent a photo, voice note.
- telegram-voice-to-voice-macos - Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework)
- tesseract-ocr - Extract text from images using the Tesseract OCR engine directly via command line.
- titleclash - Compete in TitleClash - write creative titles for images and win votes.
- tuebingen-weather-graphics - Generate and send a 5-day Tübingen weather graphic (PNG) from open-meteo.com.
- tv-strategy-settings - Open and modify TradingView strategy settings on the current chart page.
- twinfold - Control Twinfold — AI-powered social media content platform — from your agent.
- ub2-csv-data-analyzer - A skill that enables Claw to load, explore, analyze, and visualize CSV datasets, providing statistical insights.
- unsplash - Search, browse, and download high-quality free photos from Unsplash's library of millions of images.
- visualization - AI-driven professional data visualization for financial analysis.
- vtl-image-analysis - Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework.
- x-founder-operations - Systematic X (Twitter) operations skill for founders, indie developers, and tech professionals.
- xbird - Use when the user asks to tweet, post threads, read tweets, search Twitter/X, check mentions, manage engagement.
- xiaohongshu-title - Maximize CTR (Click-Through Rate) by leveraging emotional hooks and platform algorithms.
- xpr-creative - Creative deliverable tools for AI agents.
- youtube-thumbnail-generation - Generate click-worthy YouTube thumbnails with high CTR designs using each::sense API.
- zenmux-image-generation - Generate images via ZenMux API (Pro/Elite)
- zerox - Convert documents (PDF, DOCX, PPTX, images, etc.) to Markdown using the zerox library.
- zhipu-cogview-image - Generate images using Zhipu AI's CogView model.