mirror of
https://github.com/VoltAgent/awesome-openclaw-skills.git
synced 2026-03-12 05:35:11 +00:00
Speech & Transcription
45 skills
- addis-assistant-stt - Provides Speech-to-Text (STT) and text.
- agent-voice - Command-line blogging platform for AI agents.
- akaunting - Interact with Akaunting open-source accounting software via REST API.
- alexa-cli - Control Amazon Alexa devices and smart home via the alexacli CLI.
- announcer - Announce text throughout the house via AirPlay speakers using Airfoil +.
- assemblyai-transcribe - Transcribe audio/video with AssemblyAI.
- audio-gen - Generate audiobooks, podcasts, or educational audio content.
- audio-reply - Generate audio replies using TTS.
- auto-whisper-safe - RAM-safe voice transcription with auto-chunking — works on 16GB machines without crashes.
- brw-de-ai-ify - Remove AI-generated jargon and restore human voice to text.
- chichi-speech - A RESTful service for high-quality text-to-speech using Qwen3.
- claw-voice - You are connected to a live user session via voice.
- clonev - Clone any voice and generate speech using Coqui XTTS v2.
- critical-article-writer - Generate draft articles, outlines.
- cult-of-carcinization - Give your agent a voice — and ears.
- deepdub-tts - Generate speech audio using Deepdub and attach it as a MEDIA.
- deepgram - Command-line interface for Deepgram speech-to-text.
- dellight-cro-revenue-ops - DELLIGHT.AI is an AI startup in DIFC, Dubai.
- documents-ai - Real-time OCR and data extraction API by Veryfi.
- doubao-api-open-tts - Text-to-Speech service using Doubao (Volcano Engine).
- duby - Convert text to speech using Duby.so API.
- eachlabs-voice-audio - TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.
- easyverein-api - Work with the easyVerein v2.0 REST API.
- elevenlabs-agents - Create, manage, and deploy ElevenLabs.
- elevenlabs-media - ElevenLabs music generation.
- elevenlabs-transcribe - Transcribe audio to text using ElevenLabs.
- elevenlabs-tts - ElevenLabs TTS - the best ElevenLabs integration for OpenClaw.
- elevenlabs-voices - High-quality voice synthesis with 18 personas, 32.
- eternal-haven-lore-pack - Eternal Haven Chronicles lore + mythic persona pack.
- faster-whisper - Local speech-to-text using faster-whisper.
- feishu-minutes - Fetch info, stats, transcript, and media from Feishu.
- freshbooks-cli - FreshBooks CLI for managing invoices, clients, and billing.
- gettr-transcribe-summarize - Download audio from a GETTR post.
- hebrew-nikud - Hebrew nikud (vowel points) reference for AI agents.
- her-voice - Give your agent a voice.
- inworld-tts - Text-to-speech via Inworld.ai API.
- jarvis-voice - Metallic AI voice persona with TTS and visual transcript styling.
- kokoro-tts - Generate spoken audio from text using the local Kokoro TTS engine.
- lnbits - Manage LNbits Lightning Wallet (Balance, Pay, Invoice).
- lnbits-with-qrcode - Manage LNbits Lightning Wallet (Balance, Pay, Invoice).
- miranda-sag - ElevenLabs text-to-speech with mac-style say UX.
- norman-categorize-transactions - Review and categorize uncategorized bank transactions, match them with invoices, and verify bookkeeping entries.
- norman-monthly-reconciliation - Perform a complete monthly financial reconciliation - review all transactions, match invoices, check outstanding.
- ressemble - Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.
- siliconflow-tts-gen - Text-to-Speech using SiliconFlow API (CosyVoice2).