Files
awesome-openclaw-skills/categories/speech-and-transcription.md
2026-02-28 14:59:16 +03:00

8.4 KiB

Speech & Transcription

← Back to main list

49 skills

  • addis-assistant-stt - Provides Speech-to-Text (STT) and text
  • agent-voice - Command-line blogging platform for AI agents.
  • akaunting - Interact with Akaunting open-source accounting software via REST API. Use for creating invoices, tracking income/expe...
  • alexa-cli - Control Amazon Alexa devices and smart home via the alexacli CLI. Use when a user asks to speak/announce on Echo de...
  • announcer - Announce text throughout the house via AirPlay speakers using Airfoil +.
  • assemblyai-transcribe - Transcribe audio/video with AssemblyAI
  • audio-gen - Generate audiobooks, podcasts, or educational audio content
  • audio-reply - Generate audio replies using TTS.
  • auto-whisper-safe - RAM-safe voice transcription with auto-chunking — works on 16GB machines without crashes
  • brw-de-ai-ify - Remove AI-generated jargon and restore human voice to text. Built from analyzing 1,000+ AI vs human content pieces.
  • chichi-speech - A RESTful service for high-quality text-to-speech using Qwen3
  • claw-voice - You are connected to a live user session via voice.
  • clonev - Clone any voice and generate speech using Coqui XTTS v2.
  • critical-article-writer - Generate draft articles, outlines
  • cult-of-carcinization - Give your agent a voice — and ears.
  • deepdub-tts - Generate speech audio using Deepdub and attach it as a MEDIA
  • deepgram - — command-line interface for Deepgram speech-to-text.
  • dellight-cro-revenue-ops - DELLIGHT.AI is an AI startup in DIFC, Dubai. Four products at various stages. The CRO's singular obsession: **generat...
  • documents-ai - Real-time OCR and data extraction API by Veryfi. Extract structured data from receipts, invoices, bank statements, W-...
  • doubao-api-open-tts - Text-to-Speech service using Doubao (Volcano Engine)
  • duby - Convert text to speech using Duby.so API.
  • eachlabs-voice-audio - TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.
  • easyverein-api - Work with the easyVerein v2.0 REST API
  • edge-tts - |.
  • elevenlabs-agents - Create, manage, and deploy ElevenLabs
  • elevenlabs-media - ElevenLabs music generation and speech-to-text...
  • elevenlabs-transcribe - Transcribe audio to text using ElevenLabs
  • elevenlabs-tts - ElevenLabs TTS - the best ElevenLabs integration for OpenClaw.
  • elevenlabs-voices - High-quality voice synthesis with 18 personas, 32
  • espeak-ng - TTS with espeak-ng
  • eternal-haven-lore-pack - Eternal Haven Chronicles lore + mythic persona pack. Use when the agent needs deep narrative context, character arcs,...
  • faster-whisper - Local speech-to-text using faster-whisper.
  • feishu-minutes - Fetch info, stats, transcript, and media from Feishu
  • freshbooks-cli - FreshBooks CLI for managing invoices, clients, and billing.
  • gettr-transcribe-summarize - Download audio from a GETTR post
  • hebrew-nikud - Hebrew nikud (vowel points) reference for AI agents. Correct nikud rules for verb conjugations (binyanim), dagesh, ge...
  • her-voice - Give your agent a voice. Use when the user wants the agent to speak, read aloud, or have voice responses.
  • inworld-tts - Text-to-speech via Inworld.ai API.
  • jarvis-voice - Metallic AI voice persona with TTS and visual transcript styling.
  • kokoro-tts - Generate spoken audio from text using the local Kokoro TTS engine.
  • lnbits - Manage LNbits Lightning Wallet (Balance, Pay, Invoice)
  • lnbits-with-qrcode - Manage LNbits Lightning Wallet (Balance, Pay, Invoice)
  • miranda-sag - ElevenLabs text-to-speech with mac-style say UX.
  • norman-categorize-transactions - Review and categorize uncategorized bank transactions, match them with invoices, and verify bookkeeping entries. Use ...
  • norman-monthly-reconciliation - Perform a complete monthly financial reconciliation - review all transactions, match invoices, check outstanding paym...
  • ressemble - Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.
  • siliconflow-tts-gen - Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
  • tg-voice-whisper - emoji: 🎙️🔊
  • voice-devotional - No description available.