🤖

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

下载5.0k

星标6

版本1.0.0

开发工具

安全通过

💬Prompt

在 App 中使用在 ClawHub 查看 ↗

技能说明

name: elevenlabs-stt description: Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2). homepage: https://elevenlabs.io/speech-to-text metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":["curl"],"env":["ELEVENLABS_API_KEY"]},"primaryEnv":"ELEVENLABS_API_KEY"}}

ElevenLabs Speech-to-Text

Transcribe audio files using ElevenLabs' Scribe v2 model. Supports 90+ languages with speaker diarization.

Quick Start

# Basic transcription
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3

# With speaker diarization
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --diarize

# Specify language (improves accuracy)
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --lang en

# Full JSON output with timestamps
{baseDir}/scripts/transcribe.sh /path/to/audio.mp3 --json

Options

Flag	Description
`--diarize`	Identify different speakers
`--lang CODE`	ISO language code (e.g., en, pt, es)
`--json`	Output full JSON with word timestamps
`--events`	Tag audio events (laughter, music, etc.)

Supported Formats

All major audio/video formats: mp3, m4a, wav, ogg, webm, mp4, etc.

API Key

Set ELEVENLABS_API_KEY environment variable, or configure in clawdbot.json:

{
  skills: {
    entries: {
      "elevenlabs-stt": {
        apiKey: "sk_..."
      }
    }
  }
}

Examples

# Transcribe a WhatsApp voice note
{baseDir}/scripts/transcribe.sh ~/Downloads/voice_note.ogg

# Meeting recording with multiple speakers
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize --lang en

# Get JSON for processing
{baseDir}/scripts/transcribe.sh podcast.mp3 --json > transcript.json

如何使用「ElevenLabs Speech-to-Text」？

打开小龙虾AI（Web 或 iOS App）
点击上方「立即使用」按钮，或在对话框中输入任务描述
小龙虾AI 会自动匹配并调用「ElevenLabs Speech-to-Text」技能完成任务
结果即时呈现，支持继续对话优化

ElevenLabs Speech-to-Text

技能说明

name: elevenlabs-stt description: Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2). homepage: https://elevenlabs.io/speech-to-text metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":["curl"],"env":["ELEVENLABS_API_KEY"]},"primaryEnv":"ELEVENLABS_API_KEY"}}

ElevenLabs Speech-to-Text

Quick Start

Options

Supported Formats

API Key

Examples

如何使用「ElevenLabs Speech-to-Text」？

相关技能