Walkie-Talkie Mode
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
Skill Description
name: walkie-talkie
description: Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
Walkie-Talkie Mode
This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.
Workflow
- Incoming Audio: When a user sends an OGG/Opus audio file:
  - Use tools/transcribe_voice.sh to get the text.
  - Process the text as a normal user prompt.
- Outgoing Response:
  - Generate speech for the reply using bin/sherpa-onnx-tts. The text reply is still sent alongside the audio (see Constraints).
  - Send the resulting .ogg file back to the user as a voice note.
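The two workflow steps can be sketched as a single shell function. This is a minimal sketch under stated assumptions, not the skill's actual implementation: it assumes tools/transcribe_voice.sh takes the audio path as its only argument and prints the transcript to stdout, and generate_reply is a hypothetical placeholder standing in for normal prompt handling. The bin/sherpa-onnx-tts argument order (output file, then text) follows the manual-execution command in this document.

```shell
#!/bin/sh
# Tool paths come from the skill text; both can be overridden via environment.
TRANSCRIBE="${TRANSCRIBE:-tools/transcribe_voice.sh}"  # assumed: <audio file> -> transcript on stdout
TTS="${TTS:-bin/sherpa-onnx-tts}"                      # per this skill: <out.ogg> "<text>"

# Hypothetical placeholder: the real skill would hand the transcript to the
# assistant as a normal user prompt and use its answer as the reply.
generate_reply() {
    printf 'You said: %s' "$1"
}

# handle_voice_turn IN_AUDIO OUT_AUDIO
# Transcribes IN_AUDIO, builds a reply, synthesizes it into OUT_AUDIO,
# and prints the reply text so the caller can send text and audio together.
handle_voice_turn() {
    in_audio=$1
    out_audio=$2
    transcript=$("$TRANSCRIBE" "$in_audio") || return 1
    reply=$(generate_reply "$transcript") || return 1
    "$TTS" "$out_audio" "$reply" || return 1
    printf '%s\n' "$reply"
}
```

Calling handle_voice_turn /tmp/in.ogg /tmp/reply.ogg would print the reply text and leave the synthesized audio in /tmp/reply.ogg, ready to be sent as a voice note.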
Triggers
- User sends an audio message.
- User says "activa modo walkie-talkie" ("activate walkie-talkie mode") or "hablemos por voz" ("let's talk by voice").
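The trigger rules above can be sketched as a small check. This is an illustration only: the function name is_walkie_trigger and the idea of passing the message's MIME type as the first argument are assumptions, since the skill does not say how the message metadata is exposed.

```shell
#!/bin/sh
# is_walkie_trigger MIME_TYPE TEXT
# Returns 0 (true) when the message is audio or contains a trigger phrase.
is_walkie_trigger() {
    # Any audio message activates the mode.
    case $1 in
        audio/*) return 0 ;;
    esac
    # Lowercase the text so the phrase match ignores ASCII case.
    text=$(printf '%s' "$2" | tr '[:upper:]' '[:lower:]')
    case $text in
        *"activa modo walkie-talkie"*|*"hablemos por voz"*) return 0 ;;
    esac
    return 1
}
```

For example, is_walkie_trigger "text/plain" "Oye, hablemos por voz" succeeds, while a plain text message without either phrase does not.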
Constraints
- Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
- Maintain a fast response time: keep the real-time factor (RTF, processing time divided by audio duration) below 0.5.
- Always reply with BOTH text (for clarity) and audio.
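The RTF constraint is simple arithmetic, so it can be checked after each turn. The helper names below are hypothetical, and how the processing time and audio duration are measured is left to the caller; this is a sketch of the calculation, not part of the skill.

```shell
#!/bin/sh
# compute_rtf PROCESSING_SECONDS AUDIO_SECONDS
# Prints the real-time factor (processing time / audio duration), two decimals.
compute_rtf() {
    awk -v p="$1" -v d="$2" 'BEGIN { if (d <= 0) exit 1; printf "%.2f\n", p / d }'
}

# rtf_within_budget PROCESSING_SECONDS AUDIO_SECONDS
# Succeeds only when the RTF is below the 0.5 limit stated in Constraints.
rtf_within_budget() {
    awk -v p="$1" -v d="$2" 'BEGIN { exit !(d > 0 && p / d < 0.5) }'
}

# Example: 1.2 s of processing for a 4.0 s clip gives an RTF of 0.30.
compute_rtf 1.2 4.0
```

An RTF of 0.30 means processing took 30% of the clip's length, which is inside the budget; 3.0 s of processing for the same 4.0 s clip (RTF 0.75) would not be.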
Manual Execution (Internal)
To respond with voice manually:
bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"
Then send /tmp/reply.ogg with the message tool, passing the path via filePath.
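How do I use "Walkie-Talkie Mode"?
- Open 小龙虾AI (web or iOS app).
- Click the "Use Now" (「立即使用」) button above, or type a task description into the chat box.
- 小龙虾AI automatically matches and invokes the "Walkie-Talkie Mode" skill to complete the task.
- Results appear immediately, and you can continue the conversation to refine them.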