Gemini Assistant
name: gemini-assistant
description: General-purpose AI assistant using Gemini API with voice and text support. Use when you need a smart AI assistant that can answer questions, have conversations, or help with general tasks using Google's Gemini models with audio/text capabilities.
metadata:
  openclaw:
    emoji: "🤖"
Gemini Assistant
A general-purpose AI assistant powered by Google's Gemini API. Supports both text and voice interactions.
Usage
Text Mode
cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py "Your question or message"
Voice Mode
cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py --audio /path/to/audio.ogg "optional context"
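The two commands above can also be driven from Python. The following is a minimal sketch, not part of the skill itself: `build_command` and `run_handler` are hypothetical helper names, and it assumes that `handler.py` writes its result to standard output.

```python
import subprocess
from pathlib import Path
from typing import List, Optional

# Skill directory taken from the commands above.
SKILL_DIR = Path.home() / ".openclaw/agents/kashif/skills/gemini-assistant"


def build_command(message: str = "", audio: Optional[str] = None) -> List[str]:
    """Build the argument list for text mode or voice mode (hypothetical helper)."""
    cmd = ["python3", "handler.py"]
    if audio is not None:
        # Voice mode: pass the audio file, then the optional context message.
        cmd += ["--audio", audio]
    if message:
        cmd.append(message)
    return cmd


def run_handler(message: str = "", audio: Optional[str] = None) -> str:
    """Run handler.py inside the skill directory and return its stdout.

    Assumption: the handler prints its response to standard output.
    """
    result = subprocess.run(
        build_command(message, audio),
        cwd=SKILL_DIR,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Passing the arguments as a list avoids shell quoting problems when the message contains spaces or quotes.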
Response Format
The handler returns a JSON response:
{
  "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg",
  "text": "Text response from Gemini"
}
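A caller will usually want the text and the audio file path as separate values. The sketch below shows one way to extract them, based only on the example response above. `parse_response` is a hypothetical helper, and it assumes the audio path always appears on a line starting with `MEDIA:` and that a text-only response simply has no such line.

```python
import json
from typing import Optional, Tuple

MEDIA_PREFIX = "MEDIA:"


def parse_response(raw: str) -> Tuple[str, Optional[str]]:
    """Return (text, media_path) from the handler's JSON output.

    media_path is None when no line of "message" starts with MEDIA:.
    """
    data = json.loads(raw)
    text = data.get("text", "")
    media_path = None
    for line in data.get("message", "").splitlines():
        if line.startswith(MEDIA_PREFIX):
            media_path = line[len(MEDIA_PREFIX):].strip()
            break
    return text, media_path


# Example input mirroring the response format shown above.
example = json.dumps({
    "message": "[[audio_as_voice]]\nMEDIA:/tmp/gemini_voice_xxx.ogg",
    "text": "Text response from Gemini",
})
text, media = parse_response(example)
print(text)
print(media)
```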
Configuration
Set your Gemini API key:
export GEMINI_API_KEY="your-api-key-here"
Or create a .env file in the skill directory:
GEMINI_API_KEY=your-api-key-here
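For illustration, a key lookup covering both options could look like the sketch below. This is not the code in `handler.py`: `load_api_key` is a hypothetical helper, and giving the environment variable precedence over the `.env` file is an assumption.

```python
import os
from pathlib import Path
from typing import Optional


def load_api_key(env_file: str = ".env") -> Optional[str]:
    """Return GEMINI_API_KEY from the environment, else from a .env file.

    Assumption: the environment variable wins when both are set.
    """
    key = os.environ.get("GEMINI_API_KEY")
    if key:
        return key
    path = Path(env_file)
    if not path.is_file():
        return None
    for line in path.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        name, _, value = line.partition("=")
        if name.strip() == "GEMINI_API_KEY":
            # Drop optional surrounding quotes; treat an empty value as missing.
            return value.strip().strip('"').strip("'") or None
    return None
```

Returning `None` instead of raising lets the caller decide how to report a missing key.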
Model Options
The default model is gemini-2.5-flash-native-audio-preview-12-2025 for audio support.
To use a different model, edit handler.py:
MODEL = "gemini-2.0-flash-exp" # For text-only
Requirements
- google-genai>=1.0.0
- numpy>=1.24.0
- soundfile>=0.12.0
- librosa>=0.10.0 (for audio input)
- FFmpeg (for audio conversion)
Features
- 🎙️ Voice input/output support
- 💬 Text conversations
- 🔧 Configurable system instructions
- ⚡ Fast responses with Gemini Flash
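How do I use "Gemini Assistant"?
- Open 小龙虾AI (Web or iOS App)
- Click the "Use Now" button, or type a task description into the chat box
- 小龙虾AI automatically matches and invokes the "Gemini Assistant" skill to complete the task
- Results appear immediately, and you can continue the conversation to refine them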