🤖

NVIDIA Kimi Vision

Analyze images using NVIDIA Kimi K2.5 vision model via NVIDIA NIM API. Perfect for adding vision to non-vision models like MiniMax M2.5, GLM-5, or any model...

下载366

星标0

版本1.0.3

安全通过

⚙️脚本

在 App 中使用在 ClawHub 查看 ↗

技能说明

name: nvidia-kimi-vision description: Analyze images using NVIDIA Kimi K2.5 vision model via NVIDIA NIM API. Perfect for adding vision to non-vision models like MiniMax M2.5, GLM-5, or any model without native image support. Supports png, jpg, jpeg, webp. metadata: { "openclaw": { "requires": { "bins": ["python3"] }, "install": [ { "id": "python", "kind": "system", "label": "Install Python dependencies (requests)", }, ], }, }

NVIDIA Kimi Vision

Fast image analysis using Kimi K2.5 multimodal model from NVIDIA NIM.

Why This Skill?

Fast - NVIDIA NIM inference
Quality - Kimi K2.5 is a solid vision model
Simple - Just pass an image and prompt
Free tier - Available through NVIDIA build.nvidia.com

API Setup (IMPORTANT)

When using this skill, if no API key is found, it will automatically guide the user through setup:

Step 1: Get a Free API Key

Go to https://build.nvidia.com
Sign up / Log in with GitHub or Google
Search for "Kimi K2.5"
Click on the model and get your free API key

Step 2: Save the Key

# Option A: Save to file (recommended)
mkdir -p ~/.config
echo 'your-api-key-here' > ~/.config/nvidia-kimi-api-key

# Option B: Pass directly when running
python3 scripts/analyze_image.py photo.jpg "What's this?" sk-your-key-here

First Time Setup (for agents)

When a user tries to use this skill without an API key, the script will output clear setup instructions. Guide them through:

Visiting https://build.nvidia.com
Getting their free API key
Saving it to ~/.config/nvidia-kimi-api-key

Usage

python3 scripts/analyze_image.py <image_path> "<prompt>" [api_key]

Examples

# What's in this image?
python3 scripts/analyze_image.py "/path/to/image.jpg" "Describe what's in this image"

# Extract text from screenshot
python3 scripts/analyze_image.py "/path/screenshot.png" "Extract all text"

# Analyze a meme
python3 scripts/analyze_image.py "/path/meme.jpg" "Explain this meme"

# With API key inline
python3 scripts/analyze_image.py photo.jpg "What's this?" sk-xxxxx

Image Formats

Supports: png, jpg, jpeg, webp

Rate Limits

The free tier through NVIDIA NIM has some limits, but they're not clearly documented on the site. Check https://build.nvidia.com for the latest info on your specific key's limits.

如何使用「NVIDIA Kimi Vision」？

打开小龙虾AI（Web 或 iOS App）
点击上方「立即使用」按钮，或在对话框中输入任务描述
小龙虾AI 会自动匹配并调用「NVIDIA Kimi Vision」技能完成任务
结果即时呈现，支持继续对话优化

NVIDIA Kimi Vision

技能说明

NVIDIA Kimi Vision

Why This Skill?

API Setup (IMPORTANT)

Step 1: Get a Free API Key

Step 2: Save the Key

First Time Setup (for agents)

Usage

Examples

Image Formats

Rate Limits

如何使用「NVIDIA Kimi Vision」？

相关技能