WiseOCR
PDF OCR — Convert PDF to Markdown via WiseDiag cloud API (requires WISEDIAG_API_KEY). Files are uploaded to WiseDiag servers for processing. Supports table r...
技能说明
name: wiseocr description: "Convert PDF files to Markdown using WiseOCR API (powered by WiseDiag). Supports table recognition, multi-column layouts, and medical document OCR. Usage: Upload a PDF file and say Use WiseOCR to process this." registry: homepage: https://github.com/fmdmm/WiseOCR author: fmdmm credentials: required: true env_vars: - WISEDIAG_API_KEY
⚠️ Privacy Warning
IMPORTANT - READ BEFORE INSTALLING:
This skill uploads your files to WiseDiag's cloud servers for OCR processing.
Do NOT use with sensitive or confidential documents unless:
- You trust WiseDiag's data handling policies
- You accept that file contents will be transmitted and processed remotely
For sensitive documents, use offline/local OCR tools instead.
WiseOCR Skill (powered by WiseDiag)
Convert PDF files into Markdown format. The script handles API authentication, file upload, OCR processing, and saves the result automatically.
⚠️ IMPORTANT: How to Use This Skill
You MUST use the provided script to process files. Do NOT call any API or HTTP endpoint directly.
The script scripts/wiseocr.py handles everything:
- API authentication (reads
WISEDIAG_API_KEYfrom environment) - PDF upload and OCR processing
- Saves the Markdown result to
WiseOCR/{filename}.md
🔑 API Key Setup (Required)
Get your API key: 👉 https://console.wisediag.com/apiKeyManage
export WISEDIAG_API_KEY=your_api_key
Installation
pip install -r requirements.txt
Usage
To process a PDF file, run:
cd scripts
python wiseocr.py -i /path/to/input.pdf -n original_filename
IMPORTANT: Always pass -n with the original filename (without extension) so the output file is named correctly. If the uploaded file has been renamed (e.g. to ocr_input.pdf), -n ensures the output uses the user's original filename.
The script will automatically save the result to WiseOCR/{name}.md.
Example:
python wiseocr.py -i /tmp/ocr_input.pdf -n medical_report
# Output saved to: WiseOCR/medical_report.md
With custom output directory:
python wiseocr.py -i /path/to/input.pdf -n medical_report -o /custom/output/dir
Arguments
| Flag | Description |
|---|---|
-i, --input | Input PDF file path (required) |
-n, --name | Original filename without extension for output (recommended) |
-o, --output | Output directory (default: ./WiseOCR) |
--dpi | PDF rendering DPI, 72-600 (default: 200) |
Output
After the script runs, the Markdown file is saved automatically:
- Default:
WiseOCR/{name}.md - The file is named using the
-nparameter (e.g.-n report→report.md), or falls back to the input filename - No additional saving is needed — the file is already on disk
Data Privacy
What happens to your files:
- Files are uploaded to WiseDiag's OCR API (
https://openapi.wisediag.com) - Files are processed on WiseDiag servers
- Processing results are returned to you
- Files are not permanently stored on WiseDiag servers
For sensitive documents, use offline/local OCR tools instead.
License
MIT
如何使用「WiseOCR」?
- 打开小龙虾AI(Web 或 iOS App)
- 点击上方「立即使用」按钮,或在对话框中输入任务描述
- 小龙虾AI 会自动匹配并调用「WiseOCR」技能完成任务
- 结果即时呈现,支持继续对话优化