跳至主要内容
小龙虾小龙虾AI
🤖

WiseOCR

PDF OCR — Convert PDF to Markdown via WiseDiag cloud API (requires WISEDIAG_API_KEY). Files are uploaded to WiseDiag servers for processing. Supports table r...

下载278
星标5
版本1.0.9
效率工具
安全通过
⚙️脚本

技能说明


name: wiseocr description: "Convert PDF files to Markdown using WiseOCR API (powered by WiseDiag). Supports table recognition, multi-column layouts, and medical document OCR. Usage: Upload a PDF file and say Use WiseOCR to process this." registry: homepage: https://github.com/fmdmm/WiseOCR author: fmdmm credentials: required: true env_vars: - WISEDIAG_API_KEY

⚠️ Privacy Warning

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your files to WiseDiag's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless:

  • You trust WiseDiag's data handling policies
  • You accept that file contents will be transmitted and processed remotely

For sensitive documents, use offline/local OCR tools instead.


WiseOCR Skill (powered by WiseDiag)

Convert PDF files into Markdown format. The script handles API authentication, file upload, OCR processing, and saves the result automatically.

⚠️ IMPORTANT: How to Use This Skill

You MUST use the provided script to process files. Do NOT call any API or HTTP endpoint directly.

The script scripts/wiseocr.py handles everything:

  • API authentication (reads WISEDIAG_API_KEY from environment)
  • PDF upload and OCR processing
  • Saves the Markdown result to WiseOCR/{filename}.md

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.wisediag.com/apiKeyManage

export WISEDIAG_API_KEY=your_api_key

Installation

pip install -r requirements.txt

Usage

To process a PDF file, run:

cd scripts
python wiseocr.py -i /path/to/input.pdf -n original_filename

IMPORTANT: Always pass -n with the original filename (without extension) so the output file is named correctly. If the uploaded file has been renamed (e.g. to ocr_input.pdf), -n ensures the output uses the user's original filename.

The script will automatically save the result to WiseOCR/{name}.md.

Example:

python wiseocr.py -i /tmp/ocr_input.pdf -n medical_report
# Output saved to: WiseOCR/medical_report.md

With custom output directory:

python wiseocr.py -i /path/to/input.pdf -n medical_report -o /custom/output/dir

Arguments

FlagDescription
-i, --inputInput PDF file path (required)
-n, --nameOriginal filename without extension for output (recommended)
-o, --outputOutput directory (default: ./WiseOCR)
--dpiPDF rendering DPI, 72-600 (default: 200)

Output

After the script runs, the Markdown file is saved automatically:

  • Default: WiseOCR/{name}.md
  • The file is named using the -n parameter (e.g. -n reportreport.md), or falls back to the input filename
  • No additional saving is needed — the file is already on disk

Data Privacy

What happens to your files:

  1. Files are uploaded to WiseDiag's OCR API (https://openapi.wisediag.com)
  2. Files are processed on WiseDiag servers
  3. Processing results are returned to you
  4. Files are not permanently stored on WiseDiag servers

For sensitive documents, use offline/local OCR tools instead.

License

MIT

如何使用「WiseOCR」?

  1. 打开小龙虾AI(Web 或 iOS App)
  2. 点击上方「立即使用」按钮,或在对话框中输入任务描述
  3. 小龙虾AI 会自动匹配并调用「WiseOCR」技能完成任务
  4. 结果即时呈现,支持继续对话优化

相关技能