opencr-skill
Extract text from images, documents, and scanned PDFs using OpenOCR, with support for text detection and recognition.
Setup & Installation
Install command
clawhub install topdu/opencr-skill
If the CLI is not installed:
npx clawhub@latest install topdu/opencr-skill
Or install with the OpenClaw CLI:
openclaw skills install topdu/opencr-skill
Or paste the repo link into your assistant's chat:
https://github.com/openclaw/skills/tree/main/skills/topdu/opencr-skill
What This Skill Does
OpenOCR is a Python library for extracting text from images, scanned documents, and PDFs. It provides five task modes: text detection, text recognition, end-to-end OCR, VLM-based recognition for formulas and tables, and document parsing with layout analysis. It supports Chinese, English, and other languages.
Because it handles text, formulas, tables, and layout-aware document parsing in a single library, separate specialized OCR tools are not needed for these tasks.
When to Use It
- Converting scanned PDF invoices to searchable text
- Extracting LaTeX formulas from academic paper screenshots
- Parsing tables from document images into structured format
- Batch processing folders of scanned receipts
- Converting document images to Markdown with layout preserved
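The batch-processing use case above can be sketched as a small driver loop. This is a minimal illustration, not OpenOCR's own API: `SUPPORTED_SUFFIXES`, `find_scans`, `batch_ocr`, and the `recognize` callable are hypothetical names introduced here, and `recognize` stands in for whatever OCR call the skill actually makes.

```python
from pathlib import Path
from typing import Callable, Dict, List

# File types treated as OCR inputs. This set is an assumption for the sketch,
# not a list taken from OpenOCR.
SUPPORTED_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp", ".pdf"}


def find_scans(folder: Path) -> List[Path]:
    """Return supported files under `folder` (recursively), in sorted order."""
    return sorted(
        p for p in folder.rglob("*")
        if p.is_file() and p.suffix.lower() in SUPPORTED_SUFFIXES
    )


def batch_ocr(folder: Path, recognize: Callable[[Path], str]) -> Dict[str, str]:
    """Apply `recognize` to every scan and map relative path -> extracted text.

    `recognize` is a hypothetical stand-in for the real OCR call.
    Errors are recorded per file so one bad scan does not stop the batch.
    """
    results: Dict[str, str] = {}
    for path in find_scans(folder):
        key = path.relative_to(folder).as_posix()
        try:
            results[key] = recognize(path)
        except Exception as exc:
            results[key] = f"ERROR: {exc}"
    return results
```

Passing the OCR step in as a callable keeps the folder traversal and error handling independent of any particular engine, so the same loop works whichever of the task modes is used.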
Example Workflow
Here's how your AI assistant might use this skill in practice.
User asks: Converting scanned PDF invoices to searchable text
1. Converting scanned PDF invoices to searchable text
2. Extracting LaTeX formulas from academic paper screenshots
3. Parsing tables from document images into structured format
4. Batch processing folders of scanned receipts
5. Converting document images to Markdown with layout preserved
Security Audits
These signals reflect official OpenClaw status values. A Suspicious status means the skill should be used with extra caution.