mineru-pdf-extractor
Extract PDF content to Markdown using the MinerU API.
Setup & Installation
Install command
clawhub install a-i-r/mineru-pdf-extractor
If the CLI is not installed:
npx clawhub@latest install a-i-r/mineru-pdf-extractor
Or install with OpenClaw CLI:
openclaw skills install a-i-r/mineru-pdf-extractor
Or paste the repo link into your assistant's chat:
https://github.com/openclaw/skills/tree/main/skills/a-i-r/mineru-pdf-extractor
What This Skill Does
MinerU PDF Extractor converts PDF documents to structured Markdown via the MinerU API. It handles formula recognition, table extraction, and OCR. Two methods are available: uploading a local file in 4 steps, or submitting a public URL in 2 steps.
It is aimed at complex PDF content, such as LaTeX formulas and embedded tables, that basic text extractors miss.
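The two-step public-URL method could look roughly like the sketch below: submit the URL, then poll until the result is ready. This page does not document the API itself, so everything specific here is an assumption made for illustration: the base URL, the endpoint paths, the request fields (`is_ocr`, `enable_formula`, `enable_table`), and the response fields (`task_id`, `state`, `result_url`) are hypothetical placeholders. Check the MinerU API documentation for the real names before using any of it.

```python
import json
import time
import urllib.request

# ASSUMPTION: placeholder base URL; the real MinerU endpoint is not given on this page.
API_BASE = "https://mineru.example.invalid/api"


def build_task_payload(pdf_url, ocr=True, formulas=True, tables=True):
    """Build the request body for step 1 (field names are hypothetical)."""
    return {
        "url": pdf_url,
        "is_ocr": ocr,
        "enable_formula": formulas,
        "enable_table": tables,
    }


def http_json(method, url, token, body=None):
    """Send a JSON request with a bearer token and decode the JSON response."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(
        url,
        data=data,
        method=method,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def extract_from_url(pdf_url, token, send=http_json, poll_seconds=5, max_polls=60):
    """Submit a public PDF URL, then poll until a result link is available."""
    # Step 1: submit the extraction task (hypothetical path and response field).
    task = send("POST", f"{API_BASE}/tasks", token, build_task_payload(pdf_url))
    task_id = task["task_id"]

    # Step 2: poll the task until it finishes, fails, or we give up.
    for _ in range(max_polls):
        status = send("GET", f"{API_BASE}/tasks/{task_id}", token)
        if status["state"] == "done":
            return status["result_url"]
        if status["state"] == "failed":
            raise RuntimeError(f"extraction failed for task {task_id}")
        time.sleep(poll_seconds)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

The `send` parameter exists so the flow can be exercised with a stub instead of a live service; with real credentials you would call `extract_from_url(pdf_url, token)` and download the Markdown from the returned link.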
When to Use It
- Converting arXiv research papers to Markdown for note-taking
- Extracting tables from financial reports into structured text
- Batch processing academic PDFs to build a searchable knowledge base
- Parsing scanned documents with OCR into plain text
- Converting technical manuals with embedded formulas to Markdown
Example Workflow
Here's how your AI assistant might use this skill in practice.
User asks: Converting arXiv research papers to Markdown for note-taking
Security Audits
These signals reflect official OpenClaw status values. A Suspicious status means the skill should be used with extra caution.