Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
Scanned 2/12/2026
Install via CLI
openskills install sundial-org/awesome-openclaw-skills---
name: mineru-pdf
description: Parse PDFs locally (CPU) into Markdown/JSON using MinerU. Assumes MinerU creates per‑doc output folders; supports table/image extraction.
---
# MinerU PDF
## Overview
Parse a PDF locally with MinerU (CPU). Default output is Markdown + JSON. Use tables/images only when requested.
## Quick start (single PDF)
```bash
# Run from the skill directory
./scripts/mineru_parse.sh /path/to/file.pdf
```
Optional examples:
```bash
./scripts/mineru_parse.sh /path/to/file.pdf --format json
./scripts/mineru_parse.sh /path/to/file.pdf --tables --images
```
## When to read references
If flags differ from your wrapper or you need advanced defaults (backend/method/device/threads/format mapping), read:
- `references/mineru-cli.md`
## Output conventions
- Output root defaults to `./mineru-output/`.
- MinerU creates the per-document subfolder under the output root (e.g., `./mineru-output/<basename>/...`).
## Batching
Default is single-PDF parsing. Only implement batch folder parsing if explicitly requested.
No comments yet. Be the first to comment!
**Version:** 2.7.0 **Feature:** Bootstrap Skill (Dogfooding) **Status:** ✅ Production Ready **Last Updated:** 2026-01-18 ---