Back to skills
Pdf Extractor
Extract and convert PDF documents using Python scripts
72 stars
0 votes
0 copies
0 views
Added 12/19/2025
data-aipythonbash
Install via CLI
$
openskills install maxvaega/skillkitFiles
SKILL.md
---
name: pdf-extractor
description: Extract and convert PDF documents using Python scripts
version: 1.0.0
allowed-tools:
- Bash
- Read
- Write
---
# PDF Extractor Skill
This skill provides tools for extracting text and metadata from PDF documents and converting them to different formats.
## Available Scripts
### extract.py
Extracts text and metadata from PDF files.
**Input**:
```json
{
"file_path": "/path/to/document.pdf",
"pages": "all" | [1, 2, 3]
}
```
**Output**:
```json
{
"text": "Extracted text content...",
"metadata": {
"title": "Document Title",
"author": "Author Name",
"pages": 10
}
}
```
### convert.sh
Converts PDF files to different formats (text, markdown, etc.).
**Input**:
```json
{
"input_file": "/path/to/input.pdf",
"output_format": "txt" | "md" | "html"
}
```
### parse.py
Parses structured data from PDF forms and tables.
**Input**:
```json
{
"file_path": "/path/to/form.pdf",
"extract_tables": true,
"extract_forms": true
}
```
## Usage Example
```python
from skillkit import SkillManager
manager = SkillManager()
result = manager.execute_skill_script(
skill_name="pdf-extractor",
script_name="extract",
arguments={"file_path": "document.pdf", "pages": "all"}
)
if result.success:
print(result.stdout)
```
Attribution
Comments (0)
No comments yet. Be the first to comment!
