PDF processing skill for extraction, analysis, and manipulation of PDF documents
View on GitHubtrilogy-group/swarm-claude-plugin
devops-assistant
sample-plugin/skills/pdf-processor/SKILL.md
January 17, 2026
Select agents to install to:
npx add-skill https://github.com/trilogy-group/swarm-claude-plugin/blob/main/sample-plugin/skills/pdf-processor/SKILL.md -a claude-code --skill pdf-processorInstallation paths:
.claude/skills/pdf-processor/# PDF Processor Skill
## Overview
The PDF Processor skill provides comprehensive PDF document handling capabilities including extraction, analysis, manipulation, and generation of PDF files for documentation, reporting, and compliance purposes.
## Features
### 1. Text Extraction
- Extract plain text from PDFs
- Preserve formatting and structure
- Extract tables and structured data
- Multi-language support
- Handle encrypted PDFs
### 2. OCR Processing
- Convert scanned documents to text
- Support for 100+ languages
- Image preprocessing for better accuracy
- Handwriting recognition
- Layout analysis
### 3. Metadata Operations
- Extract document properties
- Read/write custom metadata
- Extract embedded files
- Digital signature verification
- Creation/modification date tracking
### 4. PDF Manipulation
- Merge multiple PDFs
- Split PDFs by pages or bookmarks
- Rotate pages
- Crop and resize
- Add watermarks and stamps
### 5. Form Processing
- Extract form fields
- Fill PDF forms programmatically
- Validate form data
- Create fillable forms
- Export form data to JSON/CSV
### 6. Report Generation
- Generate PDFs from templates
- Create reports from data
- Add charts and graphs
- Include images and logos
- Apply corporate branding
## Configuration
```json
{
"pdf_processor": {
"enabled": true,
"ocr": {
"enabled": true,
"languages": ["eng", "fra", "deu", "spa"],
"dpi": 300,
"preprocessing": true
},
"extraction": {
"preserve_formatting": true,
"extract_images": true,
"extract_tables": true,
"extract_metadata": true
},
"security": {
"allow_encrypted": true,
"max_file_size_mb": 100,
"sandbox_mode": true
},
"output": {
"formats": ["text", "json", "html", "markdown"],
"compression": true,
"optimization": true
},
"performance": {
"parallel_processing": true,
"max_workers": 4,
"cache_enabled": true
}
}
}
```
## Usage Exam