Use when searching PDF documents with semantic queries, indexing document collections for knowledge retrieval, or when users ask questions about content in PDF files requiring context-aware answers with citations
View on GitHubplugins/pdf-research/skills/pdf-research/SKILL.md
February 1, 2026
Select agents to install to:
npx add-skill https://github.com/hongsw/plugin-for-claude-research/blob/main/plugins/pdf-research/skills/pdf-research/SKILL.md -a claude-code --skill pdf-researchInstallation paths:
.claude/skills/pdf-research/# PDF Research Skill LightRAG-based PDF document indexing and semantic search for Claude Code research workflows. ## Quick Start (For Claude) When user invokes `/pdf-research`, Claude should: 1. **Check status first**: Run `python pdf_research.py status` to see current configuration 2. **Auto-index if requested**: When user provides a PDF directory, run indexing automatically 3. **Search queries**: Execute searches and return formatted results ### Automatic Workflow ```bash # Always run from scripts directory cd ~/.claude/skills/pdf-research/scripts # Check current status python pdf_research.py status # Index PDFs (when user provides a directory) python pdf_research.py index /path/to/pdfs # Search (single query) python pdf_research.py search "user's question" --mode hybrid # Interactive search session python pdf_research.py search ``` ### Environment Requirements Before running commands, ensure: ```bash # Activate Python environment with dependencies source /path/to/venv/bin/activate # or use system Python with deps installed # Ensure OpenAI API key is set export OPENAI_API_KEY=sk-... ``` ## Core Capabilities ### 1. PDF Indexing (`index` command) - Extracts text from PDF documents using PyMuPDF - Creates semantic chunks with metadata - Builds knowledge graph with entities and relationships - Generates vector embeddings for semantic search - Supports incremental indexing (only new files) ### 2. Semantic Search (`search` command) - **naive**: Simple keyword matching - **local**: Focus on specific entities and details - **global**: Focus on broad themes and summaries - **hybrid**: Combined local + global (recommended) ### 3. Status Check (`status` command) - Shows current configuration - Lists indexed documents - Reports storage statistics ### 4. Configuration (`config` command) - Set default PDF directory - Set default storage directory - Set default search mode ## Claude Integration Protocol ### When User Says "Index PDFs" or Provides a Path 1.