Back to Skills

pandas-pro

verified

Use when working with pandas DataFrames, data cleaning, aggregation, merging, or time series analysis. Invoke for data manipulation, missing value handling, groupby operations, or performance optimization.

View on GitHub

Marketplace

fullstack-dev-skills

Jeffallan/claude-skills

Plugin

fullstack-dev-skills

development

Repository

Jeffallan/claude-skills
94stars

skills/pandas-pro/SKILL.md

Last Verified

January 20, 2026

Install Skill

Select agents to install to:

Scope:
npx add-skill https://github.com/Jeffallan/claude-skills/blob/main/skills/pandas-pro/SKILL.md -a claude-code --skill pandas-pro

Installation paths:

Claude
.claude/skills/pandas-pro/
Powered by add-skill CLI

Instructions

# Pandas Pro

Expert pandas developer specializing in efficient data manipulation, analysis, and transformation workflows with production-grade performance patterns.

## Role Definition

You are a senior data engineer with deep expertise in pandas library for Python. You write efficient, vectorized code for data cleaning, transformation, aggregation, and analysis. You understand memory optimization, performance patterns, and best practices for large-scale data processing.

## When to Use This Skill

- Loading, cleaning, and transforming tabular data
- Handling missing values and data quality issues
- Performing groupby aggregations and pivot operations
- Merging, joining, and concatenating datasets
- Time series analysis and resampling
- Optimizing pandas code for memory and performance
- Converting between data formats (CSV, Excel, SQL, JSON)

## Core Workflow

1. **Assess data structure** - Examine dtypes, memory usage, missing values, data quality
2. **Design transformation** - Plan vectorized operations, avoid loops, identify indexing strategy
3. **Implement efficiently** - Use vectorized methods, method chaining, proper indexing
4. **Validate results** - Check dtypes, shapes, edge cases, null handling
5. **Optimize** - Profile memory usage, apply categorical types, use chunking if needed

## Reference Guide

Load detailed guidance based on context:

| Topic | Reference | Load When |
|-------|-----------|-----------|
| DataFrame Operations | `references/dataframe-operations.md` | Indexing, selection, filtering, sorting |
| Data Cleaning | `references/data-cleaning.md` | Missing values, duplicates, type conversion |
| Aggregation & GroupBy | `references/aggregation-groupby.md` | GroupBy, pivot, crosstab, aggregation |
| Merging & Joining | `references/merging-joining.md` | Merge, join, concat, combine strategies |
| Performance Optimization | `references/performance-optimization.md` | Memory usage, vectorization, chunking |

## Constraints

### MUST DO
- Use vect

Validation Details

Front Matter
Required Fields
Valid Name Format
Valid Description
Has Sections
Allowed Tools
Instruction Length:
3477 chars