home/categories/data-engineering/majiayu000-claude-skill-registry-skills-data-data-cleaning-pipeline-generator-skill-md
data-engineeringdata-ai

data-cleaning-pipeline-generator

Generates data cleaning pipelines for pandas/polars with handling for missing values, duplicates, outliers, type conversions, and data validation. Use when user asks to "clean data", "generate data pipeline", "handle missing values", or "remove duplicates from dataset".

majiayu000
maintainer
majiayu000
Updated 1/22/2026
Stars
185
Forks
40
quick start

Installation and usage

Generates data cleaning pipelines for pandas/polars with handling for missing values, duplicates, outliers, type conversions, and data validation. Use when user asks to "clean data", "generate data pipeline", "handle missing values", or "remove duplicates from dataset".

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use data-cleaning-pipeline-generator