data-engineering, data-ai
data-cleaning
Clean messy tabular datasets with deduplication, missing value imputation, outlier handling, and text processing. Use when dealing with dirty data that has duplicates, nulls, or inconsistent formatting.
Maintainer: benchflow-ai
Updated: 1/23/2026
Stars: 946
Forks: 244
Quick start
Installation and usage
Installation
$ install --globalskills.sh
Usage
Once installed, you can use this skill by running the following command in your terminal:
skills use data-cleaning
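
The four operations named in the description (text processing, deduplication, missing value imputation, outlier handling) can be illustrated with a minimal plain-Python sketch. This is not the skill's own implementation, which is not shown on this page. The function name `clean_rows`, the `text_key` and `num_key` parameters, the median fill, and the 1.5 × IQR clipping rule are all assumptions chosen for illustration.

```python
from statistics import median, quantiles


def clean_rows(rows, text_key, num_key):
    """Hypothetical helper: normalize text, deduplicate, impute, clip outliers.

    Assumes `rows` is a list of dicts, that at least two numeric values are
    present, and that missing numbers are represented as None.
    """
    # 1. Text processing: trim whitespace and lowercase the text column.
    normalized = [{**r, text_key: r[text_key].strip().lower()} for r in rows]

    # 2. Deduplication: keep the first occurrence of each (text, number) pair.
    seen, unique = set(), []
    for r in normalized:
        key = (r[text_key], r[num_key])
        if key not in seen:
            seen.add(key)
            unique.append(r)

    # 3. Missing value imputation: fill None with the median of known values.
    known = [r[num_key] for r in unique if r[num_key] is not None]
    fill = median(known)
    imputed = [
        {**r, num_key: fill if r[num_key] is None else r[num_key]}
        for r in unique
    ]

    # 4. Outlier handling: clip values to the Tukey fences (1.5 * IQR).
    q1, _, q3 = quantiles([r[num_key] for r in imputed], n=4, method="inclusive")
    low, high = q1 - 1.5 * (q3 - q1), q3 + 1.5 * (q3 - q1)
    return [{**r, num_key: min(max(r[num_key], low), high)} for r in imputed]
```

Text is normalized before deduplication so that rows differing only in case or surrounding whitespace collapse into one. Clipping rather than dropping outliers keeps the row count stable, which is one reasonable choice among several.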