home/categories/data-engineering/benchflow-ai-skillsbench-tasks-trend-anomaly-causal-inference-environment-skills-data-cleaning-skill-md
data-engineeringdata-ai

data-cleaning

Clean messy tabular datasets with deduplication, missing value imputation, outlier handling, and text processing. Use when dealing with dirty data that has duplicates, nulls, or inconsistent formatting.

benchflow-ai
maintainer
benchflow-ai
Updated 1/23/2026
Stars
946
Forks
244
quick start

Installation and usage

Clean messy tabular datasets with deduplication, missing value imputation, outlier handling, and text processing. Use when dealing with dirty data that has duplicates, nulls, or inconsistent formatting.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use data-cleaning