home/categories/machine-learning/sundial-org-skills-skills-training-data-curation-skill-md
machine-learningdata-ai

training-data-curation

Guidelines for creating high-quality datasets for LLM post-training (SFT/DPO/RLHF). Use when preparing data for fine-tuning, evaluating data quality, or designing data collection strategies.

sundial-org
maintainer
sundial-org
Updated 1/20/2026
Stars
138
Forks
8
quick start

Installation and usage

Guidelines for creating high-quality datasets for LLM post-training (SFT/DPO/RLHF). Use when preparing data for fine-tuning, evaluating data quality, or designing data collection strategies.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use training-data-curation