machine-learning, data-ai

grpo-finetuning

Implement GRPO (Group Relative Policy Optimization) fine-tuning for vision-language models on small datasets. Use when SFT underperforms or training data is limited (<1000 examples).
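
As a rough illustration of the "group relative" part of GRPO: several completions are sampled for the same prompt, each is scored by a reward function, and each reward is normalized against the mean and standard deviation of its own group. The sketch below is not taken from this skill. The function name, the sample rewards, the `eps` stabilizer, and the choice of population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group.

    Hypothetical helper: `eps` guards against division by zero when
    every completion in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four completions sampled from one prompt.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that score above their group's mean get a positive advantage and those below get a negative one, so no separate value model is needed to provide a baseline.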

Maintainer: aws-solutions-library-samples
Updated: 1/27/2026
Stars: 225
Forks: 86
quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installation, you can use this skill by running the following command in your terminal:

skills use grpo-finetuning