home/categories/machine-learning/aws-solutions-library-samples-guidance-for-claude-code-with-amazon-bedrock-assets-claude-code-plugins-plugins-ml-training-skills-grpo-finetuning-skill-md
machine-learningdata-ai

grpo-finetuning

Implement GRPO (Group Relative Policy Optimization) fine-tuning for vision-language models on small datasets. Use when SFT underperforms or training data is limited (<1000 examples).

aws-solutions-library-samples
maintainer
aws-solutions-library-samples
अपडेट किया गया 1/27/2026
स्टार
225
फोर्क
86
quick start

Installation and usage

Implement GRPO (Group Relative Policy Optimization) fine-tuning for vision-language models on small datasets. Use when SFT underperforms or training data is limited (<1000 examples).

इंस्टॉलेशन
$ install --globalskills.sh
उपयोग

इंस्टॉल करने के बाद, आप टर्मिनल में यह कमांड चलाकर इस स्किल का उपयोग कर सकते हैं:

skills use grpo-finetuning