home/categories/machine-learning/orchestra-research-ai-research-skills-19-emerging-techniques-knowledge-distillation-skill-md
machine-learningdata-ai

knowledge-distillation

Compress large language models using knowledge distillation from teacher to student models. Use when deploying smaller models with retained performance, transferring GPT-4 capabilities to open-source models, or reducing inference costs. Covers temperature scaling, soft targets, reverse KLD, logit distillation, and MiniLLM training strategies.

Orchestra-Research
maintainer
Orchestra-Research
Updated 11/20/2025
Stars
6563
Forks
515
quick start

Installation and usage

Compress large language models using knowledge distillation from teacher to student models. Use when deploying smaller models with retained performance, transferring GPT-4 capabilities to open-source models, or reducing inference costs. Covers temperature scaling, soft targets, reverse KLD, logit distillation, and MiniLLM training strategies.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use knowledge-distillation