machine-learning, data-ai

ml-inference-optimization

ML inference latency optimization, model compression, distillation, caching strategies, and edge deployment patterns. Use when optimizing inference performance, reducing model size, or deploying ML at the edge.

Maintainer: melodic-software
Updated 1/19/2026
Stars: 11
Forks: 1
quick start

Installation and usage

Install
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use ml-inference-optimization