home/categories/machine-learning/melodic-software-claude-code-plugins-plugins-systems-design-skills-llm-serving-patterns-skill-md
machine-learningdata-ai
llm-serving-patterns
LLM inference infrastructure, serving frameworks (vLLM, TGI, TensorRT-LLM), quantization techniques, batching strategies, and streaming response patterns. Use when designing LLM serving infrastructure, optimizing inference latency, or scaling LLM deployments.
maintainer
melodic-software
更新於 1/19/2026
星標
11
分支
1
quick start
Installation and usage
LLM inference infrastructure, serving frameworks (vLLM, TGI, TensorRT-LLM), quantization techniques, batching strategies, and streaming response patterns. Use when designing LLM serving infrastructure, optimizing inference latency, or scaling LLM deployments.
安裝
$ install --globalskills.sh
使用
安裝後,您可以通過在終端運行以下命令來使用此技能:
skills use llm-serving-patterns