
llm-serving-patterns

LLM inference infrastructure, serving frameworks (vLLM, TGI, TensorRT-LLM), quantization techniques, batching strategies, and streaming response patterns. Use when designing LLM serving infrastructure, optimizing inference latency, or scaling LLM deployments.

Maintainer: melodic-software
Updated: 1/19/2026
Stars: 11
Forks: 1
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installing, you can use this skill by running the following command in a terminal:

skills use llm-serving-patterns