quick start
Installation and usage
Use when "LLM inference", "serving LLM", "vLLM", "llama.cpp", "GGUF", "text generation", "model serving", "inference optimization", "KV cache", "continuous batching", "speculative decoding", "local LLM", "CPU inference"
安裝
$ install --globalskills.sh
使用
安裝後,您可以通過在終端運行以下命令來使用此技能:
skills use llm-inference