
Text Generation Inference
Legendaryby Hugging Face
High-performance text generation server with continuous batching, quantization, and speculative decoding.
35,600downloads
Security Verified
Infrastructure
Compatibility
llminferencegpu

by Hugging Face
High-performance text generation server with continuous batching, quantization, and speculative decoding.