Launches
vLLM v0.19.0 Cracks Zero-Bubble Scheduling, Guts Speculative Decode Overhead
vLLM v0.19.0 ships 448 commits from 197 contributors: zero-bubble async scheduling with speculative decoding, full Gemma 4 support, and Model Runner V2 maturation across pipeline parallelism and multimodal paths.