- vLLM's V1 engine is ready for testing! It is a rewritten engine designed for performance and architectural simplicity. To turn it on, set the environment variable `VLLM_USE_V1=1`.
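
As a minimal sketch of setting that variable from Python, the helper below builds an environment mapping with `VLLM_USE_V1=1` that can be passed to a child process. The helper name `with_v1_enabled` is hypothetical and not part of vLLM; the only thing taken from the announcement is the variable name and value.

```python
import os


def with_v1_enabled(base_env=None):
    """Return a copy of an environment mapping with VLLM_USE_V1 set to "1".

    `with_v1_enabled` is an illustrative helper, not a vLLM API.
    The input mapping is left unmodified.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["VLLM_USE_V1"] = "1"  # opt in to the V1 engine, per the announcement
    return env


if __name__ == "__main__":
    env = with_v1_enabled()
    print("VLLM_USE_V1 =", env["VLLM_USE_V1"])
```

The returned mapping can be handed to `subprocess.run(..., env=with_v1_enabled())` when launching whatever vLLM command you normally use. Setting the variable in the shell before starting the process is equivalent.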
For more updates, see the v0.7.0 release notes: https://github.com/vllm-project/vllm/releases/tag/v0.7.0