Tyler Smith

Tyler Smith's contributions

How we optimized vLLM for DeepSeek-R1

Michael Goin +4

March 19, 2025

A technical deep dive into the inference performance improvements that help vLLM serve DeepSeek AI models more efficiently.

Explore the integration of FP8 in vLLM. Learn how to achieve up to a 2x reduction in latency on NVIDIA GPUs with minimal accuracy degradation.
