Robert Shaw

Robert Shaw's contributions

Featured blog image with the following text: vLLM and DeepSeek
Article

How we optimized vLLM for DeepSeek-R1

Michael Goin +4

Explore inference performance improvements that help vLLM serve DeepSeek AI models more efficiently in this technical deep dive.

featured image for SparseGPT.
Article

SparseGPT: Remove 100 billion parameters for free

Robert Shaw +1

Compress large language models (LLMs) with SparseGPT to make your machine learning inference fast and efficient. Prune in one-shot with minimal accuracy loss.