Red Hat AI

Article

Deploy Llama 3 8B with vLLM

Mark Kurtz

Llama 3's advancements, particularly at 8 billion parameters, make AI more accessible and efficient.

Article

SparseGPT: Remove 100 billion parameters for free

Robert Shaw +1

Compress large language models (LLMs) with SparseGPT to make your machine learning inference fast and efficient. Prune in one shot with minimal accuracy loss.