Red Hat Developer Blog

Here's our most recent blog content. Explore our featured monthly resource as well as our most recently published items. Don't miss the chance to learn more about our contributors.

Article Featured image for 2:4 Sparse Foundation Models.

Discover Sparse Llama: A 50% pruned, GPU-optimized Llama 3.1 model with 2:4...

Article Featured image showing scaffolding that forms the word "V1".

Explore how vLLM's new multimodal AI inference capabilities enhance...

Article Featured image for AI/ML

Learn about an efficient inference scaling method that can improve your...

Article Featured image for multimodal LLM Compressor article.

Explore multimodal model quantization in LLM Compressor, a unified library...

Article Featured image for AI/ML
Feb 17, 2025
Akash Srivastava +8

Progress in small LLM reasoning: Our Qwen-32B model, using particle...

Article Featured image for AI/ML
Feb 07, 2025
Akash Srivastava +8

On reproducing R1-like reasoning in small LLMs: LIMO dataset ineffective for...

Article Featured image for Distributed inference with vLLM.
Feb 06, 2025
Michael Goin

Explore how distributed inference works within vLLM in this recap of Neural...

Article Featured image for AI/ML
Feb 06, 2025
Akash Srivastava +8

An update on reproducing R1-like reasoning in small LLMs: Granite models show...

Article Featured image for AI/ML

Open-sourced on Hugging Face, deployment-ready with vLLM, and extensible...