Artificial intelligence

RHEL
Article

How to run TRELLIS on RHEL with Podman

Brian Smith

This article demonstrates how to run the Microsoft TRELLIS AI workload using Podman on RHEL to generate 3D assets.

Event

Red Hat at Devoxx UK 2025

Headed to DevNexus 2025? Visit the Red Hat Developer booth on-site to speak to our expert technologists.

Article

How we optimized vLLM for DeepSeek-R1

Michael Goin +4

Explore inference performance improvements that help vLLM serve DeepSeek AI models more efficiently in this technical deep dive.

Article

Granite, LIMO, and small LLM reasoning

Akash Srivastava +8

Notes on reproducing R1-like reasoning in small LLMs: the LIMO dataset proved ineffective for Llama and Granite models, while synthetic data generation shows promise, though fine-tuning remains tricky.

Article

How particle filtering makes small LLMs think big

Akash Srivastava +8

An update on reproducing R1-like reasoning in small LLMs: Granite models show big gains with particle filtering, outperforming GPT-4o on benchmarks.

Article

Distributed inference with vLLM

Michael Goin

Explore how distributed inference works within vLLM in this recap of Neural Magic's vLLM Office Hours with Michael Goin and Murali Andoorveedu, a vLLM committer from CentML.