Red Hat Developer Blog

Article

Jun 03, 2025

Structured outputs in vLLM: Guiding AI responses

Michael Goin +2

Learn how to control the output of vLLM's AI responses with structured...

Article

May 28, 2025

Implement AI safeguards with Node.js and Llama Stack

Michael Dawson

Explore how to utilize guardrails for safety mechanisms in large language...

Article

May 27, 2025

Boost GPU efficiency in Kubernetes with NVIDIA Multi-Instance GPU

Kuan Feng (IBM)

Learn how to optimize GPU resource use with NVIDIA Multi-Instance GPU (MIG)...

Blog

RAG) with Node.js to optimize your AI applications image

May 26, 2025

PowerUp 2025 Wrap up - Thoughts from the Red Hat Team

Michael Dawson

Members from the Red Hat Node.js team were recently at PowerUp 2025. It was...

Article

Featured image for Red Hat OpenShift AI.

May 22, 2025

Improve GPU utilization with Kueue in OpenShift AI

Akram Ben Aissi +2

Discover how IBM used OpenShift AI to maximize GPU efficiency on its internal...

Article

May 21, 2025

Implement LLM observability with Dynatrace on OpenShift AI

Pavol Loffay +2

Gain detailed insights into vLLM deployments on OpenShift AI. Learn to build...

Article

May 20, 2025

Getting reasoning models enterprise-ready

Abhishek Bhandwaldar +2

Learn how to use synthetic data generation (SDG) and fine-tuning in Red Hat...

Article

May 20, 2025

LLM Semantic Router: Intelligent request routing for large language models

Ron Haberman +6

LLM Semantic Router uses semantic understanding and caching to boost...

Article

May 20, 2025

llm-d: Kubernetes-native distributed inferencing

Robert Shaw +2

llm-d delivers Kubernetes-native distributed inference with advanced...

Report a website issue

Red Hat Developer Sandbox

Programming languages & frameworks

System design & architecture

Developer experience

Automated data processing

Platform engineering

Secure development & architectures

E-books

Cheat sheets

Documentation

View all blogs & articles

Structured outputs in vLLM: Guiding AI responses

Implement AI safeguards with Node.js and Llama Stack

Boost GPU efficiency in Kubernetes with NVIDIA Multi-Instance GPU

PowerUp 2025 Wrap up - Thoughts from the Red Hat Team

Improve GPU utilization with Kueue in OpenShift AI

Implement LLM observability with Dynatrace on OpenShift AI

Getting reasoning models enterprise-ready

LLM Semantic Router: Intelligent request routing for large language models

llm-d: Kubernetes-native distributed inferencing

Featured Authors

Cedric Clyburn

Michael Dawson

Don Schenck

Andrew Azores

Platforms

Build

Quicklinks

Communicate

RED HAT DEVELOPER

Red Hat legal and privacy links

Red Hat legal and privacy links

Report a website issue