Kubernetes

Feature image for Red Hat OpenShift
Article

Blast radius validation: Large and small Red Hat OpenShift nodes

Chris Janiszewski +1

This article evaluates the impact of deploying larger, higher-density "monster" servers on blast radius failure recovery time compared to smaller nodes in Red Hat OpenShift and Kubernetes platforms. The testing focuses on validating real-world architectural concerns, including whether higher core density increases operational risk, whether evacuation and recovery times are worse with larger, higher core-count nodes, and whether blast radius is driven by node size, or by imbalance of compute, storage, and networking performance.

Featured image for Red Hat OpenShift AI.
Article

Run Model-as-a-Service for multiple LLMs on OpenShift

Vladimir Belousov

Learn how to deploy multiple large language models (LLMs) behind a single OpenAI-compatible endpoint on OpenShift using a Model-as-a-Service (MaaS) approach. This guide demonstrates how to build an intelligent routing infrastructure that dynamically inspects the request payload and directs traffic based on the specified model field, reducing GPU waste and simplifying application logic.

Feature image for Red Hat OpenShift
Article

Integrate Red Hat Advanced Cluster Management with Argo CD

Francisco De Melo Junior

Learn how to integrate Red Hat Advanced Cluster Management with Argo CD for efficient application control. Discover how to use both push and pull models, and configure Argo CD to watch Policy resources.

Feature image for Red Hat OpenShift
Article

What's new in network observability 1.11

Steven Lee

Explore the latest features in Network Observability 1.11, an operator for Red Hat OpenShift and Kubernetes that provides insights into your network traffic flows.

Featured image for Red Hat OpenShift AI.
Article

Serve and benchmark Prithvi models with vLLM on OpenShift

Michele Gazzetti +3

Learn how to deploy and test an Earth and space model inference service on Red Hat AI Inference Server and Red Hat OpenShift AI. This article includes two self-contained activities, one deploying Prithvi using a traditional Deployment object and another serving the model using KServe and observing Knative scaling.

secure coding - simple
Article

Manage AI resource use with TokenRateLimitPolicy

Maximiliano Pizarro

Learn how to implement TokenRateLimitPolicy for LLM APIs. This approach to advanced rate limiting ensures fair use and better cost management in AI workflows.

Featured image for security.
Article

Deeper visibility in Red Hat Advanced Cluster Security

Sabina Aledort +1

Discover the new features in Red Hat Advanced Cluster Security that allow integrating component health and security data into Prometheus for better observability and alerting.

Event

Red Hat at DevNexus 2026

Headed to DevNexus? Visit the Red Hat Developer booth on-site to speak to our expert technologists.