Artificial intelligence

Red Hat OpenShift AI
Article

How to route external and local LLMs with Models-as-a-Service

Edward Arthur Quarm Jnr

Discover how Red Hat OpenShift AI 3.4's Models-as-a-Service (MaaS) capability streamlines AI inference by acting as an integrated AI gateway within the platform, providing centralized governance and routing requests to both self-hosted models and external providers.

Featured image for agentic AI
Article

How Kagenti ADK simplifies production AI agent management

Legare Kerrison

Learn how Kagenti ADK, an open source toolkit, handles the complexities of managing production AI agents. It aligns with the Linux Foundation's Agent2Agent (A2A) protocol and provides a set of runtime services for easier deployment and operation.

Featured image for Red Hat OpenShift AI.
Article

OpenShift AI observability summarizer: Transform metrics into meaning

Twinkll Sisodia +4

Learn how the Red Hat OpenShift AI observability summarizer transforms raw time-series data from Prometheus into actionable, human-readable insights for platform teams. Discover the five-layer pipeline architecture and how it reduces noise and increases signal for a focused answer.

Featured image for vLLM interference article.
Article

Beyond the next token: Why diffusion LLMs are changing the game

Alon Kellner +1

This article discusses the benefits of diffusion LLMs, a revolutionary approach to language models that offers a dynamic tradeoff between accuracy and performance. The article covers the architecture, evolution, and real-world statistics of this technology, including examples of open source models like LLaDA 2.X and Mercury 2.

Featured image for machine learning
Article

From 200 lines to 15: How Helion is rewriting the rules of GPU programming

Sumantro Mukherjee +1

Discover how Helion, a Python embedded domain-specific language, abstracts low-level parallelism details to allow developers to write GPU operations using simple, intuitive PyTorch-like syntax. Automatically generate hundreds or even thousands of Triton variants for optimal performance.

OpenShift Dev Spaces
Article

OpenCode: A model-neutral AI coding assistant for OpenShift Dev Spaces

Rohan Kumar

Discover OpenCode, a model-neutral AI coding assistant that supports over 75 providers, including OpenAI, Anthropic Claude, Google Gemini, and local large language models (LLMs) via Ollama. Switch models on demand, compare outputs, avoid vendor lock-in, and even run fully offline with local models. Learn how to set up your environment in Red Hat OpenShift Dev Spaces.

A stylized illustration representing an artificial neural network, set against a dark purple background within a slightly rounded, darker purple square icon shape. The neural network consists of multiple layers of interconnected nodes, depicted as glossy, spherical red orbs. Lines connect these red orbs, forming a complex web. White arrow shapes extend horizontally from the left side, pointing towards the network, suggesting input or data flowing into the system.
Article

Combining KServe and llm-d for optimized generative AI inference

Ran Pollak +1

Learn how to combine KServe and llm-d to optimize generative AI inference, improve performance, and reduce infrastructure costs. This article demonstrates the integration architecture and provides practical guidance for AI platform teams.

Featured image for vLLM interference article.
Article

AI-powered documentation updates: From code diff to docs PR in one comment

Carmel Soceanu

Learn how to automate documentation updates for code changes using Code-to-Docs, an open source GitHub Action. This tool uses AI to analyze your code changes, identify affected documentation files, and generate updated content. Get started with this guide.