How we built integration testing for fast-moving AI backend
A Llama Stack-dependent backend, or any rapidly-evolving upstream project faces a version-drift problem. Explore our no-cost solution that provides early warnings.
A Llama Stack-dependent backend, or any rapidly-evolving upstream project faces a version-drift problem. Explore our no-cost solution that provides early warnings.
Learn how to transform a simple chatbot into an enterprise RAG application by applying metadata filtering, hybrid search, and neural reranking using the OGX framework in Red Hat OpenShift AI.
Discover how Red Hat OpenShift AI 3.4's Models-as-a-Service (MaaS) capability streamlines AI inference by acting as an integrated AI gateway within the platform, providing centralized governance and routing requests to both self-hosted models and external providers.
Learn how to prevent GPU waste and financial loss by implementing just-in-time (JIT) checkpointing with Kubeflow Training SDK on OpenShift AI.
Learn how to monitor and analyze costs associated with your OpenShift clusters using Red Hat Lightspeed Cost Management API. Retrieve cost data for every cluster, project, node, and more. Use filters to get the exact data you need.
Learn how Red Hat Hybrid Cloud Console uses a single data layer to serve the access management interface at runtime, Storybook mocks during development, and a standalone CLI that seeds and cleanses real test environments.
Learn about critical lessons from building an MCP-powered AI agent for ServiceNow, including how to structure testing environments, best practices for implementing safeguards, and a phased approach to deploying enterprise AI integrations.
Discover how Red Hat optimized for human maintainability and significantly increased AI-assisted productivity by formalizing architectural constraints into machine-readable rules, custom lint rules, and deep documentation. Learn about the three layers they built and the impact on development.
Learn how to set up vLLM Semantic Router locally with two models: a quantized Qwen3-Coder-Next running on Apple Silicon, and Google's Gemini 2.5 Pro as the cloud fallback. This router can significantly reduce token costs by routing common requests to a less expensive model.
Learn how to deploy multiple large language models (LLMs) behind a single OpenAI-compatible endpoint on OpenShift using a Model-as-a-Service (MaaS) approach. This guide demonstrates how to build an intelligent routing infrastructure that dynamically inspects the request payload and directs traffic based on the specified model field, reducing GPU waste and simplifying application logic.
Learn how integrating Red Hat Lightspeed Model Context Protocol (MCP) and Red Hat Lightspeed advisor optimizes infrastructure health management.
This guide details how to use curl and jq to retrieve and format CVE data in readable text and structured CSV formats for data analysis and security tasks.
Learn more about the Software Catalog and Templates in the Red Hat Developer Hub.
Compare OVN-K, MACVLAN, and SR-IOV performance on OpenShift 4.20. See how control plane churn impacts data plane throughput and stability in telco environments.
Use Red Hat Lightspeed to simplify inventory management and convert natural language into inventory API queries for auditing and multi-agent automation.
Learn how OpenShift APIs for Data Protection self-service enables developers to manage OpenShift application backup and restore, enforcing least privilege.
Simplify the management of numerous Red Hat OpenShift HyperShift (HCP) clusters
Learn how to integrate the Gateway API for OpenShift with OpenShift Service Mesh, utilizing certificate trust and mTLS communication.
Learn how to migrate from Llama Stack’s deprecated Agent APIs to the modern, OpenAI-compatible Responses API without rebuilding from scratch.
Explore new features in Red Hat JBoss EAP XP 6, including upgrades to MicroProfile 7, MicroProfile LRA and multi-app support, and observability tools.
Use SDG Hub to generate high-quality synthetic data for your AI models. This guide provides a full, copy-pasteable Jupyter Notebook for practitioners.
Learn how to install and use new MCP plug-ins for Red Hat Developer Hub that provide tools for MCP clients to interact with it.
Your Red Hat Developer membership unlocks access to product trials, learning resources, events, tools, and a community you can trust to help you stay ahead in AI and emerging tech.
HyperShift streamlines OpenShift cluster management with hosted control planes, cutting costs, accelerating creation, and efficiently scaling large fleets.
Discover how llama.cpp API remoting brings AI inference to native speed on macOS, closing the gap between API remoting and native performance.