Implement LLM observability with Dynatrace on OpenShift AI
Gain detailed insights into vLLM deployments on OpenShift AI. Learn to build dashboards with Dynatrace and OpenTelemetry to enable reliable LLM performance.
Gain detailed insights into vLLM deployments on OpenShift AI. Learn to build dashboards with Dynatrace and OpenTelemetry to enable reliable LLM performance.
Learn how to use Red Hat OpenShift AI to quickly develop, train, and deploy
Explore the complete machine learning operations (MLOps) pipeline utilizing Red
llm-d delivers Kubernetes-native distributed inference with advanced optimizations, reducing latency and maximizing throughput.
LLM Semantic Router uses semantic understanding and caching to boost performance, cut costs, and enable efficient inference with llm-d.
Optimize model inference and reduce costs with model compression techniques like quantization and pruning with LLM Compressor on Red Hat OpenShift AI.
Learn how to use synthetic data generation (SDG) and fine-tuning in Red Hat AI to customize reasoning models for your enterprise workflows.
Learn how to deploy a trained model with Red Hat OpenShift AI and use its
Explore how to use large language models (LLMs) with Node.js by observing Ollama
Discover how you can use the Podman AI Lab extension for Podman Desktop to work
More Essential AI tutorials for Node.js Developers
Learn how to run a fraud detection AI model using confidential virtual machines on RHEL running in the Microsoft Azure public cloud.
Configure your Red Hat Enterprise Linux AI machine, download, serve, and
vLLM empowers macOS and iOS developers to build powerful AI-driven applications by providing a robust and optimized engine for running large language models.
PowerUP 2025 is the week of May 19th. It's held in Anaheim, California this year
Learn how to use pipelines in OpenShift AI to automate the full AI/ML lifecycle on a single-node OpenShift instance.
Jupyter Notebook works with OpenShift AI to interactively classify images. In
LLM Compressor bridges the gap between model training and efficient deployment via quantization and sparsity, enabling cost-effective, low-latency inference.
Learn how to set up NVIDIA NIM on Red Hat OpenShift AI and how this benefits AI and data science workloads.
Learn how the dynamic accelerator slicer operator improves GPU resource management in OpenShift by dynamically adjusting allocation based on workload needs.
Get an introduction to AI function calling using Node.js and the LangGraph.js framework, now available in the Podman AI Lab extension.
This tutorial shows you how to use the Llama Stack API to implement retrieval-augmented generation for an AI application built with Node.js.
Learn about the Red Hat OpenShift AI model fine-tuning stack and how to run performance and scale validation.
Learn how NVIDIA GPUDirect RDMA over Ethernet enhances distributed model training performance and reduces communication bottlenecks in Red Hat OpenShift AI.