Red Hat AI

Video Thumbnail
Video

Red Hat Dan on Tech: Episode 15 - AI Code Reviews: It's Sourcery to us

Eric Curtin

Welcome back to Red Hat Dan on Tech, where Senior Distinguished Engineer Dan Walsh dives deep on all things technical, from his expertise in container technologies with tools like Podman and Buildah, to runtimes, Kubernetes, AI, and SELinux! In this episode, Eric Curtin joins to discuss Sorcery AI, a new AI code review tool that has been helping to find bugs, review PR's and much more!

Video Thumbnail
Video

Red Hat Dan on Tech: Episode 17 - Your Data + AI with RamaLama RAG

Brian Mahabir

Welcome back to Red Hat Dan on Tech, where Senior Distinguished Engineer Dan Walsh dives deep on all things technical, from his expertise in container technologies with tools like Podman and Buildah, to runtimes, Kubernetes, AI, and SELinux! In this episode, you'll see a live demo on Ramalama's new RAG capability, allowing you to use your unique data with a local LLM. Learn More: https://developers.redhat.com/articles/2025/04/03/simplify-ai-data-integration-ramalama-and-rag5.

Video Thumbnail
Video

The Llama Stack Tutorial: Episode Two - Getting Started with Llama Stack

Cedric Clyburn

Building AI applications is more than just running a model — you need a consistent way to connect inference, agents, storage, and safety features across different environments. That’s where Llama Stack comes in. In this second episode of The Llama Stack Tutorial Series, Cedric (Developer Advocate @ Red Hat) walks through how to:- Run Llama 3.2 (3B) locally and connect it to Llama Stack- Use the Llama Stack server as the backbone for your AI applications- Call REST APIs for inference, agents, vector databases, guardrails, and telemetry- Test out a Python app that talks to Llama Stack for inferenceBy the end of the series, you’ll see how Llama Stack gives developers a modular API layer that makes it easy to start building enterprise-ready generative AI applications—from local testing all the way to production. In the next episode, we'll use Llama Stack to chat with your own data (PDFs, websites, and images) with local models.🔗 Explore MoreLlama Stack GitHub: https://github.com/meta-llama/llama-stackDocs: https://llama-stack.readthedocs.io5.

Video Thumbnail
Video

The Llama Stack Tutorial: Episode One - What is Llama Stack?

Cedric Clyburn

AI applications are moving fast—but building them at scale is hard. Local prototypes often don’t translate to production, and every environment seems to require a different setup. Llama Stack, an open-source framework from Meta, was created to bring consistency and modularity to generative AI applications. In this first episode of The Llama Stack Tutorial Series, Cedric (Developer Advocate @ Red Hat) explains what Llama Stack is, why it’s being compared to Kubernetes for the AI world, key building blocks, and future episodes that'll dive into real-world use cases with Llama Stack. Explore MoreLlama Stack Tutorial (what we'll be following during the series): https://rh-aiservices-bu.github.io/llama-stack-tutorial Llama Stack GitHub: https://github.com/meta-llama/llama-stackDocs: https://llama-stack.readthedocs.io5.

Video Thumbnail
Video

Enhancing generative AI with InstructLab for accessible model fine-tuning

Red Hat Developers

The rise of large language models (LLMs) has opened up exciting possibilities for developers looking to build intelligent applications. However, the process of adapting these models to specific use cases can be difficult, requiring deep expertise and substantial resources. In this talk, we'll introduce you to InstructLab, an open-source project that aims to make LLM tuning accessible to developers and data scientists of all skill levels, on consumer-grade hardware.

In this video, we'll explore how InstructLab's innovative approach combines collaborative knowledge curation, efficient data generation, and instruction training to enable developers to refine foundation models for specific use cases. Through a live demonstration, you'll learn how IBM Research has partnered with Red Hat to simplify the process of enhancing LLMs with new knowledge and skills for targeted applications. Join us to explore how InstructLab is making LLM tuning more accessible, empowering developers to harness the power of AI in their projects.

Video Thumbnail
Video

Workflow: How to create an issue on GitHub

Red Hat Developers

Found a bug? Have new features would like to propose? You don't want to miss this tutorial on how to create them on GitHub in the community. In this video, we will be walking through how to create a GitHub issue to report bugs or suggest any features you would like to propose.

Video Thumbnail
Video

Red Hat empowers Developers

Red Hat Developers

Red Hat empowers Developers. Wherever you are, whoever you are, it's your innovations that drive us to go bigger and build better, but we know there's only so much one developer can do. That's why it's our mission to bring you together, to create a community where you can learn new skills, get inspired, and create incredible ideas. We are here to empower you.

Video Thumbnail
Video

Welcome to Red Hat Developer

Red Hat Developers

Welcome to Red Hat Red Hat Developer brings developers together to learn from each other and create more extraordinary things, faster. We serve the builders. Those who solve problems and create their careers with code. We chart a course for you, giving your career a path and your work purpose. We share what we know to help you solve problems once, build momentum together, and make the world better for all.

Featured image for Red Hat OpenShift AI.
Article

Optimize GPU utilization with Kueue and KEDA

Christian Zaccaria

As GPU demand grows, idle time gets expensive. Learn how to efficiently manage AI workloads on OpenShift AI with Kueue and the custom metrics autoscaler.

Featured image for LLM Compressor 0.7.0 release blog.
Article

LLM Compressor 0.7.0 release recap

Dipika Sikka +3

LLM Compressor 0.7.0 brings Hadamard transforms for better accuracy, mixed-precision FP4/FP8, and calibration-free block quantization for efficient compression.

Featured image for AI/ML
Article

How to enhance Agent2Agent (A2A) security

Florencio Cano Gabarda

The Agent2Agent (A2A) protocol is an open standard enabling seamless communication between AI agents. Here are the key things to know before getting started.

Featured image for LLM Compressor.
Article

Optimizing generative AI models with quantization

James Harmison

Learn how to optimize LLMs like Granite 3.3 for better performance and efficiency on a single server by using open source tools like LLM Compressor and vLLM.