Red Hat Developer Blog
Here's our most recent blog content. Explore our featured monthly resource as well as our most recently published items. Don't miss the chance to learn more about our contributors.
View all blogs & articles
Llama 3's advancements, particularly at 8 billion parameters, make AI more...
Learn about Marlin, a mixed-precision matrix multiplication kernel that...
4-bit and 8-bit quantized LLMs excel in long-context tasks, retaining over...
Sparse fine-tuning in combination with sparsity-aware inference software,...
Compress large language models (LLMs) with SparseGPT to make your machine...