Dipika Sikka

Dipika Sikka's contributions

The LLM Compressor 0.8.0 release introduces quantization workflow enhancements, extended support for Qwen3 models, and improved accuracy recovery.

LLM Compressor 0.7.0 release recap

Dipika Sikka +3

August 25, 2025

LLM Compressor 0.7.0 brings Hadamard transforms for better accuracy, mixed-precision FP4/FP8, and calibration-free block quantization for efficient compression.

Discover how to deploy compressed, fine-tuned models for efficient inference with the new Axolotl and LLM Compressor integration.

Optimize model inference and reduce costs by applying model compression techniques such as quantization and pruning with LLM Compressor on Red Hat OpenShift AI.

Explore multimodal model quantization in LLM Compressor, a unified library for optimizing models for deployment with vLLM.



LLM Compressor 0.8.0: Extended support for Qwen3 and more

LLM Compressor 0.7.0 release recap

Axolotl meets LLM Compressor: Fast, sparse, open

Optimize LLMs with LLM Compressor in Red Hat OpenShift AI

Multimodal model quantization support through LLM Compressor
