Charles Hernandez

Charles Hernandez's contributions

LLM Compressor v0.10 introduces Distributed Data Parallel (DDP) for faster compression, memory management, and advanced quantization formats. Make model compression workflows more efficient for large language models.

Explore the latest release of LLM Compressor, featuring attention quantization, MXFP4 support, AutoRound quantization modifier, and more.

Charles Hernandez

Charles Hernandez's contributions

LLM Compressor v0.10: Faster compression with distributed GPTQ

LLM Compressor 0.9.0: Attention quantization, MXFP4 support, and more

Platforms

Build

Quicklinks

Communicate

RED HAT DEVELOPER

Red Hat legal and privacy links

Red Hat legal and privacy links