James Harmison

James Harmison's contributions

Featured image for LLM Compressor.
Article

Optimizing generative AI models with quantization

James Harmison

Learn how to optimize LLMs like Granite 3.3 for better performance and efficiency on a single server by using open source tools like LLM Compressor and vLLM.