Alexandre Marques
Alexandre Marques's contributions
Article
How well do quantized models handle long-context tasks?
Eldar Kurtić
+3
4-bit and 8-bit quantized LLMs excel in long-context tasks, retaining over 99% accuracy across 4K to 64K sequence lengths.

Article
How well do quantized models handle long-context tasks?
Eldar Kurtić
+3
4-bit and 8-bit quantized LLMs excel in long-context tasks, retaining over 99% accuracy across 4K to 64K sequence lengths.