Mark Kurtz
Mark Kurtz's contributions

Article
Deploy Llama 3 8B with vLLM
Mark Kurtz
Llama 3's advancements, particularly at 8 billion parameters, make AI more accessible and efficient.

Article
How well do quantized models handle long-context tasks?
Eldar Kurtić
+3
4-bit and 8-bit quantized LLMs excel in long-context tasks, retaining over 99% accuracy across 4K to 64K sequence lengths.