Christopher Nuland
Christopher Nuland's contributions
Article
Introduction to distributed inference with llm-d
Christopher Nuland
Learn how the llm-d project enables distributed, efficient, and scalable LLM inference across Kubernetes clusters.
Article
Master KV cache aware routing with llm-d for efficient AI inference
Christopher Nuland
Learn how llm-d's KV cache aware routing reduces latency and improves throughput by directing requests to pods that already hold relevant context in GPU memory.