Edoardo Vacchi's contributions
Article
Intelligent inference scheduling with llm-d on Red Hat AI
Madhu Goutham Reddy Ambati
+1
Learn how llm-d routes each inference request to the GPU that already has the relevant data cached, cutting time-to-first-token and doubling throughput without changing hardware. Discover how Red Hat's stack packages this neatly into a single Kubernetes resource.
Article
Quarking Drools: How we turned a 13-year-old Java project into a first-class serverless component
Mario Fusco
+1
Updating Drools, the world's most popular open source rule engine, to make it part of the cloud and serverless revolution.