Raymond Zhao
Raymond Zhao's contributions
Article
Batch inference on OpenShift AI with llm-d: Architecture, integration, and workflows
Lior Aronovich
+2
Learn about the llm-d batch gateway, a Kubernetes-native batch inference service that plugs into the same llm-d inference stack managed by Red Hat OpenShift AI.
Article
Batch inference on OpenShift AI with llm-d: Architecture, integration, and workflows
Lior Aronovich
+2
Learn about the llm-d batch gateway, a Kubernetes-native batch inference service that plugs into the same llm-d inference stack managed by Red Hat OpenShift AI.