Alberto Perdomo

GitHub

Alberto Perdomo's contributions

This performance analysis compares KServe's SLO-driven KEDA autoscaling approach against Knative's concurrency-based autoscaling for vLLM inference.

Walk through setting up KServe autoscaling with vLLM, KEDA, and the custom metrics autoscaler operator in Open Data Hub.

Learn about the Red Hat OpenShift AI model fine-tuning stack and how to run performance and scale validation.