William Caban Babilonia

William Caban Babilonia's contributions

Discover how to configure EvalHub evaluation collections on Red Hat AI. Run Lighteval, Garak, and GuideLLM in parallel for a unified LLM pass/fail verdict.

Discover how to use EvalHub and OCI persistence to make your AI evaluation results immutable, content-addressable, and fully auditable.

Add automated AI evaluations to your CI/CD pipeline

William Caban Babilonia +2

June 11, 2026

Learn how to use the EvalHub CLI to automate AI evaluations in your CI/CD pipelines. Install the SDK, configure profiles, and set up a production gate.

Bring your own evaluation framework to EvalHub

William Caban Babilonia +2

June 9, 2026

Learn how to onboard a custom evaluation framework into EvalHub using one class, one method, and a container image. This guide covers the contract, data structures, and a complete minimal adapter.

Understanding evaluation collections in EvalHub

William Caban Babilonia +2

June 4, 2026

Learn how to read an existing system collection, understand its threshold logic, and build your own collection that encodes your actual measurement strategy with thresholds that mean something.

Evaluation-driven development with EvalHub

William Caban Babilonia +1

June 2, 2026

Learn how evaluation-driven development (EDD) turns AI optimization from an art into an engineering discipline with EvalHub.

Learn about the five primary structural challenges in enterprise AI evaluation and how EvalHub addresses them with a unified foundation for AI evaluation.

Learn how Red Hat AI 3.4 uses EvalHub to orchestrate AI evaluations on Kubernetes. Scale frameworks like Garak and Lighteval with built-in MLflow tracking.
