Fynn Schmitt-Ulms

Fynn Schmitt-Ulms's contributions

Featured image for vLLM interference article.

Speculators v0.5.0 introduces DFlash support, enabling single-pass draft token generation with block diffusion for more efficient speculative decoding workflows. The release also adds unified online and offline training through vLLM’s native hidden states extraction system, improving training flexibility, version stability, and production readiness.

Featured image for Speculators blog

Speculators standardizes speculative decoding for large language models, with a unified Hugging Face format, vLLM integration, and more.

Featured image for AI/ML content on Red Hat Developer.

Explore the evolving LLM post-training datasets, the various formats, and transformation process from structured datasets into token sequences.