Mustafa Eyceoz

Mustafa Eyceoz's contributions

Red Hat AI
Article

Unsloth and Training Hub: Lightning-fast LoRA and QLoRA fine-tuning

Aditi Saluja +2

Learn how to fine-tune large language models in enterprise environments with Training Hub, an open source library for LLM post-training. Discover the benefits of LoRA and QLoRA using Unsloth, including reduced VRAM requirements and faster training times.

Featured image for model customization
Article

Post-training methods for language models

Mustafa Eyceoz +1

Dive into LLM post-training methods, from supervised fine-tuning and continual learning to parameter-efficient and reinforcement learning approaches.

Featured image for AI/ML
Article

Async-GRPO: Open, fast, and performant

Aldo Pareja +1

Discover Async-GRPO, a new library for reinforcement learning tasks that efficiently handles large models, eliminates bottlenecks, and accelerates experiments.

Featured image for AI/ML
Article

Granite, LIMO, and small LLM reasoning

Akash Srivastava +8

On reproducing R1-like reasoning in small LLMs: LIMO dataset ineffective for Llama/Granite; synthetic data generation shows promise but fine-tuning is tricky.