Kai Xu

Kai Xu's contributions

Featured image for AI/ML
Article

Granite, LIMO, and small LLM reasoning

Akash Srivastava +8

On reproducing R1-like reasoning in small LLMs: LIMO dataset ineffective for Llama/Granite; synthetic data generation shows promise but fine-tuning is tricky.

Featured image for AI/ML
Article

How particle filtering makes small LLMs think big

Akash Srivastava +8

An update on reproducing R1-like reasoning in small LLMs: Granite models show big gains with particle filtering, outperforming GPT-4o on benchmarks.