Exploring Next

Exploring Next — Ep 188 w/ Justy & Cody — DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning

Yicheng Chen 1,2, Zerun Ma 2, Xinchen Xie 2, Yining Li 2†, Kai Chen 2†
1 Fudan University
2 Shanghai AI Laboratory

GitHub: https://github.com/yichengchen24/DataChef

Abstract

In the current landscape of Large Language Models (LLMs), the curation of large-scale, high-quality training data is a primary driver of model performance. A key lever is the data recipe, which comprises a data processing pipeline to transform raw sources into training corpora.
