Awesome Reinforcement Fine-Tuning
This repository will be continuously updated with articles related to reinforcement fine-tuning. 🌟🌟🌟
- LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
  Guowei Xu, Peng Jin, Hao Li, Yibing Song, Lichao Sun, Li Yuan [paper] 2024.11