Skip to content

XxFChen/awesome-reinforcement-fine-tuning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

awesome-reinforcement-fine-tuning

Awesome Reinforcement Fine Tuning

This repository will be continuously updated with articles related to reinforcement fine-tuning. 🌟🌟🌟

Inference Time Scaling


  1. LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

    Guowei Xu, Peng Jin, Hao Li, Yibing Song, Lichao Sun, Li Yuan [paper] 2024.11

Dataset


  1. LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

    Guowei Xu, Peng Jin, Hao Li, Yibing Song, Lichao Sun, Li Yuan [paper] 2024.11

Releases

No releases published

Packages

No packages published