Skip to content

Training DPO with pre-computed reference scores #1603

Training DPO with pre-computed reference scores

Training DPO with pre-computed reference scores #1603

Build wheels (pt2.3.0, py3.10, linux-x86_64, cpu, nosan)  /  Build

succeeded Jan 8, 2025 in 4m 4s