Recent Activity

qgallouedec updated a Space 1 day ago
trl-lib/diff-view
qgallouedec updated a Space 1 day ago
trl-lib/trl-v1-pypi
qgallouedec updated a Space 1 day ago
trl-lib/stack-llama

trl-lib's collections (7)

Comparing DPO with IPO and KTO
A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO.