Self-Hinting Language Models Enhance Reinforcement Learning
Baohao Liao
baohao
AI & ML interests
NLP
Recent Activity
updated
a model about 2 hours ago
baohao/byt5-base-optim_clean_fold0_ep10bs2x16lr1e-4_best published
a model about 2 hours ago
baohao/byt5-base-optim_clean_fold0_ep10bs2x16lr1e-4_best updated
a model about 22 hours ago
baohao/byt5-base-optim_final_fold0_ep20bs32lr2e-4_best