arxiv:2604.18518
Haoge Deng
Bitterdhg
AI & ML interests
None yet
Recent Activity
authored a paper about 4 hours ago
UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models liked a model about 6 hours ago
Yovecents/URSA-1.7B-IBQ512-UDMGRPO-PickScore liked a model about 6 hours ago
Yovecents/URSA-1.7B-IBQ512-UDMGRPO-GenEval