·
AI & ML interests
RL for LLMs/CodeLLMs
Organizations
reshinthadith/math12k-stage3
Viewer
• Updated • 6k • 12
reshinthadith/math12k-stage2
Viewer
• Updated • 4k • 22
reshinthadith/math12k-stage1
Viewer
• Updated • 2k • 18
reshinthadith/the-stack-mujoco-xml
Viewer
• Updated • 48.3k • 13
• 1
reshinthadith/WizardLM_evol_instruct_V2_code_filtered
Viewer
• Updated • 138k • 16
• 1
reshinthadith/basic_code_ppl_eval
Viewer
• Updated • 8.73k • 175
• 4
Updated • 6
reshinthadith/2048_has_code_filtered_base_code_review_python_based_on_property
Viewer
• Updated • 6.4k • 10
reshinthadith/2048_has_code_filtered_base_code_review_python
Viewer
• Updated • 6.4k • 8
reshinthadith/dfg_augmented_mbpp
Viewer
• Updated • 95 • 27
reshinthadith/pairwise-code-review-instruct-critique-revision-python
Viewer
• Updated • 5.24k • 33
• 9
reshinthadith/synthetic_program_synthesis_python_1M
Viewer
• Updated • 654k • 87
• 5
reshinthadith/trial-upload
Updated • 7