Salman Rahman's picture

Salman Rahman PRO

salmannyu

·

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

updated a model about 11 hours ago

salmannyu/model-checkpoints

published a model about 11 hours ago

salmannyu/model-checkpoints

authored a paper 10 days ago

When Can LLMs Learn to Reason with Weak Supervision?

View all activity

Organizations

salmannyu 's models 24

salmannyu/model-checkpoints

Updated about 11 hours ago

salmannyu/llama_base_thinking_sft_noisy_reward_0_9

salmannyu/llama_base_thinking_sft_majority_vote_math_1024_sample_8k

salmannyu/mid_train_llama_52b_thinking_data_effect_math_8_sample

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.7_sample

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.9_sample

salmannyu/mid_train_llama_52b_thinking_majority_vote_math_1024_sample

salmannyu/mid_train_llama_52b_thinking_data_effect_math_2048_sample

salmannyu/data_effect_scp_do_llama_3b_2048_sample

salmannyu/data_effect_scp_do_llama_3b_8_sample

salmannyu/data_effect_math_do_llama_3b_8_sample

salmannyu/data_effect_math_do_qwen_1_5b_8_sample

salmannyu/Llama-3B-Nemotron-mid-think_sft_nopack_lr1.5e5_ep3

3B • Updated Mar 22 • 3

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step150

4B • Updated Mar 14 • 1

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step100

4B • Updated Mar 14

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3

3B • Updated Mar 6 • 5

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-nopack-lr1.5e5-ep3

3B • Updated Mar 6 • 4

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full

Text Generation • 3B • Updated Mar 2 • 1

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-140K-Step

3B • Updated Feb 25 • 2

salmannyu/Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8

Text Generation • 2B • Updated Feb 8 • 2 •

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8, 2025 • 3

salmannyu/nemotron-train4

2B • Updated Nov 3, 2025

salmannyu/train3

2B • Updated Nov 3, 2025

salmannyu/nemotron-train2

2B • Updated Nov 3, 2025 • 1