Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
4
22
Salman Rahman
PRO
salmannyu
Follow
Tasninmitu's profile picture
Simonlee711's profile picture
2 followers
·
6 following
https://salmanrahman.net/
AI & ML interests
Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation
Recent Activity
updated
a model
about 11 hours ago
salmannyu/model-checkpoints
published
a model
about 11 hours ago
salmannyu/model-checkpoints
authored
a paper
10 days ago
When Can LLMs Learn to Reason with Weak Supervision?
View all activity
Organizations
salmannyu
's models
24
Sort: Recently updated
salmannyu/model-checkpoints
Updated
about 11 hours ago
salmannyu/llama_base_thinking_sft_noisy_reward_0_9
Updated
Apr 15
salmannyu/llama_base_thinking_sft_majority_vote_math_1024_sample_8k
Updated
Apr 12
salmannyu/mid_train_llama_52b_thinking_data_effect_math_8_sample
Updated
Mar 30
salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.7_sample
Updated
Mar 30
salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.9_sample
Updated
Mar 30
salmannyu/mid_train_llama_52b_thinking_majority_vote_math_1024_sample
Updated
Mar 30
salmannyu/mid_train_llama_52b_thinking_data_effect_math_2048_sample
Updated
Mar 30
salmannyu/data_effect_scp_do_llama_3b_2048_sample
Updated
Mar 30
salmannyu/data_effect_scp_do_llama_3b_8_sample
Updated
Mar 30
salmannyu/data_effect_math_do_llama_3b_8_sample
Updated
Mar 30
salmannyu/data_effect_math_do_qwen_1_5b_8_sample
Updated
Mar 30
salmannyu/Llama-3B-Nemotron-mid-think_sft_nopack_lr1.5e5_ep3
3B
•
Updated
Mar 22
•
3
salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step150
4B
•
Updated
Mar 14
•
1
salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step100
4B
•
Updated
Mar 14
salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3
3B
•
Updated
Mar 6
•
5
salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-nopack-lr1.5e5-ep3
3B
•
Updated
Mar 6
•
4
salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full
Text Generation
•
3B
•
Updated
Mar 2
•
1
salmannyu/Llama-3B-Nemotron-Math-Mid-Train-140K-Step
3B
•
Updated
Feb 25
•
2
salmannyu/Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8
Text Generation
•
2B
•
Updated
Feb 8
•
2
•
salmannyu/nemotron-train8-52B-Token
2B
•
Updated
Nov 8, 2025
•
3
salmannyu/nemotron-train4
2B
•
Updated
Nov 3, 2025
salmannyu/train3
2B
•
Updated
Nov 3, 2025
salmannyu/nemotron-train2
2B
•
Updated
Nov 3, 2025
•
1