Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Jason Wei
JWei05
Follow
0 followers
·
1 following
AI & ML interests
RL, LLMs, DL Theory
Recent Activity
updated
a model
about 16 hours ago
JWei05/dapo-gemma3-12b-pt
published
a model
about 16 hours ago
JWei05/dapo-gemma3-12b-pt
updated
a model
about 17 hours ago
JWei05/gemma3-4b-pt-upscaled-52l-sft-nemotron-cascade2-16k
View all activity
Organizations
models
13
Sort: Recently updated
JWei05/dapo-gemma3-12b-pt
Updated
42 minutes ago
JWei05/dapo-gemma3-4b-pt-upscaled-64l-lr1e7-clip05-warmup50
Updated
about 2 hours ago
JWei05/dapo-gemma3-27b-pt
Updated
about 4 hours ago
JWei05/gemma3-4b-pt-upscaled-52l-sft-nemotron-cascade2-16k
Updated
about 17 hours ago
JWei05/gemma3-4b-pt-upscaled-46l-sft-nemotron-cascade2-16k
Updated
about 17 hours ago
JWei05/gemma3-4b-pt-upscaled-40l-sft-nemotron-cascade2-16k
Updated
about 18 hours ago
JWei05/gemma3-4b-pt-sft-nemotron-cascade2-16k
Updated
about 19 hours ago
JWei05/dapo-gemma3-27b-it
Updated
1 day ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27b-pt-base
Updated
5 days ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
Updated
6 days ago
View 13 models
datasets
41
Sort: Recently updated
JWei05/Nemotron-Cascade-2-SFT-Data-16k-subset
Viewer
•
Updated
about 19 hours ago
•
16.5k
•
5
JWei05/Nemotron-Cascade-2-SFT-Data-9k-subset
Viewer
•
Updated
about 22 hours ago
•
9k
•
9
JWei05/DAPO-OpenMathInstruct2-34k
Viewer
•
Updated
1 day ago
•
34.8k
•
9
JWei05/DAPO-Gemma3-27B-PT-warmup20-step80-SFT-Data
Viewer
•
Updated
6 days ago
•
34.8k
•
28
JWei05/DAPO-Gemma4-31B-IT-SFT-Data
Viewer
•
Updated
8 days ago
•
34.8k
•
19
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct
Viewer
•
Updated
9 days ago
•
41.8k
•
26
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data
Viewer
•
Updated
9 days ago
•
69.6k
•
30
JWei05/swe_smith_py_qwen3.5_35b_trajs_1952
Viewer
•
Updated
13 days ago
•
2k
•
56
JWei05/swe_smith_rs_qwen3.5_35b_trajs_2477
Viewer
•
Updated
13 days ago
•
5k
•
47
JWei05/swe_smith_go_qwen3.5_35b_trajs_1448
Viewer
•
Updated
13 days ago
•
1.63k
•
42
View 41 datasets