Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
12
11
Lior Baruch
LBK95
Follow
Gargaz's profile picture
21world's profile picture
2 followers
·
2 following
Lior-Baruch
AI & ML interests
DL
Recent Activity
updated
a model
about 20 hours ago
LBK95/GRPO_Oracle_Llama32-1B-Instruct_LA5_G4_V2
updated
a model
2 days ago
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V2
updated
a model
3 days ago
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V1
View all activity
Organizations
None yet
LBK95
's models
132
Sort: Recently updated
LBK95/GRPO_Oracle_Llama32-1B-Instruct_LA5_G4_V2
Updated
about 16 hours ago
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V2
Updated
1 day ago
LBK95/GRPO_Oracle_Llama32-1B_LA5_G4_V1
Updated
3 days ago
LBK95/GRPO-OracleRM_Q1Q2_V1-adapter-v1
Updated
8 days ago
LBK95/grpo-OracleRM_Async_4responses_V1-adapter-v1
Updated
10 days ago
LBK95/grpo-OracleRM_Async_4responses_V1
Updated
13 days ago
LBK95/grpo-OracleReward_Async_2responses_V1
Updated
26 days ago
LBK95/grpo-OracleReward_Async_V1
Updated
26 days ago
LBK95/grpo-OracleReward_V1
Updated
27 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.15
Updated
29 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.14
Updated
29 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.13
Updated
29 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.12
Updated
29 days ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.11
Updated
about 1 month ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.10
Updated
about 1 month ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.9
Updated
about 1 month ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.8
Updated
about 1 month ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.7
Updated
about 1 month ago
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.6
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.5
Updated
Jan 4
LBK95/Llama-3.2-1B-Instruct-Reward-Model-Finetuned_V1.4
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.4
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.3
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.2
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1.1
Updated
Jan 4
LBK95/Llama-3.2-1B-Reward-Model-Finetuned_V1
Updated
Jan 2
LBK95/grpo-smoketest-ultrachat-LengthReward
Updated
Jan 1
LBK95/grpo-smoketest-Instruct-Skywork-Reward
Updated
Dec 31, 2025
LBK95/grpo-smoketest-Skywork-Reward
Updated
Dec 31, 2025
LBK95/Llama-3.2-1B-hf_PPO-LookAhead-5_V1_Second_beta-0
Text Generation
•
Updated
Dec 28, 2025
•
2
Previous
1
2
3
...
5
Next