-
-
-
-
-
-
Inference Providers
Active filters: Reward
Text Classification
• 2B • Updated
• 1
• 2
mradermacher/SmolTulu-1.7b-RM-GGUF
2B • Updated
• 211
mradermacher/SmolTulu-1.7b-RM-i1-GGUF
2B • Updated
• 81
Teen-Different/squiral_maze
Reinforcement Learning
• Updated
Text Classification
• Updated
• 21
• 9
Text Classification
• Updated
• 8
• 1
Text Classification
• Updated
• 103
• 25
Text Classification
• Updated
• 17
• 5
wangclnlp/GRAM-RR-LLaMA-3.1-8B-RewardModel
Text Generation
• 8B • Updated
• 4
• 2
wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel
Text Generation
• 3B • Updated
• 1
mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-GGUF
3B • Updated
• 75
mradermacher/GRAM-RR-LLaMA-3.2-3B-RewardModel-i1-GGUF
3B • Updated
• 38
mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-GGUF
8B • Updated
• 61
• 1
mradermacher/GRAM-RR-LLaMA-3.1-8B-RewardModel-i1-GGUF
8B • Updated
• 237
• 1