Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
InosLihka
/
rhythm_env
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
rhythm_env
/
docs
113 kB
Ctrl+K
Ctrl+K
3 contributors
History:
25 commits
InosLihka
Add SFT v3 + GRPO refine results to README + results.md
666b4ce
about 8 hours ago
references
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
3 days ago
WhatMAkesSubmissionStandOut.md
Safe
3.2 kB
Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves
3 days ago
architecture.md
Safe
40.4 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
3 days ago
entity_definitions.md
Safe
9.46 kB
Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline
3 days ago
environment_design.md
Safe
6 kB
docs: reorganize β 25 files β 4 focused docs
4 days ago
iterations.md
Safe
20 kB
Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses
2 days ago
results.md
Safe
8.81 kB
Add SFT v3 + GRPO refine results to README + results.md
about 8 hours ago
training.md
Safe
4.84 kB
docs: reorganize β 25 files β 4 focused docs
4 days ago