Spaces: InosLihka/rhythm_env
rhythm_env / docs
113 kB
  • 3 contributors
History: 25 commits
InosLihka
Add SFT v3 + GRPO refine results to README + results.md
666b4ce about 8 hours ago
  • references
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 3 days ago
  • WhatMAkesSubmissionStandOut.md
    3.2 kB
    Add plots/ folder: SFT v3 loss + GRPO iter2 reward curves 3 days ago
  • architecture.md
    40.4 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 3 days ago
  • entity_definitions.md
    9.46 kB
    Algorithm Distillation: grader v2 with belief_accuracy + SFT pipeline 3 days ago
  • environment_design.md
    6 kB
    docs: reorganize — 25 files → 4 focused docs 4 days ago
  • iterations.md
    20 kB
    Refactor grader to use openenv.core.rubrics.WeightedSum + Rubric subclasses 2 days ago
  • results.md
    8.81 kB
    Add SFT v3 + GRPO refine results to README + results.md about 8 hours ago
  • training.md
    4.84 kB
    docs: reorganize — 25 files → 4 focused docs 4 days ago