Better Models, Faster Training: Sigmoid Attention for single-cell Foundation Models Paper β’ 2604.27124 β’ Published 8 days ago β’ 5
T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning Paper β’ 2605.02178 β’ Published 3 days ago β’ 5
PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments Paper β’ 2605.02240 β’ Published 3 days ago β’ 6
AcademiClaw: When Students Set Challenges for AI Agents Paper β’ 2605.02661 β’ Published 3 days ago β’ 8
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling Paper β’ 2604.28075 β’ Published 7 days ago β’ 16
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. β’ 169 items β’ Updated 1 day ago β’ 3
Large Language Models Explore by Latent Distilling Paper β’ 2604.24927 β’ Published 10 days ago β’ 71 β’ 7
Large Language Models Explore by Latent Distilling Paper β’ 2604.24927 β’ Published 10 days ago β’ 71
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 6 days ago β’ 17
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 6 days ago β’ 17
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 6 days ago β’ 17
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper β’ 2509.24372 β’ Published Sep 29, 2025 β’ 14
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper β’ 2502.05171 β’ Published Feb 7, 2025 β’ 155
SelfCodeAlign: Self-Alignment for Code Generation Paper β’ 2410.24198 β’ Published Oct 31, 2024 β’ 25
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 6 days ago β’ 17