HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. • 160 items • Updated 1 day ago • 2
✨ free demo spaces Collection HF Spaces for demoing chat completion models—no ZeroGPU, WebGPU, or BYOK included. Thank you so much to these devs! • 15 items • Updated 10 days ago • 2
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling Paper • 2603.06199 • Published 5 days ago • 9
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are about improving tiny models. • 68 items • Updated 8 days ago • 9
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs Paper • 2512.01797 • Published Dec 1, 2025 • 9
TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents Paper • 2602.19633 • Published 16 days ago • 7
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency Paper • 2602.16745 • Published 22 days ago • 8
Benchmark Test-Time Scaling of General LLM Agents Paper • 2602.18998 • Published 18 days ago • 8
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models Paper • 2602.20309 • Published 16 days ago • 16
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 15 days ago • 30
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 27 days ago • 56
TINY MODELS WITH BIG INTELLIGENCE Collection Tiny (<30B) models that tend to outperform their same-parameter counterparts. • 15 items • Updated 9 days ago • 3