LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Paper • 2603.01068 • Published 3 days ago • 18
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 5 days ago • 53
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 2 days ago • 120
Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper Paper • 2511.04583 • Published Nov 6, 2025 • 5
jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published 15 days ago • 26
IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering Paper • 2602.17687 • Published 27 days ago • 1
Article A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 • 5 days ago • 10
PyVision-RL: Forging Open Agentic Vision Models via RL Paper • 2602.20739 • Published 8 days ago • 29
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 8 days ago • 89
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 6 days ago • 82
Article Alyah ⭐️: Toward Robust Evaluation of Emirati Dialect Capabilities in Arabic LLMs • Jan 27 • 24
Open Legal Data Collection A collection of our favorite open-source legal datasets on Hugging Face. • 14 items • Updated 1 day ago • 6
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI • 13 days ago • 474
Tri Series Collection Introducing our new series of models: Tri-7B, Tri-21B, and Tri-70B-preview-SFT • 12 items • Updated 12 days ago • 11