AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 9 days ago • 31
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 4 days ago • 69
Helios: Real Real-Time Long Video Generation Model Paper • 2603.04379 • Published 3 days ago • 132
InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions Paper • 2603.03646 • Published 3 days ago • 7
DreamWorld: Unified World Modeling in Video Generation Paper • 2603.00466 • Published 7 days ago • 13
RubricBench: Aligning Model-Generated Rubrics with Human Standards Paper • 2603.01562 • Published 5 days ago • 51
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use Paper • 2603.03205 • Published 4 days ago • 11
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 9 days ago • 86
LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 26 days ago • 68
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 64
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning Paper • 2602.10090 • Published 25 days ago • 51
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining Paper • 2602.07085 • Published 29 days ago • 187
Self-Improving World Modelling with Latent Actions Paper • 2602.06130 • Published 30 days ago • 30
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models Paper • 2602.04804 • Published about 1 month ago • 46
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing Paper • 2602.03560 • Published Feb 3 • 45