NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems Paper • 2601.11004 • Published 11 days ago • 29
CREAM: Consistency Regularized Self-Rewarding Language Models Paper • 2410.12735 • Published Oct 16, 2024
Empowering Reliable Visual-Centric Instruction Following in MLLMs Paper • 2601.03198 • Published 21 days ago • 1
Empowering Reliable Visual-Centric Instruction Following in MLLMs Paper • 2601.03198 • Published 21 days ago • 1
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents Paper • 2511.02734 • Published Nov 4, 2025 • 22