InfoSynth: Information-Guided Benchmark Synthesis for LLMs Paper • 2601.00575 • Published 10 days ago • 2
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning Paper • 2512.16909 • Published 25 days ago • 1
ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle Paper • 2507.12674 • Published Jul 16, 2025
Efficient and Scalable Estimation of Tool Representations in Vector Space Paper • 2409.02141 • Published Sep 2, 2024
Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions Paper • 2502.16761 • Published Feb 24, 2025
Virtual Personas for Language Models via an Anthology of Backstories Paper • 2407.06576 • Published Jul 9, 2024 • 1
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks Paper • 2503.09572 • Published Mar 12, 2025 • 2
Higher-Order Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions Paper • 2504.11673 • Published Apr 16, 2025 • 1
Neural Spectral Methods: Self-supervised learning in the spectral domain Paper • 2312.05225 • Published Dec 8, 2023
Scaling physics-informed hard constraints with mixture-of-experts Paper • 2402.13412 • Published Feb 20, 2024
Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields Paper • 2503.08674 • Published Mar 11, 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning Paper • 2504.18904 • Published Apr 26, 2025 • 9