A collection of reasoning tasks to benchmark model abilities
Sara Candussio
saracandu
AI & ML interests
Reasoning in Large Language Models
Recent Activity
published
a dataset
about 1 hour ago
saracandu/neuroutputs
updated
a model
4 days ago
saracandu/stlenc-distilled-v2
updated
a dataset
5 days ago
saracandu/GSM-Plus-modified