Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Alan Blanchet's picture

Alan Blanchet

Alanox

21world's profile picture

laclouis5's profile picture

·

https://alan-blanchet.fr/

AlanBlanchet

AI & ML interests

None yet

Organizations

Alanox 's collections 1

LLM Evaluation Benchmarks

This collection is here is make references to the evaluation benchmarks we see in traditional LLM papers

Running on CPU Upgrade

245

MMLU-Pro Leaderboard

🥇

245

More advanced and challenging multi-task evaluation
Running on CPU Upgrade

597

GAIA Leaderboard

🦾

597

Submit your model answers to GAIA benchmark and view leaderboard

LLM Evaluation Benchmarks

This collection is here is make references to the evaluation benchmarks we see in traditional LLM papers

Running on CPU Upgrade

245

MMLU-Pro Leaderboard

🥇

245

More advanced and challenging multi-task evaluation
Running on CPU Upgrade

597

GAIA Leaderboard

🦾

597

Submit your model answers to GAIA benchmark and view leaderboard

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs