Nathan Habib's picture

Building on HF

Nathan Habib PRO

SaylorTwift

huggingface

·

AI & ML interests

Evals

Recent Activity

liked a model 3 days ago

baidu/ERNIE-Image

new activity 4 days ago

prithivMLmods/gemma-4-E2B-it-Uncensored-MAX:📊 Add SWE-bench evaluation results for princeton-nlp/SWE-bench_Verified

published an article 4 days ago

Stop benchmarking inference providers

View all activity

Organizations

SaylorTwift 's Spaces 23

Qwen3 8B

Inspect and browse server log files

Qwen2.5 0.5B Instruct Evals

Inspect and view log files in a web interface

Meta Llama 3.1 8b Cb

Inspect and explore log files in a web view

Transformers CB

View and explore server logs in a web interface

Leaderboard Dashboard

Archipelago Simple Task

Inspect and browse log files in a folder

Evasive Bench

Browse and view log files in a web interface

Aime 25

View and analyze log files with interactive inspection tools

Log Run 42

Test

Inspect

Space used to run inspect on hf-jobs

Long Horizon Execution

Display log files in a user-friendly format

EvalFlip - AI Benchmark Universe 🚀

Explore AI benchmarks for math, QA, and multitask understanding

Smollm3 Mmlu Pro

Display log files in a themed view

Inspect Bundle

Hf Providers Tool Calling Dashboard

Load and analyze BFCL results from JSON files

Lighteval Wandb

Visualize project metrics and runs

Wanddb

Visualize project metrics and runs

OpenEvalsDetails

View detailed model outputs for specific benchmarks

Lighteval Test

OpenEvalsModelDetails

A space to compare eval details between popular models

Mt Bench Viz No Compare

Mt Bench Viz