Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, ML4Science

Organizations

None yet

Recent Activity

liked a dataset about 2 months ago

ILSVRC/imagenet-1k

Viewer • Updated Sep 17, 2025 • 1.43M • 94.4k • 745

liked a dataset 8 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 67.8k • 12

upvoted an article 8 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024 • 733

liked a dataset 10 months ago

mcherukara/PtychoNN_data

Updated Mar 18, 2025 • 109 • 2

liked 2 models 11 months ago

allenai/ACE2-ERA5

Updated 2 days ago • 70 • 15

microsoft/aurora

Updated Jun 20, 2025 • 50

upvoted an article 12 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

liked 3 Spaces about 1 year ago

Memory Viz

🧠

Memory Viz

Predict Memory

🧮

106

Calculate and visualize model memory usage from config

The Ultra-Scale Playbook

🌌

3.73k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 1 year ago

Article

Open-R1: Update #1

Feb 2, 2025 • 305

liked 2 datasets about 1 year ago

PleIAs/common_corpus

Viewer • Updated 20 days ago • 69.9k • 120k • 386

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 222k • 984

liked 3 models about 1 year ago

upvoted a collection about 1 year ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 159

liked a model about 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 1.29M • 1.01k

liked 2 Spaces about 1 year ago

TheWell

🌍

Visualization of data from the Well

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training
