In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

liked a Space 18 minutes ago

Chunte/Thumbnail-Crafter.mini

upvoted 2 articles 6 days ago

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

Article • 7 days ago • 25

One Year Since the “DeepSeek Moment”

Article • 7 days ago • 33

upvoted an article 13 days ago

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Article • 22 days ago • 37

upvoted a paper 18 days ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published 22 days ago • 102

upvoted a paper 19 days ago

Recursive Language Models

Paper • 2512.24601 • Published 27 days ago • 77

upvoted a paper 26 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published 29 days ago • 26

upvoted 3 articles about 1 month ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • Dec 18, 2025 • 116

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • Dec 9, 2025 • 82

Shadow AI - Where are the CIOs?

Article • Dec 19, 2025 • 31

upvoted a collection about 1 month ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 7 days ago • 48

upvoted an article about 1 month ago

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Article • Dec 15, 2025 • 106

upvoted a paper about 2 months ago

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

Paper • 2508.09726 • Published Aug 13, 2025 • 15

upvoted 2 articles about 2 months ago

Yay! Organizations can now publish blog Articles

Article • Jan 20, 2025 • 53

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 582

upvoted 3 papers about 2 months ago

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 9

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 23

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 16

upvoted an article about 2 months ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 285

upvoted a paper 2 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 139

upvoted an article 2 months ago

Continuous batching from first principles

Article • Nov 25, 2025 • 311