In a Training Loop 🔄

1 12 19

Subarno Sadat Barno

barnobarno666

AI & ML interests

reinforming learning

Recent Activity

liked a dataset about 2 months ago

RUC-AIBOX/OlymMATH-eval

liked a dataset about 2 months ago

brando/olympiad-bench-imo-math-boxed-825-v2-21-08-2024

liked a model 2 months ago

Synthyra/ESM2-8M

View all activity

Organizations

liked 2 datasets about 2 months ago

RUC-AIBOX/OlymMATH-eval

Viewer • Updated May 11, 2025 • 579k • 80 • 4

brando/olympiad-bench-imo-math-boxed-825-v2-21-08-2024

Viewer • Updated Nov 6, 2024 • 1.65k • 46 • 5

liked a model 2 months ago

Synthyra/ESM2-8M

Fill-Mask • 7.52M • Updated 1 day ago • 864 • 2

liked 2 models 3 months ago

biomap-research/proteinglm-100b-int4

50B • Updated Mar 17, 2025 • 189 • 11

Adilbai/ppo-LunarLander-v2

Reinforcement Learning • Updated Jun 9, 2025 • 2

updated a model 3 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 43

published a model 3 months ago

barnobarno666/Whisper-medium-bangla

Automatic Speech Recognition • 0.8B • Updated Nov 23, 2025 • 43

upvoted a collection 4 months ago

Gemma 3 Release

Collection

28 items • Updated Aug 11, 2025 • 613

liked a Space 4 months ago

The Smol Training Playbook

📚

The secrets to building world-class LLMs

upvoted 2 papers 4 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6, 2025 • 23

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 181

liked a model 4 months ago

unsloth/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Jun 2, 2025 • 152k • 88

liked a model 5 months ago

unsloth/Qwen3-1.7B-Base-unsloth-bnb-4bit

Text Generation • Updated May 13, 2025 • 9.69k • 3

upvoted 2 papers 5 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 91

liked a model 5 months ago

unsloth/Qwen3-1.7B-unsloth-bnb-4bit

Text Generation • Updated May 13, 2025 • 45k • 12

liked 2 models 6 months ago

Qwen/Qwen3-1.7B-GPTQ-Int8

Text Generation • 2B • Updated May 21, 2025 • 855 • 7

JunHowie/Qwen3-0.6B-GPTQ-Int4

Text Generation • 0.6B • Updated Sep 3, 2025 • 234 • 1

upvoted a paper 6 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

upvoted a paper 10 months ago

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15, 2025 • 21

Subarno Sadat Barno

AI & ML interests

Recent Activity

Organizations

barnobarno666's activity

The Smol Training Playbook