David Limpus

TheRealPilot638

AI & ML interests

HW/SW Co-design for efficient AI inference & training | RL & Control

Recent Activity

upvoted a collection 18 days ago

TraDo Series

upvoted a paper 21 days ago

Recursive Language Models

upvoted a paper 21 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

upvoted a collection 18 days ago

TraDo Series

Collection

SOTA Diffusion Large Language Models • 5 items • Updated Sep 11, 2025 • 13

upvoted 2 papers 21 days ago

Recursive Language Models

Paper • 2512.24601 • Published about 1 month ago • 78

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 22 days ago • 211

upvoted an article about 1 month ago

Article

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Jun 28, 2025

•

updated a dataset 6 months ago

TheRealPilot638/Llama-3.1-8B-PRM-Skywork-llama-o1-Math500

Updated Jul 20, 2025

published a dataset 6 months ago

TheRealPilot638/Llama-3.1-8B-PRM-Skywork-llama-o1-Math500

Updated Jul 20, 2025

updated a dataset 6 months ago

TheRealPilot638/Qwen3-8B-report-recreate-Math500

Updated Jul 14, 2025 • 1

published a dataset 7 months ago

TheRealPilot638/Qwen3-8B-report-recreate-Math500

Updated Jul 14, 2025 • 1

updated a dataset 7 months ago

TheRealPilot638/Qwen3-8B-Math500-baseline

Viewer • Updated Jul 12, 2025 • 500 • 3

published a dataset 7 months ago

TheRealPilot638/Qwen3-8B-Math500-baseline

Viewer • Updated Jul 12, 2025 • 500 • 3

updated a dataset 7 months ago

TheRealPilot638/DeepSeek-R1-0528-Qwen3-8B-Math500-baseline

Viewer • Updated Jul 11, 2025 • 500 • 1

upvoted 2 articles 7 months ago

Article

All LLMs Will Be Sparse BitNet Hybrids

May 14, 2025

•

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

273

published a dataset 7 months ago

TheRealPilot638/DeepSeek-R1-0528-Qwen3-8B-Math500-baseline

Viewer • Updated Jul 11, 2025 • 500 • 1

updated 2 datasets 7 months ago

TheRealPilot638/Qwen3-8B-BS16-RLHF-PRM-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 1

TheRealPilot638/DeepSeek-R1-Distill-Llama-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 2

published a dataset 7 months ago

TheRealPilot638/Qwen3-8B-BS16-RLHF-PRM-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 1

updated a dataset 7 months ago

TheRealPilot638/DeepSeek-R1-Distill-Qwen3-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 25, 2025 • 198 • 6

published 2 datasets 7 months ago

TheRealPilot638/DeepSeek-R1-Distill-Llama-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 2

TheRealPilot638/DeepSeek-R1-Distill-Qwen3-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 25, 2025 • 198 • 6

David Limpus

AI & ML interests

Recent Activity

Organizations

TheRealPilot638's activity

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

All LLMs Will Be Sparse BitNet Hybrids

Fine-tuning LLMs to 1.58bit: extreme quantization made easy