Aayush

Aayushfaced

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

openai-community/gpt2

upvoted an article about 2 months ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted an article about 2 months ago

New in llama.cpp: Model Management

View all activity

Organizations

None yet

liked a model about 1 month ago

openai-community/gpt2

Text Generation • Updated Feb 19, 2024 • 7.63M • 3.11k

upvoted 2 articles about 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

592

Article

New in llama.cpp: Model Management

Dec 11, 2025

•

118

upvoted 2 collections 3 months ago

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 9 days ago • 102

Inference Optimized Checkpoints (with Model Optimizer)

Collection

A collection of generative models quantized and optimized for inference with Model Optimizer. • 52 items • Updated 2 days ago • 93

upvoted an article 3 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

•

liked 3 Spaces 4 months ago

The Smol Training Playbook

📚

2.98k

The secrets to building world-class LLMs

FineWeb: decanting the web for the finest text data at scale

🍷

1.29k

Read about FineWeb, a large web‑text dataset for LLMs

Robot Learning: A Tutorial

📝

331

Explore the Robot Learning tutorial online

liked a Space 5 months ago

The Ultra-Scale Playbook

🌌

3.69k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 papers 5 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 49

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

liked a dataset 5 months ago

InternRobotics/OmniWorld

Viewer • Updated Jan 8 • 6.35B • 38.3k • 82

upvoted 7 papers 5 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 57

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 178

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 98

Aayush

AI & ML interests

Recent Activity

Organizations

Aayushfaced's activity

We Got Claude to Fine-Tune an Open Source LLM

New in llama.cpp: Model Management

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

The Smol Training Playbook

FineWeb: decanting the web for the finest text data at scale

Robot Learning: A Tutorial

The Ultra-Scale Playbook