On Data Engineering for Scaling LLM Terminal Capabilities
Paper • arXiv:2602.21193
A practical training guide and recipe for building state-of-the-art agentic coding assistants on open-source 8B-parameter models.
This repository consolidates research from Nemotron-Terminal, Klear-AgentForge, GLM-5, and Qwen3-Coder-Next into a single reproducible training pipeline:
Base: nvidia/Nemotron-Terminal-8B
tool_calls support

| Benchmark | 8B Target | SOTA Reference |
|---|---|---|
| SWE-bench Verified | 20-40% | Klear-AgentForge: 39.4% |
| BFCL v3 | 65-75% | Klear-AgentForge: 71.5% |
| Terminal-Bench 2.0 | 15-25% | Nemotron-T-14B: 20.2% |
| Aider-Polyglot | 25-40% | Klear-AgentForge: 33.8% |
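One way to track evaluation runs against these targets is a small range check. The sketch below copies the target ranges and reference scores from the table; the helper name `check_targets` and the below/within/above labels are illustrative assumptions, not part of the repository.

```python
# Target ranges and reference scores copied from the table above (percent).
TARGETS = {
    "SWE-bench Verified": (20.0, 40.0),
    "BFCL v3": (65.0, 75.0),
    "Terminal-Bench 2.0": (15.0, 25.0),
    "Aider-Polyglot": (25.0, 40.0),
}
REFERENCES = {
    "SWE-bench Verified": 39.4,   # Klear-AgentForge
    "BFCL v3": 71.5,              # Klear-AgentForge
    "Terminal-Bench 2.0": 20.2,   # Nemotron-T-14B
    "Aider-Polyglot": 33.8,       # Klear-AgentForge
}


def check_targets(scores, targets=TARGETS):
    """Label each measured score as 'below', 'within', or 'above' its target range."""
    status = {}
    for name, score in scores.items():
        low, high = targets[name]
        if score < low:
            status[name] = "below"
        elif score > high:
            status[name] = "above"
        else:
            status[name] = "within"
    return status


if __name__ == "__main__":
    # Hypothetical usage: substitute your own measured scores for REFERENCES.
    for name, verdict in check_targets(REFERENCES).items():
        print(f"{name}: {REFERENCES[name]:.1f}% is {verdict} the target range")
```

Each reference score in the table falls inside the corresponding 8B target range, so the targets amount to roughly matching the listed reference systems.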
```bash
# Stage 1: SFT on curated agent trajectories
python train_sft.py \
  --model nvidia/Nemotron-Terminal-8B \
  --dataset mixed_agentic_dataset \
  --output_dir ./nexus-coder-sft

# Stage 2: GRPO with execution-verified rewards
python train_grpo.py \
  --model ./nexus-coder-sft \
  --dataset nvidia/Nemotron-RL-Agentic-SWE-Pivot-v1 \
  --output_dir ./nexus-coder-rl
```
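The internals of `train_grpo.py` are not shown here, so the following is only a minimal sketch of what "execution-verified rewards" could look like in Stage 2. It assumes a binary reward (1.0 when a candidate passes its tests in a subprocess, 0.0 otherwise) and the standard GRPO group-relative advantage, where each reward is normalized by the mean and standard deviation of its sampling group. The function names, the timeout, and the toy `add` task are hypothetical.

```python
import os
import statistics
import subprocess
import sys
import tempfile


def execution_reward(candidate_code: str, test_code: str, timeout_s: float = 10.0) -> float:
    """Return 1.0 if candidate plus tests exit with code 0, else 0.0.

    Assumption: a binary pass/fail reward. This runs untrusted code with no
    sandboxing, so a real pipeline should isolate it (container, VM, etc.).
    """
    with tempfile.TemporaryDirectory() as workdir:
        script = os.path.join(workdir, "check.py")
        with open(script, "w", encoding="utf-8") as handle:
            handle.write(candidate_code + "\n\n" + test_code + "\n")
        try:
            result = subprocess.run(
                [sys.executable, script],
                cwd=workdir,
                capture_output=True,
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            return 0.0  # Treat hangs as failures.
    return 1.0 if result.returncode == 0 else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """GRPO-style advantages: (r - mean) / (std + eps) within one prompt's group."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


if __name__ == "__main__":
    # Hypothetical group of two sampled completions for the same prompt.
    tests = "assert add(2, 3) == 5"
    candidates = [
        "def add(a, b):\n    return a + b",  # correct
        "def add(a, b):\n    return a - b",  # buggy
    ]
    rewards = [execution_reward(code, tests) for code in candidates]
    print(rewards, group_relative_advantages(rewards))
```

In this toy group the passing candidate receives a positive advantage and the failing one a negative advantage. When every candidate in a group gets the same reward, all advantages are zero and the group contributes no learning signal.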
| Dataset | Split | Purpose | Link |
|---|---|---|---|
| SWE-bench/SWE-smith-trajectories | tool (resolved=True) | SFT: Real repo bug fixing | HF |
| nvidia/Nemotron-Agentic-v1 | interactive_agent + tool_calling | SFT: Multi-turn tool use | HF |
| xingyaoww/code-act | codeact + general | SFT: Executable code actions | HF |
| nvidia/Nemotron-RL-Agentic-SWE-Pivot-v1 | train | RL: Step-level pass-rate rewards | HF |
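The three SFT rows presumably feed the `mixed_agentic_dataset` passed to Stage 1, though the mixing logic is not documented here. The sketch below illustrates one plausible approach on plain Python records: keep only trajectories marked as resolved, then sample from each source according to a weight. The `resolved` field name, the source labels, and the weights are all assumptions for illustration; the real dataset schemas may differ.

```python
import random


def keep_resolved(records):
    """Keep only trajectories whose (hypothetical) `resolved` flag is True."""
    return [record for record in records if record.get("resolved") is True]


def mix_sources(sources, weights, n_samples, seed=0):
    """Draw n_samples records with replacement, choosing a source per draw by weight."""
    rng = random.Random(seed)
    names = [name for name in sources if sources[name]]  # Skip empty sources.
    if not names:
        return []
    probs = [weights[name] for name in names]
    mixed = []
    for _ in range(n_samples):
        name = rng.choices(names, weights=probs, k=1)[0]
        mixed.append(rng.choice(sources[name]))
    return mixed


if __name__ == "__main__":
    # Toy stand-ins for the three SFT sources; real records hold full trajectories.
    swe_smith = [
        {"id": "swe-1", "resolved": True},
        {"id": "swe-2", "resolved": False},
    ]
    sources = {
        "swe_smith": keep_resolved(swe_smith),
        "nemotron_agentic": [{"id": "nem-1"}, {"id": "nem-2"}],
        "code_act": [{"id": "act-1"}],
    }
    # Hypothetical mixture weights; tune these for your own recipe.
    weights = {"swe_smith": 0.5, "nemotron_agentic": 0.3, "code_act": 0.2}
    for record in mix_sources(sources, weights, n_samples=5, seed=42):
        print(record["id"])
```

Sampling with replacement keeps the sketch short but can repeat examples from small sources; a production mixture would more likely shuffle and cap each source instead.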
If you use this recipe, please cite the underlying research:
```bibtex
@article{nemotron-terminal-2026,
  title={Nemotron-Terminal: Scalable Training for Terminal-Capable Language Models},
  author={NVIDIA},
  journal={arXiv:2602.21193},
  year={2026}
}

@article{klear-agentforge-2025,
  title={Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling},
  author={Klear-AI},
  journal={arXiv:2511.05951},
  year={2025}
}

@article{glm5-2026,
  title={GLM-5: from Vibe Coding to Agentic Engineering},
  author={Zhipu AI},
  journal={arXiv:2602.15763},
  year={2026}
}
```
The training guide and scripts are provided as-is for research and educational purposes. Dataset and base model licenses apply to their respective owners.