YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Terminal Agent - Multi-Task NAT v13
Model Description
This model is fine-tuned from Qwen3-8B on multi-task terminal agent trajectories using Negative-Aware Training (NAT).
Key Features
- 5 Tasks: fix-git, cancel-async-tasks, log-summary-date-ranges, regex-log, pypi-server
- Fixed Tool Signatures: Corrected critical bug where
note_namewas incorrectly removed - Clean Tool Calls: Removed hallucinated parameters (message_title, message_description, message_attachment) from training
- Negative Examples: Includes looping and wrong_command negative examples
Training Details
- Base Model: Qwen/Qwen3-8B
- Training Data: 40 samples (20 positive, 20 negative)
- Epochs: 99/300 (checkpoint at epoch 99)
- Learning Rate: 5e-5
- Batch Size: 4
- Global Step: 899
Evaluation Results
- Overall Success Rate: 56% (14/25 trials)
- fix-git: 4/5 (80%)
- regex-log: 4/5 (80%)
- cancel-async-tasks: 3/5 (60%)
- log-summary-date-ranges: 3/5 (60%)
- pypi-server: 0/5 (0%)
Tool Signatures (Corrected)
shell_exec(id, command, block)shell_write_content_to_file(content, file_path)create_note(note_name, content)append_note(note_name, content)read_note(note_name)
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained(
"alievak/terminal_agent_multitask_nat_v13",
torch_dtype="auto",
device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(
"alievak/terminal_agent_multitask_nat_v13"
)
# Example usage
messages = [
{"role": "system", "content": "You are a terminal agent..."},
{"role": "user", "content": "Fix the git repository..."}
]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=2048)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
V13 Fixes
- KEEP note_name - Required by runtime (was incorrectly removed in v12)
- System prompt uses note_name - Matches runtime expectations
- Remove only hallucinated params - message_title, message_description, message_attachment
- Added tool call validation - Catches signature issues before training
Known Issues
- Model still outputs hallucinated parameters (412 occurrences in eval) - embedded in base model
- pypi-server fails due to missing mkdir before write
- Action sequence divergence from teacher trajectories
Model Size
- Total: ~16GB (4 safetensors)
- Architecture: 8.2B parameters
License
MIT License
Citation
If you use this model, please cite:
@misc{terminal_agent_v13,
title={Terminal Agent Multi-Task NAT v13},
author={alievak},
year={2026},
url={https://huggingface.co/alievak/terminal_agent_multitask_nat_v13}
}
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support