- Qwen/Qwen2.5-0.5B-Instruct
  Text Generation • 0.5B • Updated • 6.12M • 478
- Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int4
  Text Generation • 0.5B • Updated • 1.04k • 9
- Qwen/Qwen2.5-0.5B-Instruct-GPTQ-Int8
  Text Generation • 0.5B • Updated • 642 • 10
- Qwen/Qwen2.5-1.5B-Instruct
  Text Generation • 2B • Updated • 7.72M • 630
Sree Harsha Nelaturu
deepmage121
AI & ML interests
Data and Compute Efficient Deep Learning.
Recent Activity
updated a dataset 5 days ago
deepmage121/drafter_split_training
published a dataset 5 days ago
deepmage121/drafter_split_training
new activity 18 days ago
evaleval/alphaxiv_datastore: Add alphaXiv SOTA raw scrape (5,069 files, 237MB)