embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead
Image-Text-to-Text • 2B
Efficient Drop-In Replacement for the Classification Head in Language Model Inference. https://github.com/embedl/flash-head
On-device benchmarks across devices and models.