Β·
AI & ML interests
Benchmark, Code Generation Model
Organizations
view article π’ INT4 vs FP4: The Future of 4-Bit Quantization
onekq
β’ β’ 6
view article π Muon Optimizer: The Power of Collective Momentum
onekq
β’ β’ 6
view article β³ Optimizer: What Does It Do and Why We Need It
onekq
β’ β’ 7
view article π³ QAT: The Art of Growing a Bonsai Model
onekq
β’ β’ 15
view article Vision Tokens vs Text Tokens: Understanding the 10Γ Compression
onekq
β’ β’ 6
published an article over 1 year ago view article Does Daily Software Engineering Work Need Reasoning Models?
onekq
β’ β’ 5
published an article over 1 year ago view article All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes
onekq
β’ β’ 5