AI & ML interests

Training efficient language models (MiniLLM, MiniPLM)

t1101675 in MiniLLM/MiniPLM-Qwen-1.2B about 1 year ago

Add link to code

#1 opened about 1 year ago by nielsr

t1101675 in MiniLLM/Pretrain-Qwen-1.2B about 1 year ago

Add link to code

#1 opened about 1 year ago by nielsr
t1101675 in MiniLLM/Pretrain-Qwen-500M about 1 year ago

No changes needed

#1 opened about 1 year ago by nielsr
t1101675 in MiniLLM/Pretrain-Qwen-200M about 1 year ago

Add link to code

#1 opened about 1 year ago by nielsr

No changes

#1 opened about 1 year ago by nielsr