DeepSeek
company
Verified
AI & ML interests
None defined yet.
Recent Activity
Papers
DeepSeek-OCR 2: Visual Causal Flow
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
-
Chat with DeepSeek-VL2-small
🌍582Chat with images: ask questions and get AI answers
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • Updated • 203k • 244 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 87.3k • 175 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 3.54k • 379
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • Updated • 5.67k • 679 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 188 • 82 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 5.79k • 102 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • Updated • 251k • • 537
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • Updated • 31.2k • • 953 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 578 • 56 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 346k • • 1.25k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 172k • 671
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 552k • • 13k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 3.21k • 945 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 270k • • 741 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 1.23M • • 1.52k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 4.74k • 681 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 3.78k • 147 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.16k • 91 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.15k • 86
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
models for paper expert-specialized fine-tuning
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 7.38k • 564 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 98.5k • 472 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 8.79k • 143 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 148k • 155
-
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation • Updated • 31.2k • • 953 -
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation • 685B • Updated • 578 • 56 -
deepseek-ai/DeepSeek-V3.2
Text Generation • 685B • Updated • 346k • • 1.25k -
deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • Updated • 172k • 671
-
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 552k • • 13k -
deepseek-ai/DeepSeek-R1-Zero
Text Generation • 685B • Updated • 3.21k • 945 -
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 270k • • 741 -
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation • 33B • Updated • 1.23M • • 1.52k
DeepSeek Math series
-
deepseek-ai/DeepSeek-Math-V2
Text Generation • 685B • Updated • 4.74k • 681 -
deepseek-ai/deepseek-math-7b-instruct
Text Generation • Updated • 3.78k • 147 -
deepseek-ai/deepseek-math-7b-rl
Text Generation • 7B • Updated • 2.16k • 91 -
deepseek-ai/deepseek-math-7b-base
Text Generation • Updated • 3.15k • 86
-
Chat with DeepSeek-VL2-small
🌍582Chat with images: ask questions and get AI answers
-
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text • Updated • 203k • 244 -
deepseek-ai/deepseek-vl2-small
Image-Text-to-Text • 16B • Updated • 87.3k • 175 -
deepseek-ai/deepseek-vl2
Image-Text-to-Text • Updated • 3.54k • 379
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
DeepSeek-Prover-Series
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation • Updated • 5.67k • 679 -
deepseek-ai/DeepSeek-Coder-V2-Base
Text Generation • 236B • Updated • 188 • 82 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Base
Text Generation • 16B • Updated • 5.79k • 102 -
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation • Updated • 251k • • 537
models for paper expert-specialized fine-tuning
DeepSeek-VL model series
DeepSeek Coder series
-
deepseek-ai/deepseek-coder-33b-instruct
Text Generation • 33B • Updated • 7.38k • 564 -
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation • 7B • Updated • 98.5k • 472 -
deepseek-ai/deepseek-coder-7b-instruct-v1.5
Text Generation • 7B • Updated • 8.79k • 143 -
deepseek-ai/deepseek-coder-1.3b-instruct
Text Generation • Updated • 148k • 155
DeepSeek LLM series
DeepSeek MoE series