Edmond Jacoupeau

edmond

AI & ML interests

None yet

Recent Activity

commented on a paper about 4 hours ago

Rethinking Cross-Layer Information Routing in Diffusion Transformers

Paper • 2605.20708 • Published 6 days ago • 75

liked a dataset 19 days ago

geronimobasso/drone-audio-detection-samples

Viewer • Updated Mar 9, 2025 • 180k • 2.42k • 26

upvoted a paper 20 days ago

Let ViT Speak: Generative Language-Image Pre-training

Paper • 2605.00809 • Published 25 days ago • 32

commented on NEO-unify: Building Native Multimodal Unified Models End to End 27 days ago

This comment has been hidden

commented on NEO-unify: Building Native Multimodal Unified Models End to End 27 days ago

Thanks!
Hopefully https://github.com/facebookresearch/tuna-2 soon too

liked 3 models about 1 month ago

upvoted an article 2 months ago

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Article • nvidia • Jan 29 • 48

liked a model 3 months ago

nvidia/DiffiT

Updated Mar 9 • 11

commented on NEO-unify: Building Native Multimodal Unified Models End to End 3 months ago

Is the model available on HF? 👀

upvoted an article 3 months ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Mar 5 • 163

liked 3 models 3 months ago

google/siglip2-giant-opt-patch16-256

Zero-Shot Image Classification • 2B • Updated Feb 21, 2025 • 5.88k • 4

facebook/dinov2-base

Image Feature Extraction • 86.6M • Updated Jan 17, 2024 • 3.25M • 180

nyu-visionx/RAE-collections

Unconditional Image Generation • Updated Mar 1 • 47

liked a model 4 months ago

facebook/dinov3-vitl16-pretrain-lvd1689m

Image Feature Extraction • 0.3B • Updated Aug 19, 2025 • 723k • 295

upvoted a paper 4 months ago

Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models

Paper • 2601.19834 • Published Jan 27 • 25

liked 2 models 4 months ago

Qwen/Qwen3-Embedding-8B

Feature Extraction • 8B • Updated Jul 7, 2025 • 1.63M • 687

zehongma/DeCo

Updated Nov 25, 2025 • 4

upvoted a paper 5 months ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 52