arxiv:2601.16208
Jihan Yang
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
upvoted a paper about 5 hours ago
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation upvoted a paper 3 months ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining upvoted a paper 3 months ago
Solaris: Building a Multiplayer Video World Model in Minecraft