arxiv:2505.04620
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal learning, Scene graph Generation
Recent Activity
upvoted
a
paper
21 days ago
SemanticGen: Video Generation in Semantic Space
upvoted
a
paper
25 days ago
Kling-Omni Technical Report
upvoted
a
paper
about 1 month ago
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder