Xue Yang

yangxue

yangxue0827

AI & ML interests

None yet

Recent Activity


upvoted a paper about 17 hours ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published 2 days ago • 45

upvoted a paper about 19 hours ago

RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published 1 day ago • 22

submitted a paper to Daily Papers about 19 hours ago

RISE-Video: Can Video Generators Decode Implicit World Rules?

Paper • 2602.05986 • Published 1 day ago • 22

updated a dataset 1 day ago

VisionXLab/RISE-Video

Preview • Updated 1 day ago • 8

published a dataset 1 day ago

VisionXLab/RISE-Video

Preview • Updated 1 day ago • 8

upvoted a collection about 2 months ago

SGI-Bench

Collection

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 9 items • Updated Dec 24, 2025 • 31

upvoted 2 papers about 2 months ago

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 6

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

Paper • 2512.08478 • Published Dec 9, 2025 • 77

upvoted 2 papers 4 months ago

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 109

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

Paper • 2510.08565 • Published Oct 9, 2025 • 21

upvoted a paper 10 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

authored 9 papers 10 months ago

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection

Paper • 2210.06742 • Published Oct 13, 2022 • 1

Self-supervised Character-to-Character Distillation for Text Recognition

Paper • 2211.00288 • Published Nov 1, 2022

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 6

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Paper • 2411.18624 • Published Nov 27, 2024

Parameter-Inverted Image Pyramid Networks

Paper • 2406.04330 • Published Jun 6, 2024 • 1

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

Paper • 2312.09238 • Published Dec 14, 2023

Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding

Paper • 2501.07783 • Published Jan 14, 2025 • 8

A Simple Aerial Detection Baseline of Multimodal Language Models

Paper • 2501.09720 • Published Jan 16, 2025 • 2

PointOBB: Learning Oriented Object Detection via Single Point Supervision

Paper • 2311.14757 • Published Nov 23, 2023
