SII-baibizhe

baibizhe

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

authored a paper 1 day ago

Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Paper • 2602.02555 • Published Jan 30 • 1

upvoted a paper 1 day ago

Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Paper • 2602.02555 • Published Jan 30 • 1

upvoted a paper 3 months ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published Nov 19, 2025 • 97

liked a dataset 4 months ago

CodeGoat24/UniGenBench

Updated Oct 25, 2025 • 58 • 3

liked a Space 4 months ago

UniGenBench Leaderboard (English)

UniGenBench: a unified T2I generation benchmark.

liked a model 4 months ago

CodeGoat24/UniGenBench-EvalModel-qwen-72b-v1

Image-Text-to-Text • 73B • Updated Oct 25, 2025 • 270 • 3

liked a dataset 4 months ago

CodeGoat24/UniGenBench-Eval-Images

Preview • Updated 11 days ago • 3.18k • 4

New activity in xuan-luo/FlexiDepth-Llama-3-8B-Instruct 6 months ago

thanks for your wonderful work. question about accuracy in paper

#2 opened 6 months ago by baibizhe

upvoted a collection 7 months ago

DigitalGene

Collection • 4 items • Updated Aug 11, 2025 • 2

updated a dataset 7 months ago

baibizhe/Digital_Gene_Benchmark

Viewer • Updated Jul 29, 2025 • 4.69k • 16

published a dataset 7 months ago

baibizhe/Digital_Gene_Benchmark

Viewer • Updated Jul 29, 2025 • 4.69k • 16

upvoted 2 papers 9 months ago

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Paper • 2506.10857 • Published Jun 12, 2025 • 30

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Paper • 2505.21327 • Published May 27, 2025 • 83

updated a model over 1 year ago

baibizhe/submit_for_ccfchip

Updated Jun 30, 2024

updated a model almost 2 years ago

baibizhe/ganllm

Updated Mar 20, 2024

replied to vladbogo's post almost 2 years ago

Running in fp16 on CPU and system memory would probably require about 700 GB of RAM.
