5 101 32

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper 2 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

upvoted a paper 2 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

upvoted a paper 7 days ago

DMax: Aggressive Parallel Decoding for dLLMs

View all activity

Organizations

None yet

upvoted 2 papers 2 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 5 days ago • 68

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 7 days ago • 74

upvoted 2 papers 7 days ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 9 days ago • 50

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 10 days ago • 182

upvoted a paper 10 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 12 days ago • 107

upvoted a paper 18 days ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 22 days ago • 155

upvoted a paper about 1 month ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13

liked a model about 2 months ago

Qwen/Qwen3.5-0.8B

Image-Text-to-Text • 0.9B • Updated Mar 2 • 2.66M • 498

upvoted a paper 2 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 267

upvoted an article 3 months ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

•

upvoted a paper 3 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted 3 papers 4 months ago

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published Dec 27, 2025 • 15

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 177

liked a dataset 5 months ago

yenopoya/thousand-voices-trauma

Updated Oct 24, 2025 • 24 • 4

upvoted a paper 6 months ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 59

upvoted a paper 8 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32

liked a model 8 months ago

LiquidAI/LFM2-350M

Text Generation • 0.4B • Updated 18 days ago • 33k • 249

upvoted a paper 8 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 211

upvoted a paper 9 months ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 142

Ha-Yeong Choi

AI & ML interests

Recent Activity

Organizations

Ha0's activity

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR