University of Texas at Austin

university

Verified

https://www.utexas.edu

AI & ML interests

None defined yet.

Recent Activity

atutej submitted a paper 1 day ago

EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models

SP2001 authored a paper 11 days ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

SP2001 submitted a paper 11 days ago

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

View all activity

Papers

EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models

Adaptive Evidence Weighting for Audio-Spatiotemporal Fusion

View all Papers

atutej

submitted a paper to Daily Papers 1 day ago

EntRGi: Entropy Aware Reward Guidance for Diffusion Language Models

Paper • 2602.05000 • Published 3 days ago • 1

Sunny111

posted an update 22 days ago

Post

1610

Are you familiar with reverse residual connections or looping in language models?

Excited to share my Looped-GPT blog post and codebase 🚀
https://github.com/sanyalsunny111/Looped-GPT

TL;DR: looping during pre-training improves generalization.

Plot shows GPT2 LMs pre-trained with 15.73B OWT tokens

P.S. This is my first post here — I have ~4 followers and zero expectations for reach 😄

3 replies

·

ChristinaW

authored a paper 2 months ago

Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models

Paper • 2512.03125 • Published Dec 2, 2025 • 2

cotran2

authored a paper 5 months ago

Arch-Router: Aligning LLM Routing with Human Preferences

Paper • 2506.16655 • Published Jun 19, 2025 • 17

gdhe17

authored 3 papers 8 months ago

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 2

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

Paper • 2405.04233 • Published May 7, 2024 • 3

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9, 2025 • 30

lytang

authored a paper 9 months ago

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Paper • 2505.13444 • Published May 19, 2025 • 17

gdhe17

authored a paper 11 months ago

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Paper • 2503.01103 • Published Mar 3, 2025 • 5

gdhe17

authored a paper 12 months ago

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Paper • 2502.15894 • Published Feb 21, 2025 • 20

XCLiu

authored a paper 12 months ago

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Paper • 2502.06608 • Published Feb 10, 2025 • 39

joydeep-b

authored a paper about 1 year ago

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published Feb 3, 2025 • 9

peihaowang

authored a paper about 1 year ago

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Paper • 2501.00712 • Published Jan 1, 2025 • 6

gdhe17

authored a paper about 1 year ago

Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis

Paper • 2312.03491 • Published Dec 6, 2023 • 34

XCLiu

authored a paper about 1 year ago

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 31

XCLiu

authored 4 papers over 1 year ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published Oct 17, 2024 • 35

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Paper • 2407.12718 • Published Jul 17, 2024

Consistency Flow Matching: Defining Straight Flows with Velocity Consistency

Paper • 2407.02398 • Published Jul 2, 2024 • 18

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Paper • 2405.08748 • Published May 14, 2024 • 23

lytang

authored a paper almost 2 years ago

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Paper • 2404.10774 • Published Apr 16, 2024 • 6