vector-institute/Qwen3-4B-UnBias-Plus-SFT
Text Generation • 4B • Updated
• 24
None defined yet.
When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation