Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a model about 1 hour ago
entfane/toxic_gemma2b_classifier published a model about 1 hour ago
entfane/toxic_gemma2b_classifier upvoted a paper 12 days ago
Blockwise Advantage Estimation for Multi-Objective RL with Verifiable Rewards