University of Toronto CSSLab

university

https://csslab.cs.toronto.edu/

AI & ML interests

None defined yet.

Recent Activity

difanjiao submitted a paper 4 days ago

LLM Safety From Within: Detecting Harmful Content with Internal Representations

difanjiao updated a model 4 days ago

UofTCSSLab/SIREN-Llama-3.1-8B

difanjiao published a model 4 days ago

UofTCSSLab/SIREN-Llama-3.1-8B

View all activity

Papers

LLM Safety From Within: Detecting Harmful Content with Internal Representations

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

View all Papers

UofTCSSLab 's datasets

None public yet