Rene's picture

1 4

Rene

Rene1996

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

upvoted a paper 24 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

upvoted a paper 4 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

View all activity

Organizations

upvoted a paper about 5 hours ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published 5 days ago • 21

upvoted a paper 24 days ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published 26 days ago • 125

upvoted a paper 4 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

upvoted a paper 12 months ago

Iterative Value Function Optimization for Guided Decoding

Paper • 2503.02368 • Published Mar 4, 2025 • 15