Nikita Kezins

entfane

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a dataset 8 days ago

entfane/violent_eval

published a dataset 10 days ago

entfane/violent_eval

updated a model 10 days ago

entfane/gpt2_constitutional_classifier_violence

New activity in huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated about 1 month ago

How do I create images?

#9 opened about 2 months ago by

New activity in mistralai/Voxtral-Mini-4B-Realtime-2602 about 2 months ago

How to add another language ?

#22 opened 2 months ago by

TheRealTancrede

New activity in lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF 5 months ago

🚩 Report: Ethical issue(s)

#4 opened about 1 year ago by

New activity in openai/gpt-oss-20b 5 months ago

so much censorship

#48 opened 8 months ago by

New activity in moonshotai/Kimi-K2-Thinking 5 months ago

Token Count Calculation in SFT Data Distribution Curation

#31 opened 5 months ago by

New activity in Qwen/Qwen2.5-3B 5 months ago

Is it actually a base model?

#6 opened 5 months ago by

New activity in openai/gpt-oss-20b 8 months ago

CUDA out of memory issues when running gptoss model on colab T4

#99 opened 8 months ago by

Not able to deploy gpt-oss-20b model in A100s

#124 opened 8 months ago by

Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM

#136 opened 8 months ago by

New activity in ethicalabs/computer-says-no 8 months ago

Diversity of responses

#2 opened 8 months ago by

New activity in yasserrmd/gpt-oss-coder-20b 8 months ago

Reasoning effort during training

#1 opened 8 months ago by

New activity in openai/gpt-oss-20b 8 months ago

NVIDIA L40S GPU's for MXFP4 quantization

#100 opened 8 months ago by

question: setting reasoning effort

#66 opened 8 months ago by

New activity in QuixiAI/dolphin-r1 8 months ago

creation process?

#7 opened about 1 year ago by

New activity in openai/gpt-oss-20b 8 months ago

Thinking but no solution?

#54 opened 8 months ago by

OOM on 3090

#60 opened 8 months ago by

New activity in suriya7/t5-base-text-to-sql 9 months ago

french to sql model

#2 opened 9 months ago by

New activity in Qwen/Qwen3-Reranker-0.6B 9 months ago

reranker0.6b and embedding0.6b are the same model weights？

#6 opened 10 months ago by

New activity in ScienceOne-AI/S1-Base-8B 9 months ago

Benchmarks

#1 opened 9 months ago by

New activity in HuggingFaceTB/SmolLM2-135M-Instruct 9 months ago

Release of SFT tuned model

#8 opened over 1 year ago by