Yasmin Moslem PRO
ymoslem
AI & ML interests
Machine Translation, Speech Translation, Large Language Models, Natural Language Processing
Recent Activity
liked a model about 8 hours ago: Qwen/Qwen3.5-35B-A3B-FP8
upvoted a collection about 17 hours ago: Quantized VibeThinker-1.5B
updated a Space 2 days ago: AfriNLP/README
Organizations
MT Quality Estimation
Models for reference-free quality estimation of machine translation
- ymoslem/ModernBERT-base-long-context-qe-v1
  Text Classification • 0.1B • Updated • 6 • 5
- ymoslem/ModernBERT-large-qe-v1
  Text Classification • 0.4B • Updated • 1 • 2
- ymoslem/xlm-roberta-large-qe-v1
  Text Classification • 0.6B • Updated • 2 • 1
- ymoslem/ModernBERT-large-qe-maxlen512-v1
  Text Classification • 0.4B • Updated • 1 • 1
WMT-Model-Compression
- Iterative Layer Pruning for Efficient Translation Inference
  Paper • 2510.22763 • Published
- ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
  Text Generation • 6B • Updated • 1
- ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 2
- ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
  Text Generation • 5B • Updated • 1
models 69
ymoslem/wmt25-eng-arz-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 1
ymoslem/wmt25-eng-arz-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 1
ymoslem/wmt25-eng-arz-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 1
ymoslem/aya-expanse-8b-eng-arz-16layers
Text Generation • 5B • Updated • 2
ymoslem/aya-expanse-8b-eng-arz-20layers
Text Generation • 5B • Updated • 5
ymoslem/aya-expanse-8b-eng-arz-24layers
Text Generation • 6B • Updated • 1
ymoslem/aya-expanse-8b-20layers-cs-de-iter
Text Generation • 5B • Updated • 4
ymoslem/wmt25-ces-deu-16layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 1
ymoslem/wmt25-ces-deu-20layers-2e-5lr-news-commentary
Text Generation • 5B • Updated • 2
ymoslem/wmt25-ces-deu-24layers-2e-5lr-news-commentary
Text Generation • 6B • Updated • 1
datasets 41
ymoslem/AIME-clustered
Viewer • Updated • 951 • 26
ymoslem/TeleQnA-clustered-2
Viewer • Updated • 10k • 13
ymoslem/news-commentary-eng-arz
Viewer • Updated • 83.7k • 72
ymoslem/flores-test-pruning
Viewer • Updated • 1.1k • 16
ymoslem/TeleQnA-processed
Viewer • Updated • 10k • 24
ymoslem/Anhui-Telecom-QA
Viewer • Updated • 157k • 7 • 2
ymoslem/TeleQnA-clustered-3
Viewer • Updated • 10k • 12
ymoslem/Law-StackExchange
Viewer • Updated • 24.4k • 287 • 31
ymoslem/IWSLT2025-Test
Viewer • Updated • 772 • 23
ymoslem/news-commentary-en-ar
Viewer • Updated • 84.3k • 9 • 1