-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.79k • 55 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 384k • 115 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.24M • • 521 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 449k • 199
Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked
a dataset
about 20 hours ago
openmed-community/MedReason-Stenographic
new activity
1 day ago
tartuNLP/finepdfs-et:[bot] Conversion to Parquet
updated
a dataset
1 day ago
tartuNLP/finepdfs-et
Organizations
Multilingual Text Embedding Models
-
tencent/KaLM-Embedding-Gemma3-12B-2511
Sentence Similarity • 12B • Updated • 5.79k • 55 -
nvidia/llama-embed-nemotron-8b
Feature Extraction • 8B • Updated • 384k • 115 -
Qwen/Qwen3-Embedding-8B
Feature Extraction • 8B • Updated • 1.24M • • 521 -
Qwen/Qwen3-Embedding-4B
Feature Extraction • 4B • Updated • 449k • 199
Code RL Datasets
spaces
6
Sleeping
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Sleeping
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Sõnajaht Demo
🐠
Keeltevaheline pöördsõnastik
datasets
15
adorkin/tulu-3-sft-mixture
Viewer
•
Updated
•
939k
•
2
adorkin/extended_tweet_emojis
Viewer
•
Updated
•
52.7k
•
65
•
3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer
•
Updated
•
6.85k
•
19
adorkin/flan-v2-converted-en
Viewer
•
Updated
•
58.2k
•
11
adorkin/mala-bilingual-et-en-scores
Viewer
•
Updated
•
50.9M
•
51
adorkin/dclm-sample-13k-en-et-translation
Viewer
•
Updated
•
13.7k
•
11
adorkin/nllb-et-en-scores
Viewer
•
Updated
•
22M
•
21
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer
•
Updated
•
36.6k
•
21
•
1
adorkin/general-instruction-augmented-corpora
Viewer
•
Updated
•
20M
•
279
•
1
adorkin/dbpedia-entity-est
Viewer
•
Updated
•
4.69M
•
26