A collection of items related to the MMTEB release
AI & ML interests
Massive Text Embeddings Benchmark
Papers
MAEB: Massive Audio Embedding Benchmark
HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
Organization Card
MTEB is a Python framework for evaluating embeddings and retrieval systems on both text and images. It covers more than 1000 languages and a diverse set of tasks, from classics such as classification and clustering to use-case-specific tasks such as legal, code, or healthcare retrieval.
To get started with mteb, check out our documentation.
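To make concrete what evaluating embeddings on a retrieval task involves, here is a minimal sketch using only the Python standard library. It is an illustration, not mteb's implementation or API: the helper names (`cosine`, `rank_documents`, `recall_at_k`), the toy 2-dimensional vectors, and the choice of recall@k as the metric are all assumptions made for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_documents(query_vec, doc_vecs):
    """Return document ids sorted from most to least similar to the query."""
    scores = {doc_id: cosine(query_vec, vec) for doc_id, vec in doc_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)


def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant documents that appear in the top-k ranked results."""
    if not relevant_ids:
        return 0.0
    hits = len(set(ranked_ids[:k]) & set(relevant_ids))
    return hits / len(relevant_ids)


# Hypothetical toy data: in practice an embedding model would produce these vectors.
doc_vecs = {"d1": [1.0, 0.0], "d2": [0.0, 1.0], "d3": [0.7, 0.7]}
query_vec = [0.9, 0.1]
relevant = {"d1", "d3"}

ranked = rank_documents(query_vec, doc_vecs)
print(ranked)
print(recall_at_k(ranked, relevant, k=1))
print(recall_at_k(ranked, relevant, k=2))
```

A benchmark framework automates the same loop at scale: it embeds queries and documents with the model under test, ranks them, and aggregates metrics across many tasks and languages.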
| Overview | |
|---|---|
| Leaderboard | The interactive leaderboard of the benchmark |
| Get Started | |
| Get Started | Overview of how to use mteb |
| Defining Models | How to use existing models and define custom ones |
| Selecting tasks | How to select tasks, benchmarks, splits, etc. |
| Running Evaluation | How to run evaluations, including cache management and speeding up evaluations |
| Loading Results | How to load and work with existing model results |
| Overview | |
| Tasks | Overview of available tasks |
| Benchmarks | Overview of available benchmarks |
| Models | Overview of available models |
| Contributing | |
| Adding a model | How to submit a model to MTEB and to the leaderboard |
| Adding a dataset | How to add a new task/dataset to MTEB |
| Adding a benchmark | How to add a new benchmark to MTEB and to the leaderboard |
| Contributing | How to contribute to MTEB and set it up for development |
Spaces (5)

| Space | Description |
|---|---|
| MTEB Leaderboard | Embedding Leaderboard |
| MTEB Legacy Leaderboard | Explore and filter MTEB model benchmark results |
| Leaderboard Dev | Dedicated display for RTEB benchmark results |
| MTEB Arena | Display MTEB Arena interface |
Datasets (1,541)
mteb/results
mteb/Vidore3EnergyOCRRetrieval
mteb/Vidore3PhysicsOCRRetrieval
mteb/Vidore3FinanceFrOCRRetrieval
mteb/Vidore3HrOCRRetrieval
mteb/Vidore3PharmaceuticalsOCRRetrieval
mteb/Vidore3ComputerScienceOCRRetrieval
mteb/Vidore3IndustrialOCRRetrieval
mteb/Vidore3FinanceEnOCRRetrieval
mteb/llm-eval-big_patent_clustering