opendatalab/MinerU2.5-Pro-2604-1.2B
Image-Text-to-Text • 1B • 43.5k • 101
OpenDataLab provides high-quality open datasets and tools for large models. It is the designated open-source data service platform of the China Large Model Corpus Data Alliance.
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale