AI & ML interests
LLMs
Recent Activity
KlusAI
Where AI research meets real-world impact
๐ What We're About
KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field โ 9M+ synthetic training examples and counting.
Research Themes:
- ๐งฌ Synthetic Data Generation โ Large-scale training data without privacy concerns
- โก Efficient AI Systems โ Models that run on consumer hardware
- ๐ Multilingual NLP โ With deep Romanian language expertise
๐ Featured Publication
Synthetic Data Generation Using Large Language Models
Advances in Text and Code โ IEEE Access, 2025
Our comprehensive survey on generating training data using LLMs. How enterprises can generate training data at scale โ reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.
๐ Read on IEEE Xplore ยท ๐ arXiv Preprint
๐ฌ Flagship Project: TinyFabulist
TinyFabulist is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale.
| Release | Description | Size |
|---|---|---|
| TinyFabulist v1 | Synthetic English Fables | ~3M examples |
| Upcoming | Multilingual extensions, evaluation benchmarks | โ |
Key principles:
- ๐ Scale โ 9M+ synthetic training examples generated
- ๐ง Efficiency โ All content produced with โค8B parameter models
- ๐ Openness โ Generation scripts, pipelines, and methodology shared publicly
๐ Paper (arXiv) ยท ๐ป Code (GitHub)
๐ฆ What You'll Find Here
- Datasets โ Large-scale synthetic training corpora for fine-tuning and research
- Models โ Efficient, instruction-tuned models optimized for specific tasks
- Evaluation โ Benchmarks and tooling for synthetic data quality assessment
๐ค Work With Us
Beyond open research, we offer enterprise AI services:
| Service | Description |
|---|---|
| AI Strategy | Define your AI roadmap and implementation plan |
| Custom Development | Bespoke AI solutions tailored to your domain |
| Model Training | Fine-tuning and deploying models for your use case |
| MLOps & Infrastructure | Scalable pipelines and production deployment |
Need custom synthetic data or domain-specific models? We partner with organizations on applied research challenges.
๐ซ Get in Touch
| Purpose | Contact |
|---|---|
| Research collaboration | research@klusai.com |
| Enterprise services | services@klusai.com |
| General inquiries | hello@klusai.com |
Technical questions? Open an issue on the relevant dataset or model repository.
Applied Research ยท AI Services ยท Ventures
klusai.com ยท GitHub ยท X