AI & ML interests

LLMs

Organization Card

KlusAI
Where AI research meets real-world impact

Website · GitHub · X · Research


๐Ÿ” What We're About

KlusAI bridges the gap between cutting-edge AI research and production systems. We publish our datasets and models openly to advance the field โ€” 9M+ synthetic training examples and counting.

Research Themes:

  • Synthetic Data Generation: Large-scale training data without privacy concerns
  • Efficient AI Systems: Models that run on consumer hardware
  • Multilingual NLP: With deep Romanian language expertise

Featured Publication

Synthetic Data Generation Using Large Language Models

Advances in Text and Code (IEEE Access, 2025)

Our comprehensive survey on generating training data using LLMs. It covers how enterprises can generate training data at scale: reducing annotation costs, addressing data scarcity, and enabling fine-tuning without exposing sensitive data.

Read on IEEE Xplore · arXiv Preprint


Flagship Project: TinyFabulist

TinyFabulist is our open research programme on large-scale synthetic narrative generation. We demonstrate that small, efficient models can produce high-quality training data at scale.

| Release | Description | Size |
|---|---|---|
| TinyFabulist v1 | Synthetic English Fables | ~3M examples |
| Upcoming | Multilingual extensions, evaluation benchmarks | - |

Key principles:

  • Scale: 9M+ synthetic training examples generated
  • Efficiency: All content produced with ≤8B-parameter models
  • Openness: Generation scripts, pipelines, and methodology shared publicly

Paper (arXiv) · Code (GitHub)


What You'll Find Here

  • Datasets: Large-scale synthetic training corpora for fine-tuning and research
  • Models: Efficient, instruction-tuned models optimized for specific tasks
  • Evaluation: Benchmarks and tooling for synthetic data quality assessment

Work With Us

Beyond open research, we offer enterprise AI services:

| Service | Description |
|---|---|
| AI Strategy | Define your AI roadmap and implementation plan |
| Custom Development | Bespoke AI solutions tailored to your domain |
| Model Training | Fine-tuning and deploying models for your use case |
| MLOps & Infrastructure | Scalable pipelines and production deployment |

Need custom synthetic data or domain-specific models? We partner with organizations on applied research challenges.


Get in Touch

| Purpose | Contact |
|---|---|
| Research collaboration | research@klusai.com |
| Enterprise services | services@klusai.com |
| General inquiries | hello@klusai.com |

Technical questions? Open an issue on the relevant dataset or model repository.


Applied Research · AI Services · Ventures
klusai.com · GitHub · X