HAQ NAWAZ MALIK's picture

Building on HF

HAQ NAWAZ MALIK

Omarrran

·

https://haq-nawaz-malik.github.io/

AI & ML interests

None yet

Recent Activity

published a model about 24 hours ago

Omarrran/IITD_MISN_Hiring_Assignment_Haq_Nawaz

updated a model 1 day ago

Omarrran/IITD_MISN_Hiring_Assignment_Haq_Nawaz

updated a model 17 days ago

Omarrran/Kashmiri_Unigram_Tokenizer

View all activity

Organizations

authored a paper about 1 month ago

ks-pret-5m: a 5 million word, 12 million token kashmiri pretraining dataset

Paper • 2604.11066 • Published Apr 13

authored a paper 4 months ago

synthocr-gen: A synthetic ocr dataset generator for low-resource languages- breaking the data barrier

Paper • 2601.16113 • Published Jan 22

authored 2 papers 5 months ago

ks-lit-3m: A 3.1 million word kashmiri text dataset for large language model pretraining

Paper • 2601.01091 • Published Jan 3

600k-ks-ocr: a large-scale synthetic dataset for optical character recognition in kashmiri script

Paper • 2601.01088 • Published Jan 3