talkie-1930-13b-it

talkie-1930-13b-it is a 13B vintage language model. It is an instruction-tuned post-train of talkie-1930-13b-base, which was trained on 260B tokens of pre-1931 English-language text.

talkie-1930-13b-it was finetuned using a novel dataset of instruction-response pairs extracted from pre-1931 reference works, including etiquette manuals, encyclopedias, and letter-writing manuals. The model then underwent reinforcement learning (online DPO with an LLM-as-a-judge) to improve instruction-following ability.

Read more about talkie in our report.

Reference code to run talkie is available on GitHub.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ 10 Ask for provider support

Model tree for talkie-lm/talkie-1930-13b-it

Finetuned
(1)
this model
Finetunes
2 models

Spaces using talkie-lm/talkie-1930-13b-it 3

Collection including talkie-lm/talkie-1930-13b-it