talkie-1930-13b-it

talkie-1930-13b-it is a 13B vintage language model. It is an instruction-tuned post-train of talkie-1930-13b-base, which was trained on 260B tokens of pre-1931 English-language text.

talkie-1930-13b-it was finetuned using a novel dataset of instruction-response pairs extracted from pre-1931 reference works, including etiquette manuals, encyclopedias, and letter-writing manuals. The model then underwent reinforcement learning (online DPO with an LLM-as-a-judge) to improve instruction-following ability.

Model tree for talkie-lm/talkie-1930-13b-it

Base model

talkie-lm/talkie-1930-13b-base

Finetuned

(1)

this model

Finetunes

2 models

Spaces using talkie-lm/talkie-1930-13b-it 3

Collection including talkie-lm/talkie-1930-13b-it

talkie-13b

Collection

talkie-1930-13b is a vintage language model trained on pre-1931 English-language text. See https://github.com/talkie-lm/talkie to run talkie. • 3 items • Updated 7 days ago • 19