talkie-1930-13b-it Transformers Port

This repository packages talkie-lm/talkie-1930-13b-it for Hugging Face Transformers with trust_remote_code=True.

The model is a 13B decoder-only vintage language model trained on pre-1931 English-language text and instruction-tuned with pre-1931 reference-work instruction data, followed by online DPO reinforcement learning.

Usage

Install the runtime dependencies first:

pip install "transformers>=5.6.0" "accelerate>=1.0.0" "safetensors>=0.5.0" "tiktoken>=0.6.0"
from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "Abstract4700/talkie-1930-13b-it-hf-transformers"

tok = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo,
    trust_remote_code=True,
    dtype="bfloat16",
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Write an essay predicting what life will be like in the year 1960."}
]
inputs = tok.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True).to(model.device)
out = model.generate(inputs, max_new_tokens=300, do_sample=True, temperature=0.7)
print(tok.decode(out[0][inputs.shape[1]:], skip_special_tokens=True))

Notes

  • The tokenizer is an exact tiktoken wrapper around the original vocab.txt.
  • The chat template is native to AutoTokenizer.apply_chat_template.
  • The model implements standard Transformers causal LM generation with KV caching.
  • The BF16 CUDA forward path mirrors the reference Talkie inference wrapper's autocast behavior internally.
  • generation_config.json disables top-k filtering by default to match the reference Talkie sampler defaults.
  • The checkpoint is stored as sharded safetensors.
  • Numerical parity against the reference Talkie implementation on a 195-token passage: max absolute logit difference 0.0, mean absolute logit difference 0.0.
Downloads last month
598
Safetensors
Model size
13B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Abstract4700/talkie-1930-13b-it-hf-transformers

Finetuned
(6)
this model
Finetunes
1 model