CompactAI-O/Shard-1
Tags: PyTorch, English, small-lm, gemma4-attention, muon, swiglu, experimental
License: apache-2.0
Branch: main
Path: Shard-1/models (336 MB)
1 contributor, History: 1 commit
Crownelius: Initial release: Shard-40m-v1 (54.5M dense transformer, anneal final), commit 025878f (verified), 1 day ago
model.pt (pickle, 109 MB, xet)
Detected Pickle imports (3): "torch.BFloat16Storage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Initial release: Shard-40m-v1 (54.5M dense transformer, anneal final), 1 day ago
pretrain.pt (pickle, 227 MB, xet)
Detected Pickle imports (4): "torch.FloatStorage", "torch.BFloat16Storage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
Initial release: Shard-40m-v1 (54.5M dense transformer, anneal final), 1 day ago
tokenizer.json (550 kB)
Initial release: Shard-40m-v1 (54.5M dense transformer, anneal final), 1 day ago
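
The "Detected Pickle imports" entries for model.pt and pretrain.pt name the globals (classes and functions) that a pickle stream would import when loaded. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes of a pickle stream with the standard-library pickletools module. It is not the Hub's actual scanner. The function name list_pickle_imports is hypothetical, and the handling of STACK_GLOBAL is a simplification that assumes the module and attribute names are the two most recently pushed string constants. It also assumes raw pickle bytes as input; a .pt file written by recent PyTorch versions is typically a zip archive, so the pickle member would have to be extracted first.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals referenced by a pickle stream.

    Hypothetical helper: the stream is only scanned, never unpickled.
    """
    imports = set()
    strings = []  # string constants seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name" (space-separated).
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Simplifying assumption: module and name are the last two strings pushed.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(imports)


if __name__ == "__main__":
    # Toy stand-in for a checkpoint: an OrderedDict, as in the listing above.
    demo = pickle.dumps(OrderedDict(weight=[0.0, 1.0]), protocol=4)
    print(list_pickle_imports(demo))
```

When actually loading a checkpoint like these, passing weights_only=True to torch.load restricts unpickling to tensor-related types, which is one way to limit exposure to arbitrary imports.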