Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Data)

updated 16 days ago
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated 28 days ago • 858

  • deqing/convergent-llama-300M-muon-unigram

    Text Generation • 0.3B • Updated 28 days ago • 303

  • deqing/convergent-llama-300M-muon-isolate-1

    Text Generation • 0.3B • Updated 26 days ago • 1.48k

  • deqing/convergent-llama-300M-muon-swap_numbers

    Text Generation • 0.3B • Updated 28 days ago • 330

  • deqing/convergent-llama-300M-muon-isolate-2

    Text Generation • 0.3B • Updated 25 days ago • 1.3k

  • deqing/convergent-llama-300M-muon-isolate-8

    Text Generation • 0.3B • Updated 25 days ago • 2.26k • 1

  • deqing/convergent-llama-300M-muon-window-2

    Text Generation • 0.3B • Updated 25 days ago • 7.47k

  • deqing/convergent-llama-300M-muon-window-4

    Text Generation • 0.3B • Updated 26 days ago • 4.62k

  • deqing/convergent-llama-300M-muon-window-8

    Text Generation • 0.3B • Updated 26 days ago • 3.69k

  • deqing/convergent-llama-300M-muon-window-64

    Text Generation • 0.3B • Updated 25 days ago • 1.27k • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs