Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Data)

updated 1 day ago
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated 13 days ago • 789

  • deqing/convergent-llama-300M-muon-unigram

    Text Generation • 0.3B • Updated 13 days ago • 265

  • deqing/convergent-llama-300M-muon-isolate-1

    Text Generation • 0.3B • Updated 11 days ago • 7.26k

  • deqing/convergent-llama-300M-muon-swap_numbers

    Text Generation • 0.3B • Updated 13 days ago • 299

  • deqing/convergent-llama-300M-muon-isolate-2

    Text Generation • 0.3B • Updated 10 days ago • 1.23k

  • deqing/convergent-llama-300M-muon-isolate-8

    Text Generation • 0.3B • Updated 10 days ago • 2.18k • 1

  • deqing/convergent-llama-300M-muon-window-2

    Text Generation • 0.3B • Updated 11 days ago • 7.53k

  • deqing/convergent-llama-300M-muon-window-4

    Text Generation • 0.3B • Updated 12 days ago • 8.52k

  • deqing/convergent-llama-300M-muon-window-8

    Text Generation • 0.3B • Updated 11 days ago • 3.76k

  • deqing/convergent-llama-300M-muon-window-64

    Text Generation • 0.3B • Updated 11 days ago • 1.21k • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs