Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections trending this week

Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers

Paper • 2311.10642 • Published Nov 17, 2023 • 25

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

text-generation

HuggingFaceH4/zephyr-7b-beta

Text Generation • 7B • Updated Oct 16, 2024 • 119k • • 1.84k

"Abracadabra NYC Coupon Code for Enchanting Deals"

Welcome to Abracadabra NYC, a land where magic and savings mix! <a href="https://www.coupontive.com/view/abracadabran"> abracadabra nyc coupon cod</a>

TheBloke/Tess-M-Creative-v1.0-GGUF

34B • Updated Nov 19, 2023 • 132 • 9
garage-bAInd/Platypus-30B

Text Generation • 33B • Updated Jan 3, 2024 • 957 • 17
MayaPH/GodziLLa2-70B

Text Generation • Updated Jan 12, 2024 • 938 • 38
QuixiAI/Samantha-1.11-70b

Text Generation • Updated May 20, 2024 • 121 • 67

Memory Augmented Language Models through Mixture of Word Experts

Paper • 2311.10768 • Published Nov 15, 2023 • 19
GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 246

Build error

Featured

342

Yi-34B-Chat

🔥

342

System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11, 2024 • 39
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 627

Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers

Paper • 2311.10642 • Published Nov 17, 2023 • 25

TheBloke/Tess-M-Creative-v1.0-GGUF

34B • Updated Nov 19, 2023 • 132 • 9
garage-bAInd/Platypus-30B

Text Generation • 33B • Updated Jan 3, 2024 • 957 • 17
MayaPH/GodziLLa2-70B

Text Generation • Updated Jan 12, 2024 • 938 • 38
QuixiAI/Samantha-1.11-70b

Text Generation • Updated May 20, 2024 • 121 • 67

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

Memory Augmented Language Models through Mixture of Word Experts

Paper • 2311.10768 • Published Nov 15, 2023 • 19
GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 246

text-generation

HuggingFaceH4/zephyr-7b-beta

Text Generation • 7B • Updated Oct 16, 2024 • 119k • • 1.84k

Build error

Featured

342

Yi-34B-Chat

🔥

342

"Abracadabra NYC Coupon Code for Enchanting Deals"

Welcome to Abracadabra NYC, a land where magic and savings mix! <a href="https://www.coupontive.com/view/abracadabran"> abracadabra nyc coupon cod</a>

System 2 Attention (is something you might need too)

Paper • 2311.11829 • Published Nov 20, 2023 • 43
Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11, 2024 • 39
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 627

Previous
1
...
17,884
17,885
17,886
17,887
17,888
...
18,806
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs