Daily papers Collection by cb160 Nov 21, 2023 - Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25
LLM_Alignment Collection by whr94621 Nov 21, 2023 - Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 19
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 19
text-generation Collection by douglarek Nov 21, 2023 - HuggingFaceH4/zephyr-7b-beta Text Generation • 7B • Updated Oct 16, 2024 • 119k • • 1.84k
"Abracadabra NYC Coupon Code for Enchanting Deals" Welcome to Abracadabra NYC, a land where magic and savings mix! <a href="https://www.coupontive.com/view/abracadabran"> abracadabra nyc coupon cod</a> Collection by johnwilson456 Nov 21, 2023 1
To Test Collection by ninjaman12 Nov 26, 2023 - TheBloke/Tess-M-Creative-v1.0-GGUF 34B • Updated Nov 19, 2023 • 132 • 9 garage-bAInd/Platypus-30B Text Generation • 33B • Updated Jan 3, 2024 • 957 • 17 MayaPH/GodziLLa2-70B Text Generation • Updated Jan 12, 2024 • 938 • 38 QuixiAI/Samantha-1.11-70b Text Generation • Updated May 20, 2024 • 121 • 67
LLM_Eval Collection by whr94621 Nov 23, 2023 - Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 19 GAIA: a benchmark for General AI Assistants Paper • 2311.12983 • Published Nov 21, 2023 • 246
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 19
Attention Collection by amenur Mar 1, 2024 - System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 43 Transformers are Multi-State RNNs Paper • 2401.06104 • Published Jan 11, 2024 • 39 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 43
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
Daily papers Collection by cb160 Nov 21, 2023 - Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25
To Test Collection by ninjaman12 Nov 26, 2023 - TheBloke/Tess-M-Creative-v1.0-GGUF 34B • Updated Nov 19, 2023 • 132 • 9 garage-bAInd/Platypus-30B Text Generation • 33B • Updated Jan 3, 2024 • 957 • 17 MayaPH/GodziLLa2-70B Text Generation • Updated Jan 12, 2024 • 938 • 38 QuixiAI/Samantha-1.11-70b Text Generation • Updated May 20, 2024 • 121 • 67
LLM_Alignment Collection by whr94621 Nov 21, 2023 - Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 19
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2 Paper • 2311.10702 • Published Nov 17, 2023 • 19
LLM_Eval Collection by whr94621 Nov 23, 2023 - Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 19 GAIA: a benchmark for General AI Assistants Paper • 2311.12983 • Published Nov 21, 2023 • 246
Memory Augmented Language Models through Mixture of Word Experts Paper • 2311.10768 • Published Nov 15, 2023 • 19
text-generation Collection by douglarek Nov 21, 2023 - HuggingFaceH4/zephyr-7b-beta Text Generation • 7B • Updated Oct 16, 2024 • 119k • • 1.84k
"Abracadabra NYC Coupon Code for Enchanting Deals" Welcome to Abracadabra NYC, a land where magic and savings mix! <a href="https://www.coupontive.com/view/abracadabran"> abracadabra nyc coupon cod</a> Collection by johnwilson456 Nov 21, 2023 1
Attention Collection by amenur Mar 1, 2024 - System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 43 Transformers are Multi-State RNNs Paper • 2401.06104 • Published Jan 11, 2024 • 39 The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
System 2 Attention (is something you might need too) Paper • 2311.11829 • Published Nov 20, 2023 • 43
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627