Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length • Paper 2404.08801 • Published Apr 12, 2024
Branch-Solve-Merge Improves Large Language Model Evaluation and Generation • Paper 2310.15123 • Published Oct 23, 2023
The Turking Test: Can Language Models Understand Instructions? • Paper 2010.11982 • Published Oct 22, 2020
RoBERTa: A Robustly Optimized BERT Pretraining Approach • Paper 1907.11692 • Published Jul 26, 2019
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models • Paper 2206.04615 • Published Jun 9, 2022
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation • Paper 2305.01569 • Published May 2, 2023