Portuguese LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: • 17 items • Updated 22 minutes ago • 43
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 64
Zephyr 7B Collection Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 8 items • Updated 2 days ago • 152