Hkang/summarize_sft-test_lm-pythia1b-oai-summary-PPO-0KL-1ep-base_seed-42_numex-250 • Updated Apr 23, 2025 • 250 • 47
Hkang/summarize_sft-test_lm-pythia1b-oai-summary-PPO-0KL-1ep-final_seed-42_numex-250 • Updated Apr 23, 2025 • 250 • 21
Hkang/summarize_sft-test_lm-pythia1b-oai-summary-PPO-1ep-final_seed-42_numex-250_PPO-with-bt-sft_64 • Updated Apr 22, 2025 • 250 • 64
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_PPO-sft_64 • Updated Apr 22, 2025 • 250 • 85
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_6KBON_only-BON_64 • Updated Apr 21, 2025 • 250 • 28
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_4KBON_only-BON_64 • Updated Apr 20, 2025 • 250 • 26
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_3KBON_only-BON_64 • Updated Apr 20, 2025 • 250 • 10
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_1KBON_only-BON_64 • Updated Apr 20, 2025 • 250 • 66
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_9Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 44
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_8Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 8
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_6Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 6
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_4Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 2
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_3Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 61
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_1Kreward_only-BON_64 • Updated Apr 20, 2025 • 250 • 64
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_12Knew-BON_64 • Updated Apr 20, 2025 • 250 • 8
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_11Knew-BON_64 • Updated Apr 20, 2025 • 250 • 100
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_9Knew-BON_64 • Updated Apr 19, 2025 • 250 • 5
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_8Knew-BON_64 • Updated Apr 19, 2025 • 250 • 66
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_6Knew-BON_64 • Updated Apr 19, 2025 • 250 • 57
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_4Knew-BON_64 • Updated Apr 18, 2025 • 250 • 7
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_3Knew-BON_64 • Updated Apr 18, 2025 • 250 • 67
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_1Knew-BON_64 • Updated Apr 18, 2025 • 250 • 10
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_16K-BON_64 • Updated Apr 17, 2025 • 250 • 8
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_14K-BON_64 • Updated Apr 17, 2025 • 250 • 45
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_12K-BON_64 • Updated Apr 17, 2025 • 250 • 6
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_11K-BON_64 • Updated Apr 17, 2025 • 250 • 10
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_9K-BON_64 • Updated Apr 16, 2025 • 250 • 63
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_8K-BON_64 • Updated Apr 16, 2025 • 250 • 64
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_6K-BON_64 • Updated Apr 16, 2025 • 250 • 30
Hkang/summarize_sft-test_lm-EleutherAI_pythia-1b_seed-42_numex-250_lr3e8_4K-BON_64 • Updated Apr 15, 2025 • 250 • 6