Shirley Wu's picture

15

Shirley Wu

shirwu

·

https://cs.stanford.edu/~shirwu/

AI & ML interests

None yet

Organizations

shirwu 's models 44

shirwu/official-hotpotqa-hotpotqa_four_agents_pipeline-hint_generator-iter0

Updated Mar 6, 2025

shirwu/official-hotpotqa-hotpotqa_four_agents_pipeline-answer_generator-iter0

Updated Mar 6, 2025

shirwu/debug_state_dict

Updated Feb 17, 2025

shirwu/trainsize200_iter3_rerun-hotpotqa-hotpotqa_two_agents_pipeline-answer_generator-iter0

Updated Feb 16, 2025

shirwu/trainsize200_iter3-hotpotqa-hotpotqa_two_agents_pipeline-answer_generator-iter0

Updated Feb 16, 2025

shirwu/debug

Text Classification • 8B • Updated Feb 16, 2025 • 8

shirwu/default

Updated Feb 16, 2025

shirwu/iter_debug-hotpotqa-hotpotqa_two_agents_pipeline-answer_generator-iter0

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-hint_generator-iter2

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-answer_generator-iter2

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-hint_generator-iter1

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-answer_generator-iter1

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-hint_generator-iter0

Updated Feb 16, 2025

shirwu/hotpotqa_two_agents_pipeline-answer_generator-iter0

Updated Feb 16, 2025

shirwu/reward_model_train_final

Updated Feb 16, 2025

shirwu/reward_model_train_debug

1B • Updated Feb 16, 2025 • 7

shirwu/output

Text Classification • 8B • Updated Feb 16, 2025 • 3

shirwu/test_save_load2

Text Classification • 1B • Updated Feb 16, 2025 • 3

shirwu/test_save_load

Text Classification • 1B • Updated Feb 16, 2025 • 16

shirwu/rm_final_Llama-3.1-1B-Instruct

Text Classification • 1B • Updated Feb 15, 2025 • 5

shirwu/rm_final_Llama-3.1-8B-Instruct

Text Classification • 8B • Updated Feb 15, 2025 • 7

shirwu/iter_debug

Text Classification • 8B • Updated Feb 14, 2025 • 6

shirwu/rm_Llama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rm__freezelast_oldtemplatequantLlama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rmlr-2e-6freezelast_oldtemplatequantLlama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rmlr-1e-5freezelast_oldtemplatequantLlama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rm_freezelast_oldtemplate_quant_Llama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rm_freeze-last_quant_Llama-3.1-8B-Instruct

Updated Feb 14, 2025

shirwu/rm_freeze-last_quant_Skywork-Reward-Llama-3.1-8B-v0.2

Updated Feb 14, 2025

shirwu/rm_unfreeze_last_Llama-3.1-8B-Instruct

Updated Feb 14, 2025