JayHyeon/Qwen_0.5-cDPO_5e-7_1.0vpo_constant_0.1label_smoothing
Text Generation
• 0.6B • Updated
• 1
JayHyeon/Qwen_0.5-DPO_5e-7_1.0vpo_constant
Text Generation
• 0.6B • Updated
• 4
JayHyeon/Qwen_3B-DPO_5e-7_1.0vpo_constant-1ep
Updated
JayHyeon/qwen_0.5-DPO_5e-7_1.0vpo_constant-1ep
Updated
JayHyeon/Llama-DPO_5e-7_1.0vpo_constant-1ep
Updated
JayHyeon/Qwen_0.5-DPO_3e-6_1.0vpo_constant-1ep
Updated
JayHyeon/pythia-2.8b-rDPO_5e-7_1.0vpo_constant-1ep_0.3label_smoothing
Text Generation
• 3B • Updated
JayHyeon/pythia-2.8b-cDPO_5e-7_1.0vpo_constant-1ep_0.3label_smoothing
Text Generation
• 3B • Updated
JayHyeon/pythia-2.8b-rDPO_5e-7_1.0vpo_constant-1ep_0.1label_smoothing
Text Generation
• 3B • Updated
• 1
JayHyeon/pythia-2.8b-cDPO_5e-7_1.0vpo_constant-1ep_0.1label_smoothing
Text Generation
• 3B • Updated
• 2
JayHyeon/pythia-2.8b-2e-5-1ep
Text Generation
• 3B • Updated
• 1
JayHyeon/Qwen_1.5B-math-rDPO_5e-7_0.3lsmooth-1.0vpo_constant-1ep
Text Generation
• 2B • Updated
• 2
JayHyeon/Qwen_1.5B-math-rDPO_5e-7_0.1lsmooth-1.0vpo_constant-1ep
Text Generation
• 2B • Updated
JayHyeon/Qwen_1.5B-math-cDPO_5e-7_0.3lsmooth-1.0vpo_constant-1ep
Text Generation
• 2B • Updated
JayHyeon/Qwen_1.5B-math-cDPO_5e-7_0.1lsmooth-1.0vpo_constant-1ep
Text Generation
• 2B • Updated
JayHyeon/Qwen_1.5B-math-cDPO_5e-7_1.0vpo_constant-1ep
Updated
JayHyeon/Qwen_0.5-ultrainteract_SPPO_5e-7-1ep
Text Generation
• 0.6B • Updated
JayHyeon/Qwen_0.5-ultrainteract_SLiC_5e-7-1ep
Text Generation
• 0.6B • Updated
JayHyeon/llama-DPOP_3e-6-1ep_0alp_0.5bdpo_lam_5dpop_lam
Updated
JayHyeon/llama-IRPO_3e-6-1ep_1alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-BDPO_3e-6-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-DPO_1e-6-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-DPOP_1e-6-1ep_0alp_0.5bdpo_lam_5dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-IRPO_1e-6-1ep_1alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
• 1
JayHyeon/llama-BDPO_1e-6-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
• 1
JayHyeon/llama-DPO_5e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-DPOP_5e-7-1ep_0alp_0.5bdpo_lam_5dpop_lam
Text Generation
• 1B • Updated
• 4
JayHyeon/llama-IRPO_5e-7-1ep_1alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-BDPO_5e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated
JayHyeon/llama-DPO_3e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam
Text Generation
• 1B • Updated