Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning Paper • 2510.27623 • Published Oct 31 • 12
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving Paper • 2510.11769 • Published Oct 13 • 25
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training Paper • 2510.04996 • Published Oct 6 • 15
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training Paper • 2510.04996 • Published Oct 6 • 15 • 2
weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong Viewer • Updated Sep 18 • 25k • 26
weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong Viewer • Updated Sep 18 • 25k • 26