Wei Xiong's picture

Wei Xiong

weqweasdas

·

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

upvoted a paper about 1 month ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

updated a dataset about 1 month ago

weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition

published a dataset about 1 month ago

weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition

View all activity

Organizations

upvoted a paper about 1 month ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31 • 12

updated a dataset about 1 month ago

weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition

Viewer • Updated Oct 26 • 5k • 98

published a dataset about 1 month ago

weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition

Viewer • Updated Oct 26 • 5k • 98

upvoted a paper about 2 months ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13 • 25

upvoted 2 papers 2 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6 • 15

commented a paper 2 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6 • 15 •

updated a dataset 2 months ago

weqweasdas/ultrafeedback_binarized_processed

Viewer • Updated Oct 4 • 61.1k • 25

published a dataset 2 months ago

weqweasdas/ultrafeedback_binarized_processed

Viewer • Updated Oct 4 • 61.1k • 25

updated a dataset 2 months ago

weqweasdas/qwen7b_prompt_difficult

Viewer • Updated Sep 29 • 15.7k • 46

published a dataset 2 months ago

weqweasdas/qwen7b_prompt_difficult

Viewer • Updated Sep 29 • 15.7k • 46

updated a dataset 2 months ago

weqweasdas/qwen7b_openr1_with_scores_sub

Viewer • Updated Sep 28 • 57.7k • 24

published a dataset 2 months ago

weqweasdas/qwen7b_openr1_with_scores_sub

Viewer • Updated Sep 28 • 57.7k • 24

updated a dataset 3 months ago

weqweasdas/qwen7b_openr1_with_scores_filtered_0375

Viewer • Updated Sep 25 • 24.3k • 17

published a dataset 3 months ago

weqweasdas/qwen7b_openr1_with_scores_filtered_0375

Viewer • Updated Sep 25 • 24.3k • 17

updated a dataset 3 months ago

weqweasdas/qwen7b_openr1_with_scores

Viewer • Updated Sep 23 • 75k • 39

published a dataset 3 months ago

weqweasdas/qwen7b_openr1_with_scores

Viewer • Updated Sep 23 • 75k • 39

updated a dataset 3 months ago

weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong

Viewer • Updated Sep 18 • 25k • 26

published a dataset 3 months ago

weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong

Viewer • Updated Sep 18 • 25k • 26

updated a dataset 3 months ago

weqweasdas/validate

Viewer • Updated Sep 16 • 1.68k • 31