sumail's picture

sumail

sumailmao

·

chongqichuizi875

AI & ML interests

None yet

Recent Activity

commentedon a paper about 17 hours ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

updated a collection about 19 hours ago

Flow-DPPO: GenEval2

upvoted a paper about 20 hours ago

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet