SHILONG DENG

zczlsde

AI & ML interests

RL, NLP

Recent Activity

authored a paper 5 days ago

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

upvoted a paper 5 days ago

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

updated a model 4 months ago

zczlsde/qwen

upvoted a paper 5 days ago

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

Paper • 2602.05494 • Published 6 days ago • 2

upvoted 2 papers 4 months ago

Large Language Models Are Neurosymbolic Reasoners

Paper • 2401.09334 • Published Jan 17, 2024 • 3

QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL

Paper • 2510.00967 • Published Oct 1, 2025 • 12