SHILONG DENG

zczlsde

AI & ML interests

RL, NLP

Recent Activity

authored a paper 5 days ago
A Unified Framework for Rethinking Policy Divergence Measures in GRPO
upvoted a paper 5 days ago
A Unified Framework for Rethinking Policy Divergence Measures in GRPO
updated a model 4 months ago
zczlsde/qwen

Organizations

COMP0087_GROUP8_22-23

upvoted a paper 5 days ago

A Unified Framework for Rethinking Policy Divergence Measures in GRPO

Paper • 2602.05494 • Published 6 days ago • 2
upvoted 2 papers 4 months ago

Large Language Models Are Neurosymbolic Reasoners

Paper • 2401.09334 • Published Jan 17, 2024 • 3

QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL

Paper • 2510.00967 • Published Oct 1, 2025 • 12