Павлов Роман
tangqianyi
AI & ML interests
Agent systems for real-world tasks.
Recent Activity
upvoted a paper about 8 hours ago
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off
upvoted a paper about 9 hours ago
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
liked a model 5 days ago
tencent/HY-World-2.0
Organizations
None yet