Alex

ShiningMaker

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked 2 models about 2 months ago

ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-7B-v1

Visual Document Retrieval • 8B • Updated Nov 4 • 75 • 17

ApsaraStackMaaS/EvoQwen2.5-VL-Retriever-3B-v1

Visual Document Retrieval • 4B • Updated Nov 4 • 8.52k • 12

upvoted a paper 3 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 120

upvoted 2 papers 4 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118

New activity in meituan/DeepSeek-R1-Block-INT8 10 months ago

After deploying with the latest sglang, I found that the responses when calling the interface were chaotic.

#13 opened 10 months ago by ShiningMaker

New activity in QuixiAI/DeepSeek-R1-AWQ 10 months ago

skips the thinking process

#5 opened 11 months ago by muzizon

updated 2 collections over 1 year ago

Qwen

Collection

0 items • Updated Mar 3

Qwen1.5

Collection

0 items • Updated Mar 3

liked a model over 1 year ago

Qwen/Qwen1.5-7B-Chat

Text Generation • 8B • Updated Apr 30, 2024 • 16.9k • 185