388 518

Yu li

Yukkkop

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

liked a Space 5 days ago

victor/nanbeige

upvoted a paper 10 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

View all activity

Organizations

None yet

upvoted a paper 2 days ago

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Paper • 2604.18556 • Published about 1 month ago • 5

liked a Space 5 days ago

Nanbeige 4.1 3B

🔮

Chat with Nanbeige AI locally in your browser

upvoted 8 papers 10 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 14 days ago • 37

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

Paper • 2602.06600 • Published Feb 6 • 3

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 40

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Paper • 2605.00380 • Published 20 days ago • 7

Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design

Paper • 2604.16279 • Published Apr 17 • 1

liked 2 models 12 days ago

oumoumad/ltx-2.3-dearchive-lora

Video-to-Video • Updated 11 days ago • 33

lablab-ai-amd-developer-hackathon/CyberSecQwen-4B

Text Generation • 4B • Updated 12 days ago • 715 • 11

upvoted 2 articles 12 days ago

Article

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

lablab-ai-amd-developer-hackathon

•

12 days ago

• 8

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

12 days ago

• 37

upvoted 6 papers 14 days ago

Generative Video Motion Editing with 3D Point Tracks

Paper • 2512.02015 • Published Dec 1, 2025 • 4

ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset

Paper • 2602.15210 • Published Feb 25 • 1

Kakugo: Distillation of Low-Resource Languages into Small Language Models

Paper • 2601.14051 • Published Jan 20 • 1

BYOL: Bring Your Own Language Into LLMs

Paper • 2601.10804 • Published Jan 15 • 1

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Paper • 2404.16829 • Published Apr 25, 2024 • 5

Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM

Paper • 2512.21580 • Published Dec 25, 2025 • 9