Yu li

Yukkkop

AI & ML interests

None yet

Organizations

None yet

upvoted 4 papers about 10 hours ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 4 days ago • 42

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Paper • 2605.31584 • Published 4 days ago • 32

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 5 days ago • 84

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

Paper • 2605.31039 • Published 4 days ago • 30

upvoted 4 papers 2 days ago

Parallelized Hierarchical Connectome: A Spatiotemporal Recurrent Framework for Spiking State-Space Models

Paper • 2604.01295 • Published 13 days ago • 1

Scalable Learning in Structured Recurrent Spiking Neural Networks without Backpropagation

Paper • 2605.00402 • Published May 1 • 1

Triplet-Block Diffusion RWKV

Paper • 2605.25969 • Published 8 days ago • 20

Rethinking Cross-Layer Information Routing in Diffusion Transformers

Paper • 2605.20708 • Published 13 days ago • 109

upvoted a paper 4 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 6 days ago • 68

upvoted a paper 14 days ago

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Paper • 2604.18556 • Published Apr 20 • 7

upvoted 8 papers 22 days ago

Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration

Paper • 2605.05566 • Published 26 days ago • 37

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

Paper • 2602.06600 • Published Feb 6 • 3

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10, 2024 • 40

ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning

Paper • 2605.00380 • Published May 1 • 7

EMO: Pretraining Mixture of Experts for Emergent Modularity

Paper • 2605.06663 • Published 26 days ago • 12

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 26 days ago • 80

Introspective Diffusion Language Models

Paper • 2604.11035 • Published Apr 13 • 25

Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design

Paper • 2604.16279 • Published Apr 17 • 1

upvoted 2 articles 24 days ago

CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models

Article • lablab-ai-amd-developer-hackathon • 24 days ago • 10

EMO: Pretraining mixture of experts for emergent modularity

Article • allenai • 24 days ago • 38