GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling Paper • 2604.18556 • Published about 1 month ago • 5
Nonsense Helps: Prompt Space Perturbation Broadens Reasoning Exploration Paper • 2605.05566 • Published 14 days ago • 37
Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning Paper • 2602.06600 • Published Feb 6 • 3
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published Jun 10, 2024 • 40
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning Paper • 2605.00380 • Published 20 days ago • 7
EMO: Pretraining Mixture of Experts for Emergent Modularity Paper • 2605.06663 • Published 14 days ago • 11
Evaluating the Progression of Large Language Model Capabilities for Small-Molecule Drug Design Paper • 2604.16279 • Published Apr 17 • 1
lablab-ai-amd-developer-hackathon/CyberSecQwen-4B Text Generation • 4B • Updated 12 days ago • 715 • 11
view article Article CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models lablab-ai-amd-developer-hackathon • 12 days ago • 8
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 12 days ago • 37
ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset Paper • 2602.15210 • Published Feb 25 • 1
Kakugo: Distillation of Low-Resource Languages into Small Language Models Paper • 2601.14051 • Published Jan 20 • 1
Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials Paper • 2404.16829 • Published Apr 25, 2024 • 5
Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM Paper • 2512.21580 • Published Dec 25, 2025 • 9