YuvrajSingh9886/LFM2.5-350M-grpo-summarization-quality-bleu-rouge • Summarization • 0.4B • Updated 9 days ago • 125 • 1
Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning • Paper • 2605.06130 • Published 17 days ago • 110
Leveraging Verifier-Based Reinforcement Learning in Image Editing • Paper • 2604.27505 • Published 24 days ago • 57
FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption • Paper • 2604.28157 • Published 24 days ago • 2
Adam's Law: Textual Frequency Law on Large Language Models • Paper • 2604.02176 • Published Apr 2 • 503
Action Images: End-to-End Policy Learning via Multiview Video Generation • Paper • 2604.06168 • Published Apr 7 • 14
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning • Paper • 2604.02721 • Published Apr 3 • 629
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning • Paper • 2603.26653 • Published Mar 27 • 18
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models • Paper • 2603.18742 • Published Mar 19 • 10
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models • Paper • 2603.25716 • Published Mar 26 • 156