Emiko Takahashi

lazy-45

AI & ML interests

None yet

Recent Activity

liked a model 9 days ago

google-bert/bert-base-uncased

Organizations

None yet

upvoted a paper 5 days ago

SOD: Step-wise On-policy Distillation for Small Language Model Agents

Paper • 2605.07725 • Published 24 days ago • 25

upvoted a paper 7 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 25 days ago • 231

upvoted 2 papers 9 days ago

Training Large Language Models to Predict Clinical Events

Paper • 2605.12817 • Published 20 days ago • 17

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 12 days ago • 204

upvoted 2 papers 11 days ago

OmniHumanoid: Streaming Cross-Embodiment Video Generation with Paired-Free Adaptation

Paper • 2605.12038 • Published 20 days ago • 4

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 20 days ago • 195

upvoted a paper 18 days ago

Audio-Visual Intelligence in Large Foundation Models

Paper • 2605.04045 • Published 27 days ago • 35

upvoted a paper 21 days ago

UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

Paper • 2605.06597 • Published 25 days ago • 15

upvoted a paper 23 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published 25 days ago • 111

upvoted a paper 24 days ago

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

Paper • 2605.05185 • Published 26 days ago • 101

upvoted a paper about 1 month ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

upvoted 5 papers about 2 months ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Towards a Medical AI Scientist

Paper • 2603.28589 • Published Mar 30 • 90

upvoted 4 papers 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 343

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 352

SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise

Paper • 2602.12783 • Published Feb 13 • 246

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 136
