From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 18 days ago • 256
Omnilingual ASR Media Transcription 🌍 • Running on A100 • Transcribe audio or video into text in multiple languages • 208
MiroThinker-v1.0 Collection • Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 7 items • Updated 7 days ago • 40
From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning Paper • 2509.23768 • Published Sep 28 • 48
MolmoAct Collection • All models for the MolmoAct (Multimodal Open Language Model for Action) release • 10 items • Updated 2 days ago • 31
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory Paper • 2506.18903 • Published Jun 23 • 22