Medical google/medgemma-1.5-4b-it Image-Text-to-Text • 4B • Updated 22 days ago • 113k • 606 google/medsiglip-448 Zero-Shot Image Classification • 0.9B • Updated Jul 10, 2025 • 42.8k • 137 google/medgemma-27b-it Image-Text-to-Text • Updated Jul 10, 2025 • 84.4k • 346 google/medgemma-27b-text-it Text Generation • Updated Sep 16, 2025 • 47.8k • • 427
Audio nvidia/audio-flamingo-3-hf Audio-Text-to-Text • 8B • Updated 23 days ago • 182k • 183 facebook/sam-audio-large Updated Dec 30, 2025 • 9.4k • 398 google/medasr Automatic Speech Recognition • Updated 7 days ago • 11.9k • 310 FunAudioLLM/Fun-CosyVoice3-0.5B-2512 Text-to-Speech • Updated Feb 3 • 282k • 536
OCR lightonai/LightOnOCR-1B-1025 Image-to-Text • Updated Feb 20 • 159k • 249 tencent/HunyuanOCR Image-Text-to-Text • 1.0B • Updated Jan 13 • 195k • 748 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 6 days ago • 96.6k • 609 PaddlePaddle/PP-DocLayoutV3 Image Segmentation • Updated Jan 30 • 30.3k • 73
Judge ai-forever/pollux-judge-32b Text Generation • 33B • Updated Jun 27, 2025 • 1.75k • • 5 ai-forever/pollux-judge-32b-r Text Generation • 33B • Updated Jun 27, 2025 • 7
Ru text encoders ai-forever/ru-en-RoSBERTa Feature Extraction • 0.4B • Updated Sep 26, 2024 • 299k • • 78 Tochka-AI/ruRoPEBert-e5-base-512 Feature Extraction • 0.1B • Updated Mar 13, 2024 • 15 Tochka-AI/ruRoPEBert-e5-base-2k Feature Extraction • 0.1B • Updated Mar 13, 2024 • 720 • 11
VLMs Qwen/Qwen2-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 3.26M • 1.27k NVEagle/Eagle-X5-13B-Chat Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 14 • 28 internlm/internlm-xcomposer2d5-7b Visual Question Answering • Updated Jul 22, 2024 • 392 • 210 AIRI-Institute/OmniFusion Updated Apr 10, 2024 • 59
VLA models nvidia/Alpamayo-R1-10B Robotics • 11B • Updated Mar 27 • 44.6k • 394 nvidia/GR00T-N1.6-3B Robotics • 3B • Updated Dec 15, 2025 • 29.1k • 87 tencent/HY-Embodied-0.5 Image-Text-to-Text • 4B • Updated 22 days ago • 2.9k • 905
Translate google/translategemma-12b-it Image-Text-to-Text • Updated Jan 28 • 16.7k • 294 tencent/HY-MT1.5-1.8B Translation • Updated Jan 1 • 22.1k • 1.16k google/translategemma-4b-it Image-Text-to-Text • 5B • Updated Jan 28 • 247k • 766
Video encoders google/videoprism-lvt-base-f16r288 Video Classification • Updated Jul 29, 2025 • 23.9k • 12 nvidia/omni-embed-nemotron-3b Sentence Similarity • 5B • Updated 11 days ago • 12.8k • 119
Datasets for Embodied agibot-world/AgiBotWorld-Alpha Viewer • Updated Sep 29, 2025 • 49.8M • 11.8k • 220 nvidia/PhysicalAI-Autonomous-Vehicles Updated 29 days ago • 251k • 861 genrobot2025/10Kh-RealOmin-OpenData Updated 12 days ago • 233k • 211
Text2Image stabilityai/stable-diffusion-3-medium Text-to-Image • Updated Aug 12, 2024 • 3.84k • • 4.95k black-forest-labs/FLUX.2-dev Image-to-Image • Updated Feb 17 • 212k • • 1.62k fal/FLUX.2-dev-Turbo Text-to-Image • Updated Dec 30, 2025 • 5.46k • • 368 black-forest-labs/FLUX.2-klein-4B Image-to-Image • Updated Feb 24 • 265k • • 662
Medical google/medgemma-1.5-4b-it Image-Text-to-Text • 4B • Updated 22 days ago • 113k • 606 google/medsiglip-448 Zero-Shot Image Classification • 0.9B • Updated Jul 10, 2025 • 42.8k • 137 google/medgemma-27b-it Image-Text-to-Text • Updated Jul 10, 2025 • 84.4k • 346 google/medgemma-27b-text-it Text Generation • Updated Sep 16, 2025 • 47.8k • • 427
VLA models nvidia/Alpamayo-R1-10B Robotics • 11B • Updated Mar 27 • 44.6k • 394 nvidia/GR00T-N1.6-3B Robotics • 3B • Updated Dec 15, 2025 • 29.1k • 87 tencent/HY-Embodied-0.5 Image-Text-to-Text • 4B • Updated 22 days ago • 2.9k • 905
Audio nvidia/audio-flamingo-3-hf Audio-Text-to-Text • 8B • Updated 23 days ago • 182k • 183 facebook/sam-audio-large Updated Dec 30, 2025 • 9.4k • 398 google/medasr Automatic Speech Recognition • Updated 7 days ago • 11.9k • 310 FunAudioLLM/Fun-CosyVoice3-0.5B-2512 Text-to-Speech • Updated Feb 3 • 282k • 536
Translate google/translategemma-12b-it Image-Text-to-Text • Updated Jan 28 • 16.7k • 294 tencent/HY-MT1.5-1.8B Translation • Updated Jan 1 • 22.1k • 1.16k google/translategemma-4b-it Image-Text-to-Text • 5B • Updated Jan 28 • 247k • 766
OCR lightonai/LightOnOCR-1B-1025 Image-to-Text • Updated Feb 20 • 159k • 249 tencent/HunyuanOCR Image-Text-to-Text • 1.0B • Updated Jan 13 • 195k • 748 PaddlePaddle/PaddleOCR-VL-1.5 Image-Text-to-Text • 1.0B • Updated 6 days ago • 96.6k • 609 PaddlePaddle/PP-DocLayoutV3 Image Segmentation • Updated Jan 30 • 30.3k • 73
Video encoders google/videoprism-lvt-base-f16r288 Video Classification • Updated Jul 29, 2025 • 23.9k • 12 nvidia/omni-embed-nemotron-3b Sentence Similarity • 5B • Updated 11 days ago • 12.8k • 119
Judge ai-forever/pollux-judge-32b Text Generation • 33B • Updated Jun 27, 2025 • 1.75k • • 5 ai-forever/pollux-judge-32b-r Text Generation • 33B • Updated Jun 27, 2025 • 7
Datasets for Embodied agibot-world/AgiBotWorld-Alpha Viewer • Updated Sep 29, 2025 • 49.8M • 11.8k • 220 nvidia/PhysicalAI-Autonomous-Vehicles Updated 29 days ago • 251k • 861 genrobot2025/10Kh-RealOmin-OpenData Updated 12 days ago • 233k • 211
Ru text encoders ai-forever/ru-en-RoSBERTa Feature Extraction • 0.4B • Updated Sep 26, 2024 • 299k • • 78 Tochka-AI/ruRoPEBert-e5-base-512 Feature Extraction • 0.1B • Updated Mar 13, 2024 • 15 Tochka-AI/ruRoPEBert-e5-base-2k Feature Extraction • 0.1B • Updated Mar 13, 2024 • 720 • 11
Text2Image stabilityai/stable-diffusion-3-medium Text-to-Image • Updated Aug 12, 2024 • 3.84k • • 4.95k black-forest-labs/FLUX.2-dev Image-to-Image • Updated Feb 17 • 212k • • 1.62k fal/FLUX.2-dev-Turbo Text-to-Image • Updated Dec 30, 2025 • 5.46k • • 368 black-forest-labs/FLUX.2-klein-4B Image-to-Image • Updated Feb 24 • 265k • • 662
VLMs Qwen/Qwen2-VL-7B-Instruct Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 3.26M • 1.27k NVEagle/Eagle-X5-13B-Chat Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 14 • 28 internlm/internlm-xcomposer2d5-7b Visual Question Answering • Updated Jul 22, 2024 • 392 • 210 AIRI-Institute/OmniFusion Updated Apr 10, 2024 • 59