Efficient Feature Distillation for Zero-shot Annotation Object Detection Paper • 2303.12145 • Published Mar 21, 2023
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation Paper • 2308.03793 • Published Aug 4, 2023 • 12
Implicit Neural Representation Facilitates Unified Universal Vision Encoding Paper • 2601.14256 • Published Jan 20 • 7
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations Paper • 2606.11188 • Published 4 days ago • 24
SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization Paper • 2009.00726 • Published Sep 1, 2020
MixNorm: Test-Time Adaptation Through Online Normalization Estimation Paper • 2110.11478 • Published Oct 21, 2021
Large Language Models are Good Prompt Learners for Low-Shot Image Classification Paper • 2312.04076 • Published Dec 7, 2023
BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion Paper • 2605.11577 • Published May 12
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification Paper • 2103.16725 • Published Mar 30, 2021
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations Paper • 2606.11188 • Published 4 days ago • 24
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play Paper • 2509.25541 • Published Sep 29, 2025 • 142
Seedream 4.0: Toward Next-generation Multimodal Image Generation Paper • 2509.20427 • Published Sep 24, 2025 • 85
Atlas: Multi-Scale Attention Improves Long Context Image Modeling Paper • 2503.12355 • Published Mar 16, 2025 • 12
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference Paper • 2502.13542 • Published Feb 19, 2025 • 1
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling Paper • 2505.11196 • Published May 16, 2025 • 14
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling Paper • 2505.11196 • Published May 16, 2025 • 14
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling Paper • 2505.11196 • Published May 16, 2025 • 14 • 2
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Paper • 2502.17422 • Published Feb 24, 2025 • 7