transformer KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11 • 40
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11 • 40
video Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22 • 14
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22 • 14
microsoft phi 4 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 16 days ago • 288k • 1.55k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 16 days ago • 288k • 1.55k
foolingaround google/flan-t5-large 0.8B • Updated Jul 17, 2023 • 478k • 855 stepfun-ai/GOT-OCR-2.0-hf Image-Text-to-Text • 0.6B • Updated Jan 31 • 11.3k • 219
transformer KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11 • 40
KV Cache Steering for Inducing Reasoning in Small Language Models Paper • 2507.08799 • Published Jul 11 • 40
video Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22 • 14
Vidi: Large Multimodal Models for Video Understanding and Editing Paper • 2504.15681 • Published Apr 22 • 14
microsoft phi 4 microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 16 days ago • 288k • 1.55k
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 16 days ago • 288k • 1.55k
foolingaround google/flan-t5-large 0.8B • Updated Jul 17, 2023 • 478k • 855 stepfun-ai/GOT-OCR-2.0-hf Image-Text-to-Text • 0.6B • Updated Jan 31 • 11.3k • 219