Flash-VStream: Efficient Real-Time Understanding for Long Video Streams
Paper
•
2506.23825
•
Published
We proposed Flash-VStream, an efficient VLM with a novel Flash Memory mechanism that enables real-time understanding and Q&A of extremely long video streams. Our model achieves outstanding accuracy and efficiency on EgoSchema, MLVU, LVBench, MVBench and Video-MME Benchmarks.
This project is licensed under the Apache 2.0 License.