Collections
Collections including paper arxiv:2403.09029
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129 -
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55 -
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Paper • 2403.09394 • Published • 27
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55 -
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Paper • 2402.14652 • Published -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 17
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55 -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 12.9k • 378 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 366 • 191 -
laion/filtered-wit
Viewer • Updated • 2.8M • 7.03k • 10
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 81 -
bigcode/starcoder2-15b
Text Generation • 16B • Updated • 5.43k • 649 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 122 -
mixedbread-ai/mxbai-rerank-large-v1
Text Ranking • 0.4B • Updated • 30.4k • 136
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 25 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 72 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 78
-
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 12.9k • 378 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 366 • 191 -
Screenshot to HTML
⚡ 911 • Convert screenshots to HTML code
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55