Ashish Tanwer
ashishtanwer
AI & ML interests
None yet
Recent Activity
liked a model 6 days ago
dealignai/Gemma-4-31B-JANG_4M-CRACK
liked a model 6 days ago
poolside/Laguna-XS.2
liked a model 6 days ago
deepseek-ai/DeepSeek-V4-Flash
Organizations
RAG
DataLabelling
LLM
- AnyCoder
  Running • 3.26k
  Generate full app code from a simple description
- Qwen2.5 Coder Artifacts
  Running • Agents • Featured • 272
  Generate and preview web app code from a text description
- QwQ-32B-Preview
  Build error • Agents • Featured • 922
- Open LLM Leaderboard
  Runtime error • 14k
  Track, rank and evaluate open LLMs and chatbots
Evals
ClassicalML
Papers and resources for Classical ML
InfraML
Agents
Transformer
- sentence-transformers/all-mpnet-base-v2
  Sentence Similarity • 0.1B • Updated • 36.1M • 1.29k
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
  Paper • 1910.10683 • Published • 18
- google-t5/t5-base
  Translation • Updated • 2.57M • 774
- Attention Is All You Need
  Paper • 1706.03762 • Published • 122
DataCleaning
Dataset
- The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
  Paper • 2306.01116 • Published • 45
- HuggingFaceFW/fineweb
  Viewer • Updated • 52.5B • 909k • 2.79k
- tiiuae/falcon-refinedweb
  Viewer • Updated • 968M • 23k • 911
- LLaMA: Open and Efficient Foundation Language Models
  Paper • 2302.13971 • Published • 23
Training
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
  Paper • 1910.10683 • Published • 18
- AutoTrain: No-code training for state-of-the-art models
  Paper • 2410.15735 • Published • 59
- LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
  Paper • 2405.00732 • Published • 122
- LoRA: Low-Rank Adaptation of Large Language Models
  Paper • 2106.09685 • Published • 60
Diffusion
DataCrawling