euclaise

https://euclaise.xyz

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling

upvoted a paper 1 day ago

RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

liked a dataset 3 days ago

TuringEnterprises/Open-RL

Organizations

upvoted 2 papers 1 day ago

RAT: Bridging RNN Efficiency and Attention Accuracy in Language Modeling

Paper • 2507.04416 • Published Jul 6, 2025 • 1

RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

Paper • 2602.18196 • Published 26 days ago • 1

liked 8 datasets 3 days ago

liked a model 3 days ago

Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview

Text Generation • Updated Jan 15 • 227 • 52

upvoted 2 papers 3 days ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 9 days ago • 53

Lost in Backpropagation: The LM Head is a Gradient Bottleneck

Paper • 2603.10145 • Published 8 days ago • 10

liked a model 6 days ago

OpenMOSE/RWKV-GLM-4.7-Flash-exp

Text Generation • 30B • Updated about 6 hours ago • 152 • 2

liked a dataset 6 days ago

pszemraj/LocalLLaMA-comments

Viewer • Updated 6 days ago • 1.51M • 14 • 1

liked a model 13 days ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated 16 days ago • 2.27M • 900

liked 2 models 17 days ago

LiquidAI/LFM2-24B-A2B-GGUF

Text Generation • 24B • Updated 29 days ago • 35.7k • 109

LocoreMind/LocoOperator-4B

Text Generation • 4B • Updated 22 days ago • 17.2k • 288

upvoted 2 papers 17 days ago

Online Vector Quantized Attention

Paper • 2602.03922 • Published Feb 3 • 1

Softmax Linear Attention: Reclaiming Global Competition

Paper • 2602.01744 • Published Feb 2 • 1
