3 18 41

Ramanauskiene Edita

EditaZ

https://github.com/EditaNEmilis

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted an article 29 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

liked a model 30 days ago

Tongyi-MAI/Z-Image-Turbo

View all activity

Organizations

None yet

upvoted a paper 27 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 28 days ago • 241

upvoted an article 29 days ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

30 days ago

•

259

upvoted a paper about 1 month ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19 • 226

upvoted a paper 2 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27 • 177

upvoted a paper 3 months ago

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

upvoted 2 papers 4 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25 • 346

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 660

upvoted an article 4 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

748

upvoted a paper 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 259

upvoted 2 papers 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 266

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 316

upvoted a paper 6 months ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22 • 66

upvoted a paper over 1 year ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 115

upvoted an article over 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

•

241

upvoted a paper over 1 year ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 157

upvoted an article over 1 year ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

•

295

upvoted a paper almost 2 years ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

upvoted a paper about 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260

Ramanauskiene Edita

AI & ML interests

Recent Activity

Organizations

EditaZ's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Uncensor any LLM with abliteration

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Welcome Llama 3 - Meta's new open LLM