Cerebras REAP Collection • Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 26 items
Gemma 3 QAT Collection • Quantization-Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory. • 15 items
Molmo Collection • Artifacts for open multimodal language models. • 5 items
Qwen2.5 Collection • Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models • Paper • arXiv:2407.11062 • Published Jul 10, 2024