Alexey Gorbatovski's picture

3 8

Alexey Gorbatovski

Myashka

·

Myashka

AI & ML interests

NLP Alignment

Recent Activity

authored a paper 5 days ago

ESSA: Evolutionary Strategies for Scalable Alignment

upvoted a paper 23 days ago

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

commented on a paper about 2 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

View all activity

Organizations

None yet

Papers 5

arxiv:2507.04453

arxiv:2502.01237

arxiv:2404.09656

arxiv:2402.10644

models 37

Myashka/Qwen2.5-7B-UltraChat200K_EMA_SFT-Lr_3e_6-Alpha_0.01

Text Generation • 8B • Updated Sep 1 • 4

Myashka/Qwen2.5-7B-UltraChat200K_SFT-Lr_3e_6

8B • Updated Sep 1 • 7

Myashka/gpt-imdb-kto-beta_0.1

Text Generation • 0.1B • Updated Dec 17, 2023 • 9

Myashka/gpt-imdb-hinge-beta_0.1

Text Generation • 0.1B • Updated Dec 9, 2023 • 12

Myashka/gpt-imdb-dpo_annealing

Text Generation • 0.1B • Updated Dec 9, 2023 • 4

Myashka/gpt-imdb-alpha_0.3-beta_0.1

Text Generation • 0.1B • Updated Dec 9, 2023 • 8

Myashka/gpt-imdb-ipo-beta_0.3

Text Generation • 0.1B • Updated Dec 9, 2023 • 9

Myashka/gpt-imdb-ipo-beta_0.1

Text Generation • 0.1B • Updated Dec 9, 2023 • 23

Myashka/gpt-imdb-ipo_annealing

Text Generation • 0.1B • Updated Dec 9, 2023 • 4

Myashka/gpt-imdb-alpha_0.5-beta_0.1

Text Generation • 0.1B • Updated Dec 9, 2023 • 10

datasets 11

Myashka/CryptoNews_50_50

Viewer • Updated Mar 23, 2024 • 1.15k • 57

Myashka/CryptoNews

Viewer • Updated Mar 17, 2024 • 1.15k • 27

Myashka/gpt2-imdb-constractive

Viewer • Updated Dec 4, 2023 • 59.1k • 50

Myashka/SO_Python_basics_QA_human_pref

Viewer • Updated Nov 5, 2023 • 185k • 31

Myashka/SO-Python_basics_QA-filtered-2023-T5_paraphrased-tanh_score

Viewer • Updated Aug 23, 2023 • 117k • 40

Myashka/SO_Python_basics_QA_human_preferences_no_gen

Viewer • Updated Jul 29, 2023 • 6.17k • 43

Myashka/SO-Python_basics_QA-filtered-2023-tanh_score

Viewer • Updated Jul 25, 2023 • 30k • 24

Myashka/SO-Python_QA-filtered-2023-no_code-tanh_score

Viewer • Updated Jul 18, 2023 • 66.1k • 98 • 2

Myashka/SO-Python_QA-filtered-2023-tanh_score

Viewer • Updated Jul 14, 2023 • 69.5k • 33

Myashka/SO-Python_QA-filtered-2023-tanh_score-after_2023_02

Viewer • Updated Jul 13, 2023 • 1.06k • 34 • 1

View 11 datasets