·
AI & ML interests
None yet
Organizations
andrewsiah/Qwen-2.5-1.5B-Instruct-Datamix
Text Generation
•
2B
•
Updated
•
10
andrewsiah/Qwen-2.5-7B-Simple-RL
Text Generation
•
8B
•
Updated
•
6
andrewsiah/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
5
andrewsiah/Qwen2.5-1.5B-Open-R1-Distill
Updated
Reinforcement Learning
•
Updated
andrewsiah/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
5
Reinforcement Learning
•
Updated
andrewsiah/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
•
23
andrewsiah/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
1