Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
4 days ago
mistralai/Mistral-Large-3-675B-Instruct-2512-BF16
liked
a dataset
5 days ago
natolambert/GeneralThought-430K-filtered