Varad Pimpalkhute
DaoistKalki
AI & ML interests
Few-shot learning, generalization, multi-modality
Recent Activity
upvoted a paper 3 days ago
Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
upvoted a paper 6 months ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
upvoted a paper 6 months ago
The Path Not Taken: RLVR Provably Learns Off the Principals