AI & ML interests

AI, Evaluations, RL

Recent Activity

lorenss updated a dataset about 1 month ago
hud-evals/SheetBench-50
jdchawla29 updated a dataset about 1 month ago
hud-evals/SheetBench-50
lorenss updated a dataset about 2 months ago
hud-evals/SpreadSheetBench-200

hud-evals's models

None public yet