Rajdeep Borgohain
rbgo
AI & ML interests
Solving language barriers.
Organizations
LLM-Alignment Papers
-
Concrete Problems in AI Safety
Paper β’ 1606.06565 β’ Published β’ 1 -
The Off-Switch Game
Paper β’ 1611.08219 β’ Published β’ 1 -
Learning to summarize from human feedback
Paper β’ 2009.01325 β’ Published β’ 4 -
Truthful AI: Developing and governing AI that does not lie
Paper β’ 2110.06674 β’ Published β’ 1
All About LLMs
Finetuning
LLM-Alignment Papers
-
Concrete Problems in AI Safety
Paper β’ 1606.06565 β’ Published β’ 1 -
The Off-Switch Game
Paper β’ 1611.08219 β’ Published β’ 1 -
Learning to summarize from human feedback
Paper β’ 2009.01325 β’ Published β’ 4 -
Truthful AI: Developing and governing AI that does not lie
Paper β’ 2110.06674 β’ Published β’ 1
PPO Trainers
All About LLMs