Archives for Brian Christain The Alignment Problem
The human labellers supervise and evaluate the model’s output by the tool, even if they have not read the books themselves.
The human labellers supervise and evaluate the model’s output by the tool, even if they have not read the books themselves.