Archives for The Alignment Problem in AI


The human labellers supervise and evaluate the model’s output by the tool, even if they have not read the books themselves.


The human labellers supervise and evaluate the model’s output by the tool, even if they have not read the books themselves.

