If 80 per cent of machine learning work is data preparation, then ensuring data quality is the most important work of a machine learning team.