Object detection forms the foundation of many other downstream computer vision tasks, such as image segmentation, image captions, object tracking, and more.