25 Mar
A beginner’s guide to Video and Language Pre-training models (VL-PTMs)
Sourabh Mehta
AI

With recent advances in self-supervised learning, pre-training techniques play a vital role in learning visual and language representations.