Modern AI Models for Vision and Multimodal Understanding is a course that will enable you to understand and build systems that interpret images, text, and more—just like today’s leading AI models.