
Multimodal and Vision Study Guide

Vision, audio, diffusion, robotics, and reinforcement-learning resources for broader AI systems.

Step 1 (Required)
Computer Vision

Starts with a solid on-ramp into the topic.

Self-guided
Step 2 (Required)
AWS DeepRacer

Builds on the previous step with a complementary provider angle.

Self-guided
Step 3 (Required)
Optimize spend and performance with Azure AI Foundry Provisioned Throughput reservations

Builds on the previous step with a complementary provider angle.

Self-guided
Step 4 (Optional)
Create a multimodal analysis solution with Azure Content Understanding

Builds on the previous step with a complementary provider angle.

Self-guided
Step 5 (Optional)
Introduction to computer vision concepts

Builds on the previous step with a complementary provider angle.

Self-guided
Step 6 (Optional)
Read text in images

Builds on the previous step with a complementary provider angle.

Self-guided
Step 7 (Optional)
Get started with computer vision in Azure

Builds on the previous step with a complementary provider angle.

Self-guided
Step 8 (Optional)
Create vision models with Azure AI Custom Vision

Builds on the previous step with a complementary provider angle.

Self-guided
Step 9 (Optional)
Deep Reinforcement Learning Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 10 (Optional)
Robotics Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 11 (Optional)
Community Computer Vision Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 12 (Optional)
Audio Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 13 (Optional)
Diffusion Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 14 (Optional)
Develop a vision-enabled generative AI application

Builds on the previous step with a complementary provider angle.

Self-guided
Step 15 (Optional)
Develop a speech-capable generative AI application

Builds on the previous step with a complementary provider angle.

Self-guided
Step 16 (Optional)
Develop computer vision solutions with Microsoft Foundry

Builds on the previous step with a complementary provider angle.

Self-guided
Step 17 (Optional)
Develop computer vision solutions in Azure

Builds on the previous step with a complementary provider angle.

Self-guided
Step 18 (Optional)
Robotics Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 19 (Optional)
Community Computer Vision Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 20 (Optional)
Audio Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 21 (Optional)
Diffusion Course

Builds on the previous step with a complementary provider angle.

Self-guided
Step 22 (Optional)
Robotics Course

Rounds out the path with broader depth or specialization.

Self-guided
OpenCourseMap