Building AI Agents with Multimodal Models

Just as humans have multiple senses to perceive the world around them, computers have a variety of sensors to help them perceive the human world.

Generative AI, LLM, AI Agents, Machine Learning, Multimodal
Provider: NVIDIA DLI
Duration: 8 hrs
Mode: live
Pricing: not stated

Catalog checked Mar 16, 2026. Enrollment happens on the provider website; progress tracking happens here.


What you will cover

Deep learning, generative AI, LLM systems, AI agents, machine learning

Recommended next

LLM Foundations for Builders
A free, self-paced introduction to modern large language model systems.
Build with MCP
A guided introduction to Model Context Protocol concepts and tool-enabled apps.
Machine Learning Refresher
Refresh the statistics and ML foundations needed for advanced GenAI work.
Related

Keep the path moving

Verified free, basic

LLM Foundations for Builders

A free, self-paced introduction to modern large language model systems.

LLM, Generative AI, Prompt Engineering
5 hrs, self-paced, checked Mar 1, 2026
Verified free, amateur

Build with MCP

Hugging Face

A guided introduction to Model Context Protocol concepts and tool-enabled apps.

MCP, AI Agents
4 hrs, self-paced, checked Mar 8, 2026
Verified free, basic

Machine Learning Refresher

Refresh the statistics and ML foundations needed for advanced GenAI work.

Machine Learning, Python Foundations, Statistics
12 hrs, self-paced, checked Feb 22, 2026
Building AI Agents with Multimodal Models | OpenCourseMap