Coursera

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Find the Best Multimodal AI Course for Your Goals

  • Status: New, Free Trial

    Skillshare

    AI Video Avatars: Launch Your Automated Social Media Machine

    Skills you'll gain: Animations, ChatGPT, Video Production, Generative AI, Education Software and Technology, Content Creation, AI Personalization, Prompt Engineering, Blogs, Scripting, Augmented and Virtual Reality (AR/VR), Media Production, Persona Development

    Mixed · Course · 1 - 4 Weeks

  • Status: Free Trial

    Coursera Instructor Network

    Next-Generation AI Assistant: Claude by Anthropic

    Skills you'll gain: Prompt Engineering, Anthropic Claude, Marketing Materials, Copywriting, ChatGPT, Drive Engagement, Generative AI, Product Promotion, Promotional Strategies, New Product Development

    Rating: 4.1 out of 5 stars · 19 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Introducing Multimodal Llama 3.2

    Skills you'll gain: Tool Calling, LLM Application, Multimodal Prompts, Prompt Patterns, Prompt Engineering, Large Language Modeling

    Rating: 4.5 out of 5 stars · 11 reviews

    Beginner · Project · Less Than 2 Hours

  • Status: Free

    DeepLearning.AI

    Practical Multi AI Agents and Advanced Use Cases with crewAI

    Skills you'll gain: LLM Application, Large Language Modeling, LangChain, Generative AI Agents, Artificial Intelligence and Machine Learning (AI/ML), Tool Calling, Agentic systems, Automation, Application Deployment, Test Execution Engine, Program Development, Test Case

    Rating: 4.8 out of 5 stars · 18 reviews

    Beginner · Project · Less Than 2 Hours

  • Status: Free Trial

    Vanderbilt University

    Generative AI Cybersecurity & Privacy for Leaders: A Primer

    Skills you'll gain: Responsible AI, Generative AI, Data Ethics, Business Ethics, Information Privacy, Threat Modeling, Personally Identifiable Information, Threat Detection, Security Awareness, Security Controls, Cybersecurity, Law, Regulation, and Compliance, Prompt Engineering

    Rating: 4.8 out of 5 stars · 63 reviews

    Beginner · Course · 1 - 3 Months

  • Status: New, Preview

    Whizlabs

    Exam Prep AI-900: Microsoft Certified Azure AI Fundamentals

    Skills you'll gain: Responsible AI, Generative AI, Microsoft Azure, Image Analysis, Computer Vision, OpenAI, Artificial Intelligence, Natural Language Processing, Machine Learning, Deep Learning, Text Mining, Data Science

    Beginner · Course · 1 - 3 Months

  • Status: Free

    Google Cloud

    Multimodality with Gemini

    Skills you'll gain: Google Gemini, Multimodal Prompts, Google Cloud Platform, Generative AI, Artificial Intelligence, LLM Application, Prompt Engineering

    Intermediate · Project · Less Than 2 Hours

  • Status: Free Trial

    Edureka

    Generative AI Applications and Popular Tools

    Skills you'll gain: Responsible AI, Artificial Intelligence, Content Creation, Computer Programming Tools, Python Programming

    Rating: 4.8 out of 5 stars · 12 reviews

    Beginner · Course · 1 - 3 Months

  • Status: Free

    Scrimba

    Build an AI Personal Assistant with a Vector Database

    Skills you'll gain: AI Personalization, Web Development, Web Applications, HTML and CSS, Prompt Engineering, Natural Language Processing, Javascript, Databases, Database Management, ChatGPT, LLM Application, Real Time Data, Generative AI Agents

    Intermediate · Guided Project · Less Than 2 Hours

  • Google Cloud

    Introduction to Vertex AI Embeddings: Text and Multimodal

    Skills you'll gain: Google Cloud Platform, Artificial Intelligence, Image Analysis, LLM Application, Cloud API, Generative AI, Text Mining

    Beginner · Project · Less Than 2 Hours

  • Google Cloud

    Use Vertex AI Studio for Healthcare

    Skills you'll gain: Prompt Engineering, Image Analysis, Google Gemini, Generative AI Agents, Generative AI, Google Cloud Platform, Health Informatics, Healthcare Industry Knowledge

    Beginner · Project · Less Than 2 Hours

  • Google Cloud

    Multimodal Retrieval Augmented Generation (RAG) using the Vertex AI Gemini API

    Skills you'll gain: Google Gemini, Multimodal Prompts, Query Languages, Data Manipulation, Data Store, Metadata Management, Text Mining, Cloud API, Generative AI, Google Cloud Platform, Image Analysis, Cloud Computing, Artificial Intelligence

    Rating: 4.1 out of 5 stars · 8 reviews

    Intermediate · Project · Less Than 2 Hours


In summary, here are 10 of our most popular multimodal AI courses:

  • AI Video Avatars: Launch Your Automated Social Media Machine (Skillshare)
  • Next-Generation AI Assistant: Claude by Anthropic (Coursera Instructor Network)
  • Introducing Multimodal Llama 3.2 (DeepLearning.AI)
  • Practical Multi AI Agents and Advanced Use Cases with crewAI (DeepLearning.AI)
  • Generative AI Cybersecurity & Privacy for Leaders: A Primer (Vanderbilt University)
  • Exam Prep AI-900: Microsoft Certified Azure AI Fundamentals (Whizlabs)
  • Multimodality with Gemini (Google Cloud)
  • Generative AI Applications and Popular Tools (Edureka)
  • Build an AI Personal Assistant with a Vector Database (Scrimba)
  • Introduction to Vertex AI Embeddings: Text and Multimodal (Google Cloud)

© 2025 Coursera Inc. All rights reserved.