    Multimodal AI Courses

    Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

    Find the Best Multimodal AI Course for Your Goals

    • Status: New · Free Trial

      Pearson

      Programming Generative AI

      Skills you'll gain: Generative AI, Large Language Modeling, PyTorch (Machine Learning Library), Generative Model Architectures, Multimodal Prompts, Image Analysis, Computer Vision, Artificial Neural Networks, Natural Language Processing, Deep Learning, Prompt Engineering, Image Quality, Text Mining, Data Manipulation, Unsupervised Learning, Performance Tuning

      Intermediate · Specialization · 1 - 4 Weeks

    • Status: Free

      DeepLearning.AI

      Building Multimodal Search and RAG

      Skills you'll gain: Multimodal Prompts, LLM Application, Large Language Modeling, Generative AI, Image Analysis, Applied Machine Learning, Unsupervised Learning, Unstructured Data

      Rating: 4.5 out of 5 stars · 34 reviews

      Intermediate · Project · Less Than 2 Hours

    • Status: New · Free Trial

      University of Colorado Boulder

      Computer Vision

      Skills you'll gain: Image Analysis, Computer Vision, Computer Graphics, Unsupervised Learning, Multimodal Prompts, Artificial Intelligence and Machine Learning (AI/ML), Deep Learning, Visualization (Computer Graphics), Artificial Intelligence, Data Ethics, Computational Thinking, Generative AI, Applied Machine Learning, Generative Model Architectures, Linear Algebra, Data Processing, Data Transformation, Probability Distribution

      Build toward a degree

      Rating: 4.3 out of 5 stars · 12 reviews

      Intermediate · Specialization · 1 - 3 Months

    • Status: New · Free Trial

      Starweaver

      Executive AI Leadership Mastery

      Skills you'll gain: Responsible AI, Technology Roadmaps, Organizational Change, Stakeholder Engagement, Google Gemini, Anthropic Claude, Business Strategy, Strategic Leadership, Business Leadership, ChatGPT, Leadership, Business Transformation, Change Management, Content Strategy, Corporate Communications, Digital Media Strategy, Non-Verbal Communication, Verbal Communication Skills, Communication Strategies, Communication

      Rating: 4 out of 5 stars · 8 reviews

      Intermediate · Specialization · 1 - 4 Weeks

    • Status: New · Free Trial

      IBM

      Building AI Agents and Agentic Workflows

      Skills you'll gain: LangChain, Tool Calling, LangGraph, LLM Application, Agentic systems, Generative AI Agents, Responsible AI, Artificial Intelligence and Machine Learning (AI/ML), Generative AI, Application Design, Prompt Engineering, Large Language Modeling, Collaborative Software, Software Design Patterns, System Design and Implementation, Software Development, Python Programming, Application Development, Real Time Data, Data Science

      Rating: 4.8 out of 5 stars · 69 reviews

      Intermediate · Specialization · 1 - 3 Months

    • Status: New · Free Trial

      IBM

      IBM RAG and Agentic AI

      Skills you'll gain: Prompt Engineering, LangChain, Tool Calling, LangGraph, Agentic systems, Multimodal Prompts, Generative AI, LLM Application, Generative AI Agents, Responsible AI, OpenAI, Artificial Intelligence and Machine Learning (AI/ML), Application Design, Application Development, Large Language Modeling, UI Components, Semantic Web, Data Storage Technologies, Databases, Software Development

      Rating: 4.6 out of 5 stars · 309 reviews

      Advanced · Professional Certificate · 3 - 6 Months

    • Status: New · Free Trial

      Microsoft

      AI-powered Customer Intelligence with Microsoft Copilot

      Skills you'll gain: Microsoft Copilot, Prompt Engineering, Customer Insights, Sales Strategy, Customer Analysis, Competitive Analysis, Sales Pipelines, Microsoft 365, Persona Development, Data Cleansing, Sales Management, Data Quality, Sales, Anomaly Detection, Customer Relationship Management (CRM) Software, Data Ethics, Generative AI, Marketing Design, Marketing Automation, Customer experience improvement

      Rating: 4.6 out of 5 stars · 61 reviews

      Beginner · Specialization · 3 - 6 Months

    • Status: Free Trial

      Vanderbilt University

      Agentic AI and AI Agents for Leaders

      Skills you'll gain: Prompt Engineering, ChatGPT, Generative AI Agents, Prompt Patterns, Generative AI, Workflow Management, Agentic systems, LLM Application, Productivity, OpenAI, Artificial Intelligence, AI Personalization, Business Process Automation, AI Product Strategy, Personalized Service, Large Language Modeling, Automation, Responsible AI, Artificial Intelligence and Machine Learning (AI/ML), Expense Management

      Rating: 4.8 out of 5 stars · 7.9K reviews

      Beginner · Specialization · 1 - 3 Months

    • Status: New · Free Trial

      Edureka

      Generative AI for Automation

      Skills you'll gain: Prompt Patterns, Generative AI Agents, Business Process Automation, Make.com, Large Language Modeling, Automation, ChatGPT, Microsoft Power Automate/Flow, LLM Application, LangChain, Responsible AI, Workflow Management, OpenAI, Tool Calling, No-Code Development, Multimodal Prompts, Slack (Software), Process Optimization, Application Programming Interface (API), Decision Support Systems

      Rating: 4.7 out of 5 stars · 6 reviews

      Beginner · Specialization · 1 - 3 Months

    • Status: New · Free Trial

      Pearson

      Programming Generative AI: Unit 3

      Skills you'll gain: Multimodal Prompts, Generative AI, Generative Model Architectures, Image Analysis, Prompt Engineering, Image Quality, Computer Vision, Deep Learning, Natural Language Processing, Performance Tuning

      Intermediate · Course · 1 - 4 Weeks

    • Status: New · Free Trial

      University of Colorado Boulder

      Modern AI Models for Vision and Multimodal Understanding

      Skills you'll gain: Multimodal Prompts, Artificial Intelligence and Machine Learning (AI/ML), Linear Algebra

      Build toward a degree

      Advanced · Course · 1 - 4 Weeks

    • Status: Free Trial

      Microsoft

      Generative AI for Sales Professionals

      Skills you'll gain: Microsoft Copilot, Forecasting, Sales Strategy, Sales Presentation, Customer Analysis, Sales Pipelines, Sales Enablement, Data Cleansing, Sales Management, Time Series Analysis and Forecasting, Responsible AI, Sales, Customer Relationship Management (CRM) Software, Taking Meeting Minutes, Microsoft Teams, Email Automation, Customer Insights, Data Quality, Meeting Facilitation, Customer Data Management

      Rating: 4.5 out of 5 stars · 15 reviews

      Beginner · Specialization · 1 - 3 Months

    Searches related to multimodal AI

    build multimodal generative ai applications
    multimodal generative ai: vision, speech, and assistants
    modern ai models for vision and multimodal understanding
    introduction to vertex ai embeddings: text and multimodal
    multimodal rag with gpt – build smarter search & ai systems
    multimodal retrieval augmented generation (rag) using the vertex ai gemini api
    build a diy multimodal question answering system with vertex ai

    In summary, here are 10 of our most popular multimodal AI courses:

    • Programming Generative AI: Pearson
    • Building Multimodal Search and RAG: DeepLearning.AI
    • Computer Vision: University of Colorado Boulder
    • Executive AI Leadership Mastery: Starweaver
    • Building AI Agents and Agentic Workflows: IBM
    • IBM RAG and Agentic AI: IBM
    • AI-powered Customer Intelligence with Microsoft Copilot: Microsoft
    • Agentic AI and AI Agents for Leaders: Vanderbilt University
    • Generative AI for Automation: Edureka
    • Programming Generative AI: Unit 3: Pearson

    Frequently Asked Questions about Multimodal AI

    Browse the Multimodal AI courses below, which are popular starting points on Coursera.

    • Building Multimodal Search and RAG: DeepLearning.AI
    • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
    • Build Multimodal Generative AI Applications: IBM

    Yes, you can start learning Multimodal AI on Coursera for free by accessing the first module of many courses at no cost. This includes video lessons, readings, and even graded assignments, plus Coursera Coach support when available. If you want to keep learning, earn a certificate, or unlock the full course, you can upgrade or apply for financial aid.

    The specific skills and knowledge you will gain depend on the course you enroll in. Common skills include designing multimodal models; combining text, images, audio, and video; building multimodal applications; and applying them to chatbots, search, and creative tools.

    This FAQ content has been made available for informational purposes only. Learners are advised to conduct additional research to ensure that courses and other credentials pursued meet their personal, professional, and financial goals.

    Other topics to explore

    • Arts and Humanities: 338 courses
    • Business: 1095 courses
    • Computer Science: 668 courses
    • Data Science: 425 courses
    • Information Technology: 145 courses
    • Health: 471 courses
    • Math and Logic: 70 courses
    • Personal Development: 137 courses
    • Physical Science and Engineering: 413 courses
    • Social Sciences: 401 courses
    • Language Learning: 150 courses
