• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
Log In
Join for Free
Coursera
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Skip to search results

Filter by

Subject
Required
 *

Language
Required
 *

The language used throughout the course, in both instruction and assessments.

Learning Product
Required
 *

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn career credentials while taking courses that count towards your Master’s degree.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.
Earn a university-issued career credential in a flexible, interactive format.

Level
Required
 *

Duration
Required
 *

Skills
Required
 *

Subtitles
Required
 *

Educator
Required
 *

Find the Best Multimodal AI Course for Your Goals

  • G

    Google Cloud

    Configuring Vector Search in AlloyDB

    Skills you'll gain: Google Cloud Platform, Database Administration, Query Languages, Database Management Systems, Generative AI, Data Store, Machine Learning

    Beginner · Project · Less Than 2 Hours

  • Status: New
    New
    Status: Preview
    Preview
    C

    Coursera Instructor Network

    Master Compliance with MS 365 Copilot: EU Law

    Skills you'll gain: Compliance Management, Microsoft Copilot, Regulation and Legal Compliance, Compliance Reporting, Legal Technology, General Data Protection Regulation (GDPR), Microsoft 365, Audit Working Papers, Process Design, Workflow Management, Risk Analysis, Prompt Engineering

    Advanced · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    C

    Coursera

    Cleaning, Organizing, and Speeding Up SQL

    Skills you'll gain: Data Cleansing, Database Design, Data Integration, Data Import/Export, SQL, Data Maintenance, Data Manipulation, Database Management, Relational Databases, Data Quality, Stored Procedure, Data Integrity, Performance Tuning

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    W

    Whizlabs

    Exam Prep: Google Cloud Certified Cloud Digital Leader

    Skills you'll gain: Data Storage Technologies, Containerization, Cloud Infrastructure, Network Architecture

    Beginner · Course · 1 - 3 Months

  • G

    Google Cloud

    Introduction to Gemini for Google Workspace - Português

    Skills you'll gain: Google Gemini, Responsible AI, Productivity Software, Google Workspace, Generative AI, Data Ethics, Artificial Intelligence, Business Software, Content Creation

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Introduction to Gemini for Google Workspace - Indonesian

    Skills you'll gain: Google Gemini, Responsible AI, Google Workspace, Productivity Software, Generative AI, Google Docs, Artificial Intelligence, Gmail

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    G

    Google Cloud

    Create Image Captioning Models - 繁體中文

    Skills you'll gain: Image Analysis, Generative AI, Deep Learning, Generative Model Architectures, Large Language Modeling, Applied Machine Learning, Computer Vision

    Advanced · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    N

    Northeastern University

    Statistical Learning for Engineering Part 1

    Skills you'll gain: Unsupervised Learning, Supervised Learning, Regression Analysis, Applied Machine Learning, Statistical Modeling, Machine Learning Algorithms, PyTorch (Machine Learning Library), Statistical Methods, Statistical Machine Learning, Machine Learning, Predictive Analytics, Predictive Modeling, Artificial Intelligence and Machine Learning (AI/ML), Deep Learning, Unstructured Data, Probability & Statistics, Dimensionality Reduction, Algorithms

    Intermediate · Course · 1 - 3 Months

  • Status: New
    New
    G

    Google Cloud

    Websitemodernisierung mit generativer KI in Google Cloud

    Skills you'll gain: Generative AI, Web Content, AI Personalization, Web Analytics and SEO, Prompt Engineering, Google Cloud Platform, Large Language Modeling

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    G

    Google Cloud

    Agentspace로 더 신속하게 지식 교환하기

    Skills you'll gain: Google Workspace, OAuth, Productivity Software, Email Automation, Intranet, Generative AI Agents, Collaborative Software, LLM Application, Authentications, Enterprise Application Management, Information Architecture, Application Programming Interface (API), Identity and Access Management, Data Access, Data Store, Web Applications

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    G

    Google Cloud

    Acelera el intercambio de conocimientos con Agentspace

    Skills you'll gain: Productivity Software, Collaborative Software, OAuth, Google Cloud Platform, Generative AI Agents, AI Product Strategy, Enterprise Application Management, Data Integration, Application Programming Interface (API), Information Architecture, Agentic systems, AWS Identity and Access Management (IAM), Data Store

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Gemini in Google Meet - Português Brasileiro

    Skills you'll gain: Google Gemini, Google Workspace, Generative AI, AI Personalization, Image Quality, Language Interpretation, Translation, and Studies, Real Time Data

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal ai

build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
build a diy multimodal question answering system with vertex ai
1…213214215…231

In summary, here are 10 of our most popular multimodal ai courses

  • Configuring Vector Search in AlloyDB: Google Cloud
  • Master Compliance with MS 365 Copilot: EU Law: Coursera Instructor Network
  • Cleaning, Organizing, and Speeding Up SQL: Coursera
  • Exam Prep: Google Cloud Certified Cloud Digital Leader: Whizlabs
  • Introduction to Gemini for Google Workspace - Português: Google Cloud
  • Introduction to Gemini for Google Workspace - Indonesian: Google Cloud
  • Create Image Captioning Models - 繁體中文: Google Cloud
  • Statistical Learning for Engineering Part 1: Northeastern University
  • Websitemodernisierung mit generativer KI in Google Cloud: Google Cloud
  • Agentspace로 더 신속하게 지식 교환하기: Google Cloud

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Manage Cookie Preferences
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2025 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok