Coursera

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Find the Best Multimodal AI Course for Your Goals

  • Google Cloud

    Gemini in Google Drive - Français

    Skills you'll gain: Google Gemini, Google Workspace, Gmail, Generative AI, File Management

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini in Google Drive - 日本語版

    Skills you'll gain: Google Gemini, Google Workspace, Prompt Engineering, Generative AI, Prompt Engineering Tools, File Management

    Beginner · Course · 1 - 4 Weeks

  • Clemson University

    Master of Science in Computer Science

    Skills you'll gain:

    Earn a degree

    Degree · 1 - 4 Years

  • University of London

    Bachelor of Science in Computer Science

    Skills you'll gain: Virtual Reality, Human Computer Interaction, Agile Software Development, Animations, Data Ethics, React Native, Event-Driven Programming, Game Design, Responsive Web Design, Web Applications, Natural Language Processing, Unsupervised Learning, Combinatorics, Node.JS, Network Security, Scikit Learn (Machine Learning Library), Secure Coding, Generative AI, Full-Stack Web Development, Usability Testing

    Earn a degree

    Degree · 1 - 4 Years

  • Status: New

    Google Cloud

    Google Threat Intelligence - 简体中文

    Skills you'll gain: Threat Detection, Cyber Threat Intelligence, Cyber Threat Hunting, Endpoint Detection and Response, Threat Modeling, Cybersecurity, Incident Response, Security Information and Event Management (SIEM), Continuous Monitoring, Vulnerability Management, Vulnerability Assessments, Google Gemini, Generative AI

    Intermediate · Course · 1 - 3 Months

  • Google Cloud

    Intégrer des applications avec Gemini 1.0 Pro sur Google Cloud

    Skills you'll gain: Google Gemini, Generative AI, LLM Application, Google Cloud Platform, Application Development, Application Programming Interface (API), Development Testing

    Beginner · Course · 1 - 4 Weeks

  • Google Cloud

    Gemini for Cloud Architects - 日本語版

    Skills you'll gain: Google Gemini, Google Cloud Platform, Kubernetes, Prompt Engineering, Cloud Infrastructure, Infrastructure as Code (IaC), Application Deployment, Unix Commands, Cloud Computing Architecture

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview

    LearnQuest

    Mastering Digital Prospecting Foundations

    Skills you'll gain: Sales Prospecting, Lead Generation, Cross-Channel Marketing, Sales Development, Global Marketing, Communication Strategies, Customer Engagement, LinkedIn, Sales Pipelines, Sales Strategy, AI Personalization, Personalized Service, Customer Relationship Management (CRM) Software, Automation

    Intermediate · Course · 1 - 4 Weeks

  • University of Illinois Urbana-Champaign

    The Future of Healthcare and Emerging Trends

    Skills you'll gain: Social Determinants Of Health, Health Equity, Digital Transformation, Business Modeling, Consumer Behaviour

    Build toward a degree

    Beginner · Course · 1 - 4 Weeks

  • University of Leeds

    Master of Science in Data Science (Statistics)

    Skills you'll gain: Data Ethics, Statistical Hypothesis Testing, Statistical Machine Learning, Regression Analysis, R (Software), Exploratory Data Analysis, Bayesian Statistics, Statistical Methods, Statistical Visualization, Classification And Regression Tree (CART), Network Analysis, Planning, Data Visualization, Data Manipulation, Data Analysis, Statistical Inference, Statistical Modeling, Linear Algebra, Artificial Intelligence and Machine Learning (AI/ML), Object Oriented Programming (OOP)

    Earn a degree

    Degree · 1 - 4 Years

  • Universidad de los Andes

    Maestría en Inteligencia Artificial

    Skills you'll gain: Real-Time Operating Systems, Supervised Learning, Semantic Web, LLM Application, Unsupervised Learning, Computer Vision, Cloud-Native Computing, Reinforcement Learning, Dimensionality Reduction, Natural Language Processing, Continuous Deployment, Biomedical Engineering, Project Closure, Artificial Intelligence, Deep Learning, Game Theory, Machine Learning, Responsible AI, CI/CD, Probability & Statistics

    Earn a degree

    Degree · 1 - 4 Years

  • Status: New

    Google Cloud

    Google Threat Intelligence 한국어

    Skills you'll gain: Cyber Threat Intelligence, Threat Detection, Threat Management, Cyber Threat Hunting, MITRE ATT&CK Framework, Cybersecurity, Incident Response, Google Gemini, Cloud Security, Vulnerability Assessments, AI Product Strategy, Interactive Data Visualization

    Intermediate · Course · 1 - 3 Months


In summary, here are 10 of our most popular multimodal AI courses:

  • Gemini in Google Drive - Français: Google Cloud
  • Gemini in Google Drive - 日本語版: Google Cloud
  • Master of Science in Computer Science: Clemson University
  • Bachelor of Science in Computer Science: University of London
  • Google Threat Intelligence - 简体中文: Google Cloud
  • Intégrer des applications avec Gemini 1.0 Pro sur Google Cloud: Google Cloud
  • Gemini for Cloud Architects - 日本語版: Google Cloud
  • Mastering Digital Prospecting Foundations: LearnQuest
  • The Future of Healthcare and Emerging Trends: University of Illinois Urbana-Champaign
  • Master of Science in Data Science (Statistics): University of Leeds


© 2025 Coursera Inc. All rights reserved.