• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
  • Log In
  • Join for Free
    Coursera
    • Browse
    • Multimodal Ai

    Multimodal AI Courses

    Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

    Skip to search results

    Filter by

    Subject
    Required
     *

    Language
    Required
     *

    The language used throughout the course, in both instruction and assessments.

    Learning Product
    Required
     *

    Build job-relevant skills in under 2 hours with hands-on tutorials.
    Learn from top instructors with graded assignments, videos, and discussion forums.
    Learn a new tool or skill in an interactive, hands-on environment.
    Get in-depth knowledge of a subject by completing a series of courses and projects.
    Earn career credentials from industry leaders that demonstrate your expertise.
    Earn career credentials while taking courses that count towards your Master’s degree.
    Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.
    Earn a university-issued career credential in a flexible, interactive format.

    Level
    Required
     *

    Duration
    Required
     *

    Skills
    Required
     *

    Subtitles
    Required
     *

    Educator
    Required
     *

    Find the Best Multimodal AI Course for Your Goals

    • Status: Free Trial
      Free Trial
      G

      Google Cloud

      Görüntülere Altyazı Ekleme Modelleri Oluşturma

      Skills you'll gain: Generative AI, Image Analysis, Generative Model Architectures, Deep Learning, Applied Machine Learning, Computer Vision

      Advanced · Course · 1 - 4 Weeks

    • Status: New
      New
      G

      Google Cloud

      Agentspace로 더 신속하게 지식 교환하기

      Skills you'll gain: Data Store, Productivity Software, Data Access, Data Sharing, Data Storage, OAuth, Google Cloud Platform, Enterprise Application Management, Application Deployment, Information Architecture, Application Programming Interface (API), AWS Identity and Access Management (IAM)

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Gemini for DevOps Engineers - 简体中文

      Skills you'll gain: Google Gemini, Kubernetes, Google Cloud Platform, DevOps, Infrastructure as Code (IaC), Development Environment, CI/CD, Cloud Management

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Gemini for Network Engineers - Français

      Skills you'll gain: Google Gemini, Google Cloud Platform, Virtual Private Networks (VPN), Network Administration, Network Planning And Design, Cloud Management, Network Architecture, System Implementation

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Gemini for DevOps Engineers - Português Brasileiro

      Skills you'll gain: Google Gemini, Kubernetes, Google Cloud Platform, DevOps, Infrastructure as Code (IaC), Cloud Management, Development Environment, Artificial Intelligence

      Beginner · Course · 1 - 4 Weeks

    • U

      Universidad de los Andes

      Inteligencia Artificial: Machine learning, ética y nuevas tendencias Certificado MasterTrack®

      Skills you'll gain: Supervised Learning, Unsupervised Learning, Dimensionality Reduction, Anomaly Detection, Artificial Intelligence, Machine Learning, Regression Analysis, Probability & Statistics, Data Ethics, Image Analysis, Natural Language Processing, Linear Algebra, Responsible AI, Computer Vision, Embedded Systems, Machine Learning Algorithms, Applied Machine Learning, Statistical Machine Learning, Scikit Learn (Machine Learning Library), Probability

      Credit offered

      Mastertrack · 6 - 12 Months

    • G

      Google Cloud

      Gemini in Google Docs - 日本語版

      Skills you'll gain: Google Gemini, Generative AI, Prompt Engineering, Google Workspace, Technical Writing, Document Management, Grammar

      Beginner · Course · 1 - 4 Weeks

    • Status: New
      New
      G

      Google Cloud

      Acelere a troca de conhecimento com o Agentspace

      Skills you'll gain: OAuth, Generative AI Agents, LLM Application, Data Integration, Application Deployment, Data Access, Data Sharing, Enterprise Application Management, Application Programming Interface (API), Data Store, AWS Identity and Access Management (IAM), System Configuration

      Beginner · Course · 1 - 4 Weeks

    • O

      O.P. Jindal Global University

      MBA Business Analytics

      Skills you'll gain: Data Storytelling, Operations Management, Design Thinking, Active Listening, Business Ethics, Data Visualization, Sampling (Statistics), Working Capital, Database Management, Environmental Social And Corporate Governance (ESG), Project Estimation, Marketing Management, Business Analytics, Financial Statement Analysis, Financial Accounting, Predictive Analytics, Marketing Planning, Human Resources Management and Planning, Big Data, Regression Analysis

      Earn a degree

      Degree · 1 - 4 Years

    • G

      Google Cloud

      Gemini for end-to-end SDLC - Español

      Skills you'll gain: Google Gemini, Software Development Life Cycle, Google Cloud Platform, Development Testing, Web Applications, Application Lifecycle Management, Application Development, Software Development Tools, Debugging, Query Languages

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Gemini in Google Meet - 한국어

      Skills you'll gain: Google Gemini, Google Workspace, Generative AI, Collaborative Software, Image Quality

      Beginner · Course · 1 - 4 Weeks

    • G

      Google Cloud

      Mengintegrasikan Aplikasi dengan Gemini 1.0 Pro di GC

      Skills you'll gain: Google Gemini, Generative AI, Google Cloud Platform, LLM Application, Application Development, Application Programming Interface (API), Test Case

      Beginner · Course · 1 - 4 Weeks

    Searches related to multimodal ai

    build multimodal generative ai applications
    multimodal generative ai: vision, speech, and assistants
    modern ai models for vision and multimodal understanding
    introduction to vertex ai embeddings: text and multimodal
    multimodal rag with gpt – build smarter search & ai systems
    multimodal retrieval augmented generation (rag) using the vertex ai gemini api
    build a diy multimodal question answering system with vertex ai
    1…224225226227

    In summary, here are 10 of our most popular multimodal ai courses

    • Görüntülere Altyazı Ekleme Modelleri Oluşturma: Google Cloud
    • Agentspace로 더 신속하게 지식 교환하기: Google Cloud
    • Gemini for DevOps Engineers - 简体中文: Google Cloud
    • Gemini for Network Engineers - Français: Google Cloud
    • Gemini for DevOps Engineers - Português Brasileiro : Google Cloud
    • Inteligencia Artificial: Machine learning, ética y nuevas tendencias Certificado MasterTrack®: Universidad de los Andes
    • Gemini in Google Docs - 日本語版: Google Cloud
    • Acelere a troca de conhecimento com o Agentspace: Google Cloud
    • MBA Business Analytics: O.P. Jindal Global University
    • Gemini for end-to-end SDLC - Español: Google Cloud

    Other topics to explore

    Arts and Humanities
    338 courses
    Business
    1095 courses
    Computer Science
    668 courses
    Data Science
    425 courses
    Information Technology
    145 courses
    Health
    471 courses
    Math and Logic
    70 courses
    Personal Development
    137 courses
    Physical Science and Engineering
    413 courses
    Social Sciences
    401 courses
    Language Learning
    150 courses

    Coursera Footer

    Skills

    • Artificial Intelligence (AI)
    • Cybersecurity
    • Data Analytics
    • Digital Marketing
    • English Speaking
    • Generative AI (GenAI)
    • Microsoft Excel
    • Microsoft Power BI
    • Project Management
    • Python

    Certificates & Programs

    • Google Cybersecurity Certificate
    • Google Data Analytics Certificate
    • Google IT Support Certificate
    • Google Project Management Certificate
    • Google UX Design Certificate
    • IBM Data Analyst Certificate
    • IBM Data Science Certificate
    • Machine Learning Certificate
    • Microsoft Power BI Data Analyst Certificate
    • UI / UX Design Certificate

    Industries & Careers

    • Business
    • Computer Science
    • Data Science
    • Education & Teaching
    • Engineering
    • Finance
    • Healthcare
    • Human Resources (HR)
    • Information Technology (IT)
    • Marketing

    Career Resources

    • Career Aptitude Test
    • Examples of Strengths and Weaknesses for Job Interviews
    • High-Income Skills to Learn
    • How Does Cryptocurrency Work?
    • How to Highlight Duplicates in Google Sheets
    • How to Learn Artificial Intelligence
    • Popular Cybersecurity Certifications
    • Preparing for the PMP Certification
    • Signs You Will Get the Job After an Interview
    • What Is Artificial Intelligence?

    Coursera

    • About
    • What We Offer
    • Leadership
    • Careers
    • Catalog
    • Coursera Plus
    • Professional Certificates
    • MasterTrack® Certificates
    • Degrees
    • For Enterprise
    • For Government
    • For Campus
    • Become a Partner
    • Social Impact
    • Free Courses
    • Share your Coursera learning story

    Community

    • Learners
    • Partners
    • Beta Testers
    • Blog
    • The Coursera Podcast
    • Tech Blog

    More

    • Press
    • Investors
    • Terms
    • Privacy
    • Help
    • Accessibility
    • Contact
    • Articles
    • Directory
    • Affiliates
    • Modern Slavery Statement
    • Manage Cookie Preferences
    Learn Anywhere
    Download on the App Store
    Get it on Google Play
    Logo of Certified B Corporation
    © 2025 Coursera Inc. All rights reserved.
    • Coursera Facebook
    • Coursera Linkedin
    • Coursera Twitter
    • Coursera YouTube
    • Coursera Instagram
    • Coursera TikTok