Northeastern University
Data Warehousing and Integration Part 2


Included with Coursera Plus

Gain insight into a topic and learn the fundamentals.
1 week to complete
Under 10 hours per week
Flexible schedule
Learn at your own pace

Skills you'll gain

  • CI/CD
  • Scalability
  • Data Warehousing
  • Amazon Redshift
  • Data Governance
  • Data Quality
  • Cloud Computing
  • Data Pipelines
  • Data Architecture
  • Extract, Transform, Load
  • Infrastructure as Code (IaC)
  • Analytics
  • Database Architecture and Administration
  • Amazon S3
  • Cloud Computing Architecture
  • Data Transformation
  • DevOps
  • Data Integration

Key details

Shareable certificate

Add to your LinkedIn profile

Recently updated!

August 2025

Assessments

9 assignments

Taught in English


There are 6 modules in this course

In this module, you'll learn about ETL (Extract, Transform, Load) processes, an essential part of Data Warehousing and Data Integration solutions. ETL processes can be complex and costly, but effective design and modeling can significantly reduce development and maintenance costs. You'll be introduced to the basics of Business Process Modeling Notation (BPMN), which is crucial for modeling business processes, and its key components: flow objects, gateways, events, and artifacts. You will then explore how BPMN can be adapted for the conceptual modeling of ETL tasks, with a particular focus on differentiating control tasks from data tasks. Control tasks manage the orchestration of ETL processes, while data tasks handle data manipulation; both are critical in conceptualizing ETL workflows. By the end of this module, you'll gain a solid understanding of how to design ETL processes using BPMN, enabling greater flexibility and adaptability across various tools.
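The control-task/data-task distinction can be made concrete outside of any diagramming tool. The sketch below is a minimal Python illustration (not taken from the course, and using hypothetical task names): the data tasks manipulate records, while the single control task only sequences and invokes them.

```python
from typing import Callable, List

# Data tasks: manipulate records (the "what" of the ETL flow).
def extract_orders() -> List[dict]:
    # Hypothetical source; a real task would query a database or read files.
    return [{"order_id": 1, "amount": "19.99"}, {"order_id": 2, "amount": "5.00"}]

def cast_amounts(rows: List[dict]) -> List[dict]:
    return [{**r, "amount": float(r["amount"])} for r in rows]

def load_orders(rows: List[dict]) -> None:
    print(f"loading {len(rows)} rows into the warehouse")

# Control task: orchestrates the sequence of steps (the "when" of the flow)
# without touching row contents itself.
def run_pipeline(steps: List[Callable]) -> None:
    data = None
    for step in steps:
        data = step(data) if data is not None else step()

run_pipeline([extract_orders, cast_amounts, load_orders])
```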

What's included

2 videos, 8 readings, 2 assignments

In this module, you will dive into Talend Studio, a powerful Eclipse-based data integration platform that transforms complex ETL operations into intuitive visual workflows. By exploring Talend's drag-and-drop interface, you will learn to navigate the core components of the platform. You'll master fundamental ETL operations by studying essential components like tMap for complex data transformations and joins, tJoin for straightforward data linking, and various input/output components for connecting to databases, files, and APIs. By the end of the module, you will understand how Talend automatically generates executable Java code from visual designs, enabling you to create scalable, production-ready data integration solutions that can handle both batch processing and real-time data scenarios across diverse technological environments.
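Talend itself generates Java from its visual jobs, but the behavior of a tMap-style lookup join with an output expression can be sketched conceptually. The Python snippet below is an illustrative analogue only; the column names, lookup data, and reject handling are assumptions, not Talend-generated code.

```python
# Conceptual equivalent of a tMap lookup join plus a field transformation.
customers = {101: "Acme Corp", 102: "Globex"}          # lookup input
orders = [
    {"order_id": 1, "customer_id": 101, "amount": 250.0},
    {"order_id": 2, "customer_id": 103, "amount": 75.0},  # no matching customer
]

joined, rejects = [], []
for row in orders:
    name = customers.get(row["customer_id"])
    if name is None:
        rejects.append(row)  # analogous to an inner-join reject output
        continue
    joined.append({
        "order_id": row["order_id"],
        "customer": name,
        "amount_usd": round(row["amount"], 2),  # expression applied to an output column
    })

print(joined)
print(rejects)
```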

What's included

3 readings, 1 assignment

In this module, we transition from on-premises Data Warehousing to Data Engineering. While Data Engineering has its roots in Data Warehousing, it encompasses much more. We’ll explore the key enablers of this evolution, specifically cloud computing and DevOps. You will learn about the benefits of cloud development, including enhanced scalability, cost efficiency, and flexibility in data operations. We will also dive into how traditional IT infrastructure components—such as security, networking, and compute resources—are redefined in cloud environments using AWS. Additionally, you'll gain an understanding of DevOps in the cloud, focusing on the use of virtual machines and containers to streamline continuous integration and deployment. We will cover key DevOps practices like Infrastructure as Code (IaC), CI/CD pipelines, and automated testing, emphasizing their role in ensuring consistency, faster development cycles, and secure applications. You will then explore what data engineering entails and the skills required to become a data engineer. Finally, we’ll introduce the concept of the data engineering lifecycle and its different phases, focusing on the first two: Data Generation and Storage.
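To make the Infrastructure as Code idea concrete, here is a minimal sketch assuming the AWS CDK v2 Python bindings are installed; the stack and bucket names are illustrative and not part of the course materials.

```python
# Minimal Infrastructure-as-Code sketch using the AWS CDK for Python (v2).
# Deploying this app with `cdk deploy` would provision the bucket;
# all names are placeholders.
from aws_cdk import App, Stack, RemovalPolicy
from aws_cdk import aws_s3 as s3
from constructs import Construct

class DataPlatformStack(Stack):
    def __init__(self, scope: Construct, construct_id: str, **kwargs) -> None:
        super().__init__(scope, construct_id, **kwargs)
        # Declaring the bucket in code keeps infrastructure versioned and
        # reviewable alongside application code, the core idea behind IaC.
        s3.Bucket(
            self, "RawDataBucket",
            versioned=True,
            removal_policy=RemovalPolicy.DESTROY,
        )

app = App()
DataPlatformStack(app, "DataPlatformStack")
app.synth()
```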

What's included

1 video, 12 readings, 2 assignments

In this module, we will explore the next two phases of the data engineering lifecycle: Ingestion and Transformation. Data ingestion refers to the process of moving data from source systems into storage, making it available for processing and analysis. As you delve into the reading, you will examine key ingestion patterns, including batch versus streaming ingestion, synchronous versus asynchronous methods, and push, pull, and hybrid approaches. You’ll also explore essential engineering considerations such as scalability, reliability, and data quality management, along with the challenges posed by schema changes. The reading will introduce various technologies that enable data ingestion, such as JDBC/ODBC, Change Data Capture (CDC), APIs, and event-streaming platforms like Kafka. We then shift focus to the transformation phase of the lifecycle, exploring different types of transformations that integrate complex business logic into data pipelines. At the end of the module, we will focus on data architecture and implementing good architecture principles to build scalable and reliable data pipelines.
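As a rough contrast between the batch (pull) and streaming (push-style consumption) patterns described above, the sketch below assumes a reachable PostgreSQL source, a Kafka broker, and the pandas, SQLAlchemy (with a Postgres driver), pyarrow, and kafka-python packages; every connection detail is a placeholder.

```python
# Batch ingestion: pull a snapshot from a source database on a schedule.
import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("postgresql://user:pass@source-db:5432/sales")  # placeholder DSN
daily_snapshot = pd.read_sql(
    "SELECT * FROM orders WHERE order_date = CURRENT_DATE", engine
)
daily_snapshot.to_parquet("orders_snapshot.parquet")  # would land in object storage in practice

# Streaming ingestion: consume events continuously as producers push them.
import json
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "orders",                            # placeholder topic
    bootstrap_servers="broker:9092",     # placeholder broker
    value_deserializer=lambda v: json.loads(v),
)
for event in consumer:
    print("received order event:", event.value)  # would be written to storage or a stream processor
```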

What's included

4 videos, 12 readings, 2 assignments, 2 app items

In this module, we will explore data characteristics and how they drive infrastructure decisions. In today’s data-driven world, understanding the properties of your data is essential for designing robust data pipelines. We’ll go over key characteristics like volume, which refers to the size of datasets, and velocity, which concerns how frequently new data is generated. We’ll also take a look at variety, which focuses on data formats and sources, and veracity, which emphasizes data accuracy and trustworthiness. The ultimate goal is to uncover value from data through insightful analysis. As we delve into pipeline design, you'll learn how these characteristics influence key decisions, such as the choice of storage, processing, and analytics tools. We will also cover essential AWS services like Amazon S3, Glue, and Athena, exploring how they support scalable and flexible data engineering. By the end of this module, you’ll have a comprehensive understanding of how to build effective data solutions to meet both technical and business needs.
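A minimal sketch of the query side of that S3/Glue/Athena stack, assuming boto3 credentials are configured and that the Glue database, table, and S3 result location named below already exist (all names are placeholders):

```python
# Run a serverless SQL query with Athena over data catalogued in Glue and stored in S3.
import boto3

athena = boto3.client("athena", region_name="us-east-1")

response = athena.start_query_execution(
    QueryString="SELECT event_type, COUNT(*) AS n FROM events GROUP BY event_type",
    QueryExecutionContext={"Database": "analytics_db"},
    ResultConfiguration={"OutputLocation": "s3://my-athena-results/queries/"},
)
print("query execution id:", response["QueryExecutionId"])
# A real pipeline would poll get_query_execution() until the query succeeds,
# then read results with get_query_results() or directly from S3.
```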

What's included

6 readings, 1 assignment

Welcome to the final stage of the data engineering lifecycle: serving data. In this module, we will focus on how to effectively serve data for analytics, machine learning (ML), and reverse ETL to ensure that the data products you design are reliable, actionable, and trusted by stakeholders. Key topics include setting SLAs, identifying use cases, evolving data products with feedback, standardizing data definitions, and exploring delivery methods such as file exchanges, databases, and streaming systems. We’ll also cover the use of reverse ETL to improve business processes and discuss the importance of context for choosing the best visualization type and tools. We then delve into KPIs and metrics and how to classify them, including how to identify robust KPIs based on the business context. Finally, we will focus on creating intuitive dashboards by choosing the right analysis, visualizations, and metrics to showcase based on the business context and audience involved. By the end of this module, you will understand how to design and serve data solutions that drive meaningful action and are trusted by end users.
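As a small illustration of turning raw records into a KPI that could feed a dashboard or be synced back to an operational tool via reverse ETL, here is a pandas sketch with invented data and column names:

```python
# Compute a simple KPI (7-day revenue per region) from raw order records.
import pandas as pd

orders = pd.DataFrame({
    "region": ["NA", "NA", "EU", "EU", "EU"],
    "order_date": pd.to_datetime(
        ["2025-08-01", "2025-08-03", "2025-08-02", "2025-08-05", "2025-07-20"]
    ),
    "amount": [120.0, 80.0, 200.0, 50.0, 300.0],
})

cutoff = orders["order_date"].max() - pd.Timedelta(days=7)
kpi = (
    orders[orders["order_date"] > cutoff]
    .groupby("region", as_index=False)["amount"]
    .sum()
    .rename(columns={"amount": "revenue_7d"})
)
print(kpi)
# A dashboard would visualize this table; a reverse ETL job might sync it
# into a CRM so operational teams see the same numbers the warehouse reports.
```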

What's included

11 readings, 1 assignment

Instructor

Venkat Krishnamurthy
Northeastern University
3 courses, 341 learners


