Apache Hadoop

Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue teaches you about the core concepts and components of this powerful framework. You'll learn about Hadoop's architecture, its key components like Hadoop Distributed File System (HDFS) and MapReduce, as well as advanced topics such as data ingestion with tools like Flume and Sqoop. You will also delve into data processing using Hive and Pig, and explore scalable machine learning algorithms. By mastering Apache Hadoop, you will be equipped to handle big data challenges, contributing to business insights and decision making.
27credentials
1online degree
64courses

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Hadoop Course Catalog

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • IBM
  • Johns Hopkins University
  • University of California San Diego
  • Packt
  • University of Illinois Urbana-Champaign
  • Pearson
  • Alibaba Cloud Academy