• Post category:StudyBullet-8
  • Reading time:6 mins read


In depth course on Big Data Hadoop, Hive, Spark, HBase, MongoDB, Spark, Databricks, Kafka, Airflow and Projects

What you will learn

Learn Big Data Engineering required to work in any organisation

Learn Big Data Engineering required to prepare for interviews

Hands on implementations and practice

Hadoop, Hive, Spark, Kafka, Airflow and NoSQL(Hbase, MongoDB)

Description

Data now surround us. People upload videos, take pictures with their phones, text pals, change their Facebook status, write comments on websites, click on advertisements, and so on. Machines, too, are producing and storing an increasing amount of data. Specialized tools are required to process such massive datasets. This course covers both Hadoop and Spark, two fundamental frameworks that provide essential tools for completing massive big data projects.

This course has been designed to cater to all types of learners who want to get into the vast field of BigΒ Data Engineering. Be it theory , hands on or projects, everything is covered in detail without missing any topics in the field.

You will learn the following in details

Introduction to Big Data and Data Engineering- Big Data Engineering

Introduction to Distributed Systems – Hadoop and MapReduce -Big Data Engineering Introduction

Map Reduce & YARN -Big Data Hadoop Map Reduce YARN, Hadoop Map Reduce Hands On

Hive – Theory and Hands On

Hive Hands On- Theory and Hands On

NoSQL and Hbase- Theory and Hands On

Sqoop- Theory and Hands On

Spark- Theory and Hands On

Spark – Introduction

Big Data Engineering using PySpark- RDDs

Spark hands on – RDD


Get Instant Notification of New Courses on our Telegram channel.


Big Data Engineering using PySpark- Core, Internals, Architecture

Apache Spark Actions_ Transformations

Apache Spark Caching

Big Data Engineering using PySpark- Shared Vars , Coalesce Repartition

Big Data Engineering using PySpark- Dataframes

Spark hands on – Dataframe

Spark hands on – Databricks

Big Data Engineering using PySpark- Catalyst& Tungsten

Spark ML- Theory and Hands On

Spark Streaming- Theory and Hands On

Kafka- Theory and Hands On

Apache Airflow – Workflow Management Platform- Theory and Hands On

Big Data Projects – 3 end to end hands on projects

Big Data Enterprise Architecture

English
language

Content

Big Data Engineering & Hadoop

Big Data Engineering Learning Roadmap
Big Data Engineering Introduction
Introduction to Big Data Engineering
Introduction to Hadoop Part 1
Introduction to Hadoop Part 2
Introduction to Hadoop Part 3
Hadoop MapReduce Hands On
Apache Hive
Hive Hands On
NoSQL and HBase
Introduction to MongoDB
HBase Hands On
Big Data Sqoop
Big Data Sqoop Hands On

Spark and Advanced Big Data Engineering

Spark Introduction
Spark RDD
Spark RDD Hands On
Spark Core, internals, Architecture
Spark Actions
Spark Caching
Spark Shared Variables
Spark Dataframes and SQL
Spark Hands On
Spark Optimisation
RDD vs DF vs DS
Kafka
Kafka Hands On
Spark Streaming Part 1
Spark Streaming Part 2
Spark Streaming Hands On
Spark ML Lib
Spark hands On ML
Spark on Databricks Hands On
Airflow
Airflow Hands On

Big Data Projects

Short Video Analytics
Product Reccomender System
IoT Streaming Projects

Bonus

Big Data Architecture Overview