Olympic Games Analytics Project in Apache Spark for Beginners Using Databricks (Unofficial)

What you will learn

In this course you will learn to analyze Olympic Games data in Apache Spark using a Databricks notebook (Community Edition)

Explore data on the recent history of the Olympic Games using Apache Spark

Learn the basic flow of data in Apache Spark, from loading data to working with it, and see why Apache Spark is well suited to Big Data analysis jobs.

Learn the basics of Databricks notebooks by signing up for the free Community Edition server

Olympic Games analytics as a real-world example.

Graphical representation of data using a Databricks notebook.

Transform structured data using Spark SQL and DataFrames

Publish the project on the web to impress your recruiter

Description

In this course you will learn to analyze Olympic Games data in Apache Spark using a Databricks notebook (Community Edition):

1) The basic flow of data in Apache Spark, from loading data to working with it, and why Apache Spark is well suited to Big Data analysis jobs

2) The basics of Databricks notebooks, by signing up for the free Community Edition server

3) Olympic Games analytics as a real-world example

4) Graphical representation of data using a Databricks notebook

5) Hands-on learning

6) Real-time use case

7) Publishing the project on the web to impress your recruiter

About Databricks:

Databricks lets you start writing Spark queries instantly so you can focus on your data problems.

Let's discover more about the Olympic Games using Apache Spark.

Data:

Data exploration on the recent history of the Olympic Games

We will explore a dataset on the modern Olympic Games, including all the Games from Athens 1896 to Rio 2016.

Language: English

Content

Introduction

Introduction

Download Resources

Download Resources

Project Begins

File level details

Free Account creation in Databricks

Importing Databricks Notebook

Overview and Project Objective

File Content Explanation

Launch Spark Cluster

Spark Notebook Basics

Loading data into Spark Dataframe

Distribution of the age of gold medalists

Gold Medals for Athletes Over 50 based on Sports

Women's medals per edition (Summer Season) of the Games

Top 5 Gold Medal Countries

Disciplines with the greatest number of Gold Medals

Height vs Weight of Olympic Medalists

Variation of Male/Female Athletes over time

Variation of (Age/Weight/Height) for Male/Female Athletes over time

Weight over years for Male/Female Gymnasts

Weight/Height over years for Male/Female Lifters

Gold/Silver/Bronze Medals based on Countries

Publish Notebook to the Web

Bonus Lecture