• Post category:StudyBullet-15
  • Reading time:22 mins read


Snowflake Mastery: An easy step by step approach to be an expert in snowflake

What you will learn

You will learn about the Snowflake Architecture and Performance management

Deep Dive in to the Access management and the type of Db objects

Understand in full detail how the data moves from external system to Snowflake

Deep knowledge on Dynamic data masking and visualization

Extensive knowledge on Data Sharing and Data Protection

Description

Are you a technology professional or college graduate who have thought multiple times to learn snowflake but couldn’t learn it so far?

Is your self snowflake learning a struggle? As a busy technology professional, do you feel overwhelmed and frustrated with all the ‘answers’ to master snowflake, but still not sure which one is right for you?

Have you wondered, how many days , weeks and months are passed in just thinking about learning snowflake? The time is NOW to get started.

In this course, you will discover exactly what you need to focus on to master snowflake. We will take you through an easy step by step approach to learn and master snowflake in shortest period of time with out overwhelm.


Get Instant Notification of New Courses on our Telegram channel.


Cloud is the new norm, almost all tech giants are moving towards cloud platform for the solutions which are cost effective and scalable. Cloud data-warehousing aka Snowflake warehouse is one the fastest growing technology which most of the technology companies are adopting. This is going to be big in future, hence this is the right time to get into the ship to learn and equip ourselves will all the knowledge of snowflake and prepare ourselves for better career today and in future. So if you find yourself saying “May be I will try it another time “. I would say THE TIME IS NOW. Don’t let yourself go another day, another week, another month of learning procrastination. It’s time RIGHT NOW to get on the journey of learning and have the life knowledge and excellent career you have designed for you.

Following modules will be covered in a very easily step by step approach

  1. Snowflake Architecture and Performance Management
  1. Introduction
    1. Create Trial account
    2. Know your UI interface
    3. Shared and Non Shared Disk Architecture
    4. High Level Architecture
    5. Create Virtual Warehouses
    6. Scaling Policy
    7. Create First Database and Table
    8. Load CSV from public S3 bucket
  2. Snowflake Architecture
    1. Snowflake Editions
    2. Credits
    3. Credit Usage Scenarios
    4. Credit to Dollar Storage cost
    5. Credit Storage Consumption
  3. Performance Management
    1. Performance Improvement Methods
    2. Dedicated Virtual Warehouses
    3. Dedicated Virtual Warehouses Creation
    4. Scale Up and Scale Down
    5. Scale Out and Scale In
    6. Maximize Caching
    7. Maximize Caching demonstration
    8. Clustering Keys
    9. Clustering Cost
    10. Clustering Performance
    11. Micropartitions
  1. Access management and snowflake objects
  1. Access management
    1. Access control and RBAC
    2. Securable objects
    3. Roles
  2. Tables in snowflake
    1. Table Types
    2. Database and schema type
  3. User Defined Functions (UDFโ€™s) and Stored Procedures
    1. User defined functions (UDFโ€™s)
    2. Stored procedures (SPโ€™s)
    3. Secure UDFโ€™s and SPโ€™s
  4. Views
    1. Non Materialized views
    2. Materialized views
  1. Data Movement – Techniques and Unloading
  1. Data Loading techniques
    1. Types of Data load
    2. Know snowflake stages
    3. Install Snowsql
    4. Internal Stages
      1. Userstage
      2. Tablestage
      3. Namedstage
    5. External Stages
    6. COPY Command
    7. COPY Transformation
    8. File Format Object
  2. COPY OPTIONS
    1. ON ERROR
    2. SIZE LIMIT
    3. RETURN_FAILED_ONLY
    4. ENFORCE_LENGTH / TRUNCATECOLUMNS
    5. FORCE / LOAD_UNCERTAIN_FILES
    6. LOAD_HISTORY
    7. VALIDATION MODE
    8. Logging rejected records
    9. Data loading best practices
  3. Data Unloading
    1. Data unloading theory
    2. COPY into location – Internal Stage -> User stage
      1. Resolving max_workers must be greater than 0
    3. COPY into location – Internal Stage -> Table stage
    4. COPY into location – Internal Stage -> Named Internal Stage
    5. Unloading using Query
    6. Unloading to Multiple Files
  4. Data Movement – Data loading
  1. Loading from AWS
    1. Create AWS free-tier account
    2. Create S3 bucket, file and role
    3. Storage integration object
    4. Load file from AWS
    5. Unload data to AWS
  2. Loading from Azure
    1. Create Azure free trial account
    2. Create storage and container
    3. Integration object
    4. Load data from Azure
    5. Unload data to Azure
  3. Loading from GCP
    1. Create GCP free trial account, create bucket and upload file
    2. Integration object
    3. Load data from GCP
    4. Unload data to GCP
  4. Loading Semi-Structured Data
    1. Introduction
    2. Process and understanding the demo data
    3. JSON: Load raw data and analyze nested objects
    4. JSON: FLATTEN
      1. Array/Objects
      2. Hierarchical
    5. JSON: Data transformation and load
    6. JSON: Querying Data
    7. PARQUET: File load and transformation
  5. Snowpipe
    1. Overview
    2. Create snowpipe, load and configure
    3. Troubleshooting and pipe refresh
    4. Manage snowpipe
  1. Dynamic Data Masking and Visualization
    1. Dynamic data masking
      1. Overview
      2. Create masking policy
      3. Change, Unset and replace masking policy
      4. Conditional masking
  2. Data Visualization
    1. Snowsight – Dashboard and Visualization
    2. Getting started with Tableau
    3. Tableau visualization from snowflake data
  3. Partner Connect
    1. Overview
    2. Partner connect categories
  1. Data Sharing and Data Protection
  1. Data Sharing
    1. Overview
    2. Data sharing demo of Table, External tables and UDFโ€™s.
    3. Data share to a reader account
    4. Imported privileges
    5. Data share – entire database
    6. Data share operations – DDL and DML affect
    7. Sharing views and materialized views
  1. Time Travel
    1. Overview and retrieve data using โ€œatโ€ and โ€œbeforeโ€
    2. Data restoration using direct method
    3. Data restoration using indirect method
    4. UNDROP
    5. Understanding retention time
  2. Fail-Safe
    1. Overview
    2. Fail Safe demo
  3. Zero-Copy Cloning
    1. Overview
    2. Cloning permanent tables
    3. Copy grants and clone temporary external tables
    4. Clone schema and database
    5. Swap
    6. Clone using time travel and understand sharing vs cloning
  4. Streams
    1. Overview
    2. Consuming streams using insert, update and delete
    3. Types of streams – default, append only. Consume stream using false condition
    4. Insert only streams for external tables
    5. Multiple consumers with streams
    6. Changes clause
    7. Streams consideration and stale after attribute
    8. Recreate, rename and clone effect on stream
  5. Tasks
    1. Understanding tasks
    2. Create task using procedure when and troubleshoot task
    3. Scheduling tasks using CRON
    4. CRON – Daylight savings considerations
    5. DAG – Tree of tasks
    6. Task with stream
    7. User managed tasks and warehouse considerations
  6. Sampling Data
    1. Overview
    2. Row sampling method
    3. System block method
  7. Access management in practice
    1. System defined roles and its users
    2. Custom roles and its users
    3. Create objects from sysadmin role and assign to developer role
    4. Create custom users – developer1, developer2,qa1 and reader1
    5. Problems due to unassigned custom roles
    6. Solution of the problems
    7. Public Role
  8. Best Practices
    1. Auto resume, success and failure and long running queries
    2. Resource monitor
    3. Columns storage cost and benchmarking the queries

Enjoy and Happy Learning !

English
language

Content

Getting Started

Welcome and Course Structure
Alphaedge Solutions disclaimer and instructors
Course Resources
Few Actions before you launch your learning

Introduction

Create trial account
Know your UI interface
Share and non share disc architecture
High level snowflake architecture
Virtual warehouse sizes
Create virtual warehouse
Scaling policy
Create first database and table
Load csv file from public S3 bucket
Introduction

Snowflake Architecture

Snowflake editions
Snowflake editions
Credits
Minumum 1 minute concept
Credit usage scenarios
Credit to dollar storage cost
Credit consumption for storage
Snowflake Architecture

Performance management

Performance improvement methods
Dedicated virtual warehouses
Dedicated Virtual warehouse creation
Scale up and Scale down
Scale out and Scale in
Maximize caching
Maximize caching demonstration
Clustering keys
Clustering costs
Clustering performance
Micropartitions
Micropartitions – query pruning, depths and overlap
Performance Management

Access management

Access control and RBAC
Securable objects
Roles
Access Control

Tables in snowflake

Table types
Database and schema types
Tables

User defined functions (UDF’s) and Stored procedures

User defined functions (UDF’s)
Stored procedures (SP’s)
Secure UDF’s and SP’s
UDF’s and SP

Views

Non materialized views
Materialized views
Refresh materialized view
Views

Data loading

Data loading techniques
Know snowflake stages
Install SNOWSQL
Data loading
Internal stages – Userstage
Internal stages – Tablestage
Internal stages – Namedstage
External stages
Stages
COPY command
COPY Transformations1
COPY Transformations2
File format object
File format object properties
COPY and File Format Object

COPY OPTIONS

ON_ERROR – CONTINUE
ON_ERROR – SKIP_FILE
SIZE_LIMIT
RETURN_FAILED_ONLY
Copy options knowledge check 1
ENFORCE_LENGTH/TRUNCATECOLUMNS
FORCE/LOAD_UNCERTAIN_FILES
LOAD_HISTORY
Copy options knowledge check 2
VALIDATION MODE – RETURN_ERRORS
VALIDATION MODE – RETURN_<n>_ROWS/RETURN_ALL_ERRORS
Logging rejected records using RESULT_SCAN
Logging rejected records using validate
Data loading best practices1
Data loading best practices2
Validation mode , logging and rejected records

Data Unloading

Data unloading theory
COPY into location: Internalstage -> Userstage
COPY into location: Internalstage -> Tablestage
COPY into location: Internalstage -> Named internalstage
Unloading using query
Unloading multiple files
Data unloading

Loading and Unloading from AWS

Create AWS free-tier account
Create S3 bucket, file and role
Storage integration object
Load file from AWS
Unload data to AWS
Loading unloading to AWS

Loading and Unloading from AZURE

Create azure free trial account
Create storage and container
Integration object
Load data from Azure
Unload data to Azure
Loading unloading from Azure

Loading and Unloading from GCP

Create GCP trial account, create bucket and upload file
Integration object
Load data from GCP
Unload data to GCP
Loading unloading from GCP

Loading Semi-Structured data

Introduction
Process and understanding the demo data
JSON: Load raw data and analyse nested objects
JSON: FLATTEN
JSON: Data load and transformation
PARQUET: File load and transformation
Semi-structured data

Snowpipe

Overview
Create snowpipe, load and configure
Troubleshooting and pipe refresh
Manage snowpipe
Snowpipe

Dynamic Data Masking

Dynamic data masking overview
Create dynamic data masking policy
Masking policy and policy reference
Masking policy on multiple columns, nested masking and real life examples
Masking policy context and conditional masking
Dynamic Data Masking

Data Visualization

Data setup for visualization
Snowsight charts and visualization
Tableau installation
Snowflake ODBC driver
Tableau visualization from snowflake data
Data visualization

Partner Connect

Overview
Partner connect demo
Partner Connect

Data Sharing

Data sharing overview
Create Table, external table and UDF’s for data sharing
Data sharing demo of table, external table and UDF
Create data share via UI
Data share to a reader account
Imported privileges
Data share – entire database
Data share operations – DDL and DML affect
Sharing views and materialized views
Data Sharing

Time Travel

Time travel overview
Retrieve data using “at” and “before”
Data restoration using direct method
Data restoration using indirect method
UNDROP
Understanding retention time
Time Travel

FAILSAFE

Overview
Failsafe demo
Fail Safe

Zero Copy Cloning

Overview
Cloning permanent tables
Copy grants and more on cloning
Clone schema and database
Swap
Clone using time travel and understand sharing vs cloning
Zero copy cloning

Streams

Streams Overview
Streams Offset
Consuming streams using insert, update and delete
Types of streams – default, append only. Consume stream using false condition
Insert only streams for external tables
Multiple consumers with streams
Changes clause
Streams stale_after and extension in days
Recreate, rename and clone effect on stream
Streams

Tasks

Understanding tasks
Create tasks using procedure when and troubleshoot tasks
Schedule task using CRON
CRON – Day light savings considerations
DAG – Tree of tasks
Task with stream
User managed tasks and warehouse considerations
Tasks

Sampling Data

Overview
Row sampling method
System block method
Sampling Data

Access management in practice

System defined roles and its users
Custom roles and its users
Create objects from sysadmin role and assign to developer role
Create custom users – developer1, developer2, qa1 and reader1
Problems due to unassigned custom roles
Solution of the problems
Public Role
Access management in practice

Best Practices

Auto resume, success and failure and long running queries
Resource monitor
Resource monitor via SQL
Resource monitor with custom schedule
Columns storage cost and benchmarking the queries
Best practices

Thank you

Thank you note, stay connected and good luck !