Data Engineering Nanodegree Program Syllabus



LESSON ONE

Introduction to Data Warehouses

• Understand data warehousing architecture
• Run an ETL process to denormalize a database (3NF to star schema)
• Create an OLAP cube from facts and dimensions
• Compare columnar vs. row-oriented approaches
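
The denormalization and OLAP cube objectives above can be sketched in a few lines. This is a minimal illustration that uses Python's built-in sqlite3 module as a stand-in for a warehouse; the table names, columns, and sample rows are all invented here and are not taken from the course materials.

```python
import sqlite3


def build_star_schema(conn):
    """Create tiny 3NF source tables, then denormalize them into a star schema."""
    conn.executescript("""
        -- Hypothetical normalized (3NF) source tables.
        CREATE TABLE city (city_id INTEGER PRIMARY KEY, name TEXT, country TEXT);
        CREATE TABLE customer (
            customer_id INTEGER PRIMARY KEY,
            name TEXT,
            city_id INTEGER REFERENCES city(city_id)
        );
        CREATE TABLE payment (
            payment_id INTEGER PRIMARY KEY,
            customer_id INTEGER REFERENCES customer(customer_id),
            paid_on TEXT,
            amount REAL
        );

        INSERT INTO city VALUES
            (1, 'Lisbon', 'Portugal'), (2, 'Porto', 'Portugal'), (3, 'Lyon', 'France');
        INSERT INTO customer VALUES (10, 'Ana', 1), (11, 'Rui', 2), (12, 'Luc', 3);
        INSERT INTO payment VALUES
            (100, 10, '2021-01-15', 5.0),
            (101, 11, '2021-01-20', 7.0),
            (102, 12, '2021-02-03', 4.0),
            (103, 10, '2021-02-10', 6.0);

        -- Star schema: one dimension that flattens customer + city,
        -- and one fact table holding the measures.
        CREATE TABLE dim_customer (
            customer_key INTEGER PRIMARY KEY, name TEXT, city TEXT, country TEXT
        );
        CREATE TABLE fact_sales (
            sales_key INTEGER PRIMARY KEY, customer_key INTEGER, month TEXT, amount REAL
        );

        INSERT INTO dim_customer
        SELECT c.customer_id, c.name, ci.name, ci.country
        FROM customer c JOIN city ci ON ci.city_id = c.city_id;

        INSERT INTO fact_sales (customer_key, month, amount)
        SELECT customer_id, substr(paid_on, 1, 7), amount FROM payment;
    """)


def revenue_cube(conn):
    """Aggregate the fact table by (country, month), a two-dimensional cube."""
    return conn.execute("""
        SELECT d.country, f.month, SUM(f.amount)
        FROM fact_sales f
        JOIN dim_customer d ON d.customer_key = f.customer_key
        GROUP BY d.country, f.month
        ORDER BY d.country, f.month
    """).fetchall()


conn = sqlite3.connect(":memory:")
build_star_schema(conn)
cube = revenue_cube(conn)
for country, month, revenue in cube:
    print(country, month, revenue)
```

Dropping `f.month` from the GROUP BY clause would roll the cube up to revenue per country, and adding a WHERE clause on one month would slice it.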



LESSON TWO

Introduction to the Cloud with AWS

• Understand cloud computing
• Create an AWS account and understand its services
• Set up Amazon S3, IAM, VPC, EC2, RDS PostgreSQL



LESSON THREE

Implementing Data Warehouses on AWS

• Identify components of the Redshift architecture
• Run an ETL process to extract data from S3 into Redshift
• Set up AWS infrastructure using Infrastructure as Code (IaC)
• Design an optimized table by selecting the appropriate distribution style and sort key
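
To make the table design objective above more concrete, here is a small sketch that assembles a Redshift `CREATE TABLE` statement with a distribution style and a sort key. The helper `redshift_create_table` and the example tables and columns are hypothetical, invented for illustration. Only the `DISTSTYLE`, `DISTKEY`, and `SORTKEY` keywords come from Redshift's SQL dialect. The code only builds strings and does not connect to a cluster.

```python
def redshift_create_table(name, columns, diststyle="AUTO", distkey=None, sortkeys=()):
    """Return a Redshift CREATE TABLE statement as a string.

    columns is a list of (column_name, sql_type) pairs. This hypothetical
    helper does no quoting or escaping, so pass trusted identifiers only.
    """
    if diststyle not in {"AUTO", "EVEN", "KEY", "ALL"}:
        raise ValueError(f"unknown distribution style: {diststyle}")
    # This helper's own rule: KEY distribution needs a column to distribute on.
    if diststyle == "KEY" and distkey is None:
        raise ValueError("DISTSTYLE KEY requires a distkey column")

    column_sql = ",\n    ".join(f"{col} {sql_type}" for col, sql_type in columns)
    parts = [f"CREATE TABLE {name} (\n    {column_sql}\n)", f"DISTSTYLE {diststyle}"]
    if diststyle == "KEY":
        parts.append(f"DISTKEY ({distkey})")
    if sortkeys:
        parts.append(f"SORTKEY ({', '.join(sortkeys)})")
    return "\n".join(parts) + ";"


# A large fact table: co-locate rows that join on song_id, sort by time.
fact_ddl = redshift_create_table(
    "fact_song_play",
    [("play_id", "BIGINT"), ("song_id", "VARCHAR(32)"), ("start_time", "TIMESTAMP")],
    diststyle="KEY",
    distkey="song_id",
    sortkeys=("start_time",),
)

# A small dimension table: copy it to every node so joins avoid data movement.
dim_ddl = redshift_create_table(
    "dim_user",
    [("user_id", "INTEGER"), ("user_name", "VARCHAR(64)")],
    diststyle="ALL",
    sortkeys=("user_id",),
)

print(fact_ddl)
print(dim_ddl)
```

The two calls reflect a common rule of thumb, not a universal one: distribute large fact tables on a frequently joined column, and replicate small dimension tables with `DISTSTYLE ALL`.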

Course Project

Build a Cloud Data Warehouse

In this project, you are tasked with building an ELT pipeline that extracts a company's data from S3, stages it in Redshift, and transforms it into a set of dimensional tables, so that the company's analytics team can continue finding insights into what songs its users are listening to.
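
The shape of such an ELT pipeline (load raw records into a staging table first, then transform them with SQL inside the database) can be sketched as follows. This is only an illustration under stated assumptions: sqlite3 stands in for Redshift, an in-memory list of JSON strings stands in for files in S3, and every table, column, and record is invented. On Redshift the load step would typically be a `COPY` from S3 and not row-by-row inserts.

```python
import json
import sqlite3

# Hypothetical raw event records standing in for JSON log files in S3.
RAW_EVENTS = [
    '{"user_id": 1, "user_name": "Ana", "song": "Blue", "ts": "2021-03-01T10:00:00"}',
    '{"user_id": 2, "user_name": "Rui", "song": "Red", "ts": "2021-03-01T10:05:00"}',
    '{"user_id": 1, "user_name": "Ana", "song": "Red", "ts": "2021-03-01T10:09:00"}',
]


def load_staging(conn, lines):
    """Extract + load: copy the raw records into a staging table unchanged."""
    conn.execute(
        "CREATE TABLE staging_events "
        "(user_id INTEGER, user_name TEXT, song TEXT, ts TEXT)"
    )
    records = [json.loads(line) for line in lines]
    conn.executemany(
        "INSERT INTO staging_events VALUES (:user_id, :user_name, :song, :ts)",
        records,
    )


def transform(conn):
    """Transform: build a dimension and a fact table from the staged rows."""
    conn.executescript("""
        CREATE TABLE dim_user (user_id INTEGER PRIMARY KEY, user_name TEXT);
        CREATE TABLE fact_song_play (
            play_id INTEGER PRIMARY KEY, user_id INTEGER, song TEXT, ts TEXT
        );

        -- One row per distinct user.
        INSERT INTO dim_user
        SELECT DISTINCT user_id, user_name FROM staging_events;

        -- One row per listening event.
        INSERT INTO fact_song_play (user_id, song, ts)
        SELECT user_id, song, ts FROM staging_events;
    """)


def plays_per_song(conn):
    """Example analytics query: how often each song was played."""
    return conn.execute(
        "SELECT song, COUNT(*) FROM fact_song_play GROUP BY song ORDER BY song"
    ).fetchall()


conn = sqlite3.connect(":memory:")
load_staging(conn, RAW_EVENTS)
transform(conn)
song_counts = plays_per_song(conn)
print(song_counts)
```

Keeping the staging table as a raw copy means the transformation can be rerun or changed later without extracting the source data again.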


