Project introduction

Use AWS redshift to build an ETL pipeline for a database. Load data from S3 to staging tables on Redshift and execute SQL statements that create the analytics tables from these staging tables.

How to run

Create connection to Redshift. Run redshift.py ** Please do not run redshift.py file because we have initial redshift already.
Add IAM Role by run create_iam_role.py
Create database connection by run create_tables.py ** Please do not run create_tables.py because we have all databases
Read etl to run load CSV file from S3, then insert to own database (staging, fact and dimension)

** Run delete_cluster.py will delete all redshift. Please carefully.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
README.md		README.md
create_iam_role.py		create_iam_role.py
create_tables.py		create_tables.py
delete_cluster.py		delete_cluster.py
dwh.cfg		dwh.cfg
etl.py		etl.py
redshift.py		redshift.py
sql_queries.py		sql_queries.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project introduction

How to run

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project introduction

How to run

About

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages