Data-Modeling-with-Postgres

In this project, we’ll model user activity data for a music streaming app called Sparkify. We’ll create a relational database and ETL pipeline designed to optimize queries for understanding what songs users are listening to. In PostgreSQL we will also define Fact and Dimension tables and insert data into your new tables.

songplay(this is fact_table) - songplay_id has songid as a primary key. songplay is a fact_table since it stores the metric for business processes.

user(dim_user) - user_id as a primary key.

song(dim_song) - song_id is primary key since there should only one song must be present in the song table

artist_table_create(dim_artist) - artist_id is primary key since there should only one artist in the table

time(dim_time) - start_time is a primary key since its the timestamp which can let us query on time tabel

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
create_tables.py		create_tables.py
etl.ipynb		etl.ipynb
etl.py		etl.py
requirments.txt		requirments.txt
sql_queries.py		sql_queries.py
test.ipynb		test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data-Modeling-with-Postgres

About

Uh oh!

Releases

Packages

Languages

License

vighneshanap/Data-Modeling-with-Postgres

Folders and files

Latest commit

History

Repository files navigation

Data-Modeling-with-Postgres

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages