Skip to content

Exploratory Data Analysis on the Spotify Dataset with more than 1.1M records

Notifications You must be signed in to change notification settings

Mridul-Gulati/Spotify_EDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

App Screenshot

Key Findings:

1. Duration of Songs has increased since 1920s
2. Dance, Pop, Rap, Hip Hop and Raggaeton are most
   popular genres
3. With Loudness Popularity of Songs has increased
4. Loud songs are generally more Energetic

Author

Problem Statement:

This project analyses what kinds of music gain the most amount of listeners. It helps us get a detailed view of why some songs performed well while others didn't.

Data Source

Methods

  • Defined the Problem Statement
  • Data Cleaning using Pandas
  • Feature Engineering - Correlation HeatMap
  • Comparing Loudness and Energy
  • Analysis of Most Popular genres

🛠 Tech Stack

  • Python 3.10
  • Pandas
  • Matplotlib
  • Seaborn

Quick Glance at Findings

Correlation between Features

App Screenshot

Popular Genres

App Screenshot

Comparing Loudness & Energy

App Screenshot

Changing Duration of Songs with Time(Years)

App Screenshot

Lessons learnt:

  • How to Identify problem statements?
  • How to use Data insights to solve the problems?
  • How to perform Feature Engineering?
  • How to use Seaborn for data visualization?

What to do next?

  • Use the above insights to create a song recommendation system by grouping related songs one after the other.
  • Predicting how a song would do according to its features like duration, loudness, genre etc.

🔗 Links

portfolio linkedin

About

Exploratory Data Analysis on the Spotify Dataset with more than 1.1M records

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published