Spotify Data Analytics Platform
This project transforms raw Spotify streaming logs into actionable insights. It allows users to visualize their listening journey, discover their 'true' favorite genres, and predict their future listening habits based on historical trends.
Technology Stack
System Architecture
Data Ingestion
Spotify API and user-exported JSON logs.
Analysis
Pandas and NumPy for statistical processing.
Visualization
Interactive Power BI dashboard and Matplotlib/Seaborn plots.
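As an illustration of the ingestion stage, here is a minimal sketch that loads user-exported JSON logs into a single Pandas DataFrame. It assumes the export consists of files named StreamingHistory*.json, each holding a list of records with the fields endTime, artistName, trackName and msPlayed. The function name load_streaming_history is hypothetical and not taken from the project.

```python
import json
from pathlib import Path

import pandas as pd

# Assumed field names of one record in the exported log files.
COLUMNS = ["endTime", "artistName", "trackName", "msPlayed"]


def load_streaming_history(export_dir):
    """Read every StreamingHistory*.json file in export_dir into one DataFrame."""
    records = []
    for path in sorted(Path(export_dir).glob("StreamingHistory*.json")):
        with open(path, encoding="utf-8") as fh:
            # Each file is assumed to contain a JSON list of event dicts.
            records.extend(json.load(fh))

    df = pd.DataFrame(records, columns=COLUMNS)
    # Unparseable timestamps become NaT so a later cleaning step can drop them.
    df["endTime"] = pd.to_datetime(df["endTime"], errors="coerce")
    return df
```

Reading all files into one list before building the DataFrame avoids repeated concatenation, which is usually sufficient for a personal export of this size.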
The Challenges
Cleaning and normalizing inconsistent data from the Spotify API.
Efficiently processing large datasets of thousands of listening events.
Creating intuitive visualizations for complex temporal data.
The Solutions
Built a robust data cleaning pipeline that handles missing values and merges multiple data sources.
Optimized performance and memory usage by using vectorized Pandas operations in place of row-by-row loops.
Developed a time-series heat map to visualize listening intensity throughout the day/week.
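The solutions above can be sketched roughly as follows. This is not the project's actual pipeline. It is a minimal example assuming a DataFrame with the columns endTime, artistName, trackName and msPlayed, and the function names clean_events and listening_matrix are hypothetical. It drops rows with missing or invalid values, removes duplicates that can appear when sources are merged, and uses vectorized column operations to build a weekday-by-hour table of minutes listened.

```python
import pandas as pd

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def clean_events(df):
    """Drop unusable rows and duplicate events from merged sources."""
    out = df.copy()
    # Coerce bad values to NaT/NaN so they can be removed in one pass.
    out["endTime"] = pd.to_datetime(out["endTime"], errors="coerce")
    out["msPlayed"] = pd.to_numeric(out["msPlayed"], errors="coerce")
    out = out.dropna(subset=["endTime", "msPlayed"])
    # The same event may appear in more than one source after merging.
    out = out.drop_duplicates(subset=["endTime", "artistName", "trackName"])
    return out.reset_index(drop=True)


def listening_matrix(df):
    """Return a 7 x 24 table (weekday x hour) of minutes listened."""
    # Whole-column operations; no Python-level loop over rows.
    events = pd.DataFrame({
        "weekday": df["endTime"].dt.day_name(),
        "hour": df["endTime"].dt.hour,
        "minutes": df["msPlayed"] / 60_000,
    })
    table = events.pivot_table(index="weekday", columns="hour",
                               values="minutes", aggfunc="sum", fill_value=0)
    # Ensure every weekday and hour is present, even with no listening.
    return table.reindex(index=WEEKDAYS, columns=range(24), fill_value=0.0)
```

The resulting table can be passed directly to a plotting call such as seaborn.heatmap to render listening intensity by day and hour.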
Key Results & Metrics
Data-driven insights
Interactive visualizations
Pattern recognition