Project Aim

This project aims to show that lyrics differ measurably between genres, and that data science techniques can reveal those differences.

Scraping Lyrics from

We will scrape lyrics from, a free website that hosts a tonne of accurate lyric content.
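As a rough sketch of the scraping step, the snippet below pulls lyric text out of a page's HTML using only Python's standard library. The source site's markup is not shown here, so the `lyrics` class name, the `LyricsExtractor` helper and the sample HTML are all assumptions for illustration; a real scraper would download each page first and adapt the class name to the site.

```python
from html.parser import HTMLParser


class LyricsExtractor(HTMLParser):
    """Collect text inside the first element carrying `target_class`.

    `target_class` is a hypothetical name; inspect the real page to find it.
    """

    # Void elements have no closing tag, so they must not change the depth.
    VOID_TAGS = {"br", "hr", "img", "input", "meta", "link"}

    def __init__(self, target_class="lyrics"):
        super().__init__()
        self.target_class = target_class
        self.depth = 0      # nesting depth inside the target element
        self.done = False   # True once the target element has closed
        self.lines = []

    def handle_starttag(self, tag, attrs):
        if tag in self.VOID_TAGS:
            return
        if self.depth > 0:
            self.depth += 1
        elif not self.done:
            classes = (dict(attrs).get("class") or "").split()
            if self.target_class in classes:
                self.depth = 1

    def handle_endtag(self, tag):
        if tag in self.VOID_TAGS:
            return
        if self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        # Keep only non-empty text found inside the target element.
        if self.depth > 0 and data.strip():
            self.lines.append(data.strip())


def extract_lyrics(html, target_class="lyrics"):
    """Return the lyric lines found in `html`, joined with newlines."""
    parser = LyricsExtractor(target_class)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)


# Made-up page, purely for illustration.
sample_html = (
    "<html><body><h1>Song title</h1>"
    '<div class="lyrics">First line<br>Second line</div>'
    "<p>Footer</p></body></html>"
)
lyrics_text = extract_lyrics(sample_html)
```

Keeping the parsing separate from the download makes it easy to check the extraction against saved pages before running it across a whole site.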

Encoding lyrics using Google's Universal Sentence Encoder

We will represent each set of lyrics as a high-dimensional vector produced by Google’s Universal Sentence Encoder.
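A minimal sketch of the encoding step might look like the following. It assumes version 4 of the encoder loaded from TensorFlow Hub, which returns 512-dimensional vectors; the project may have used a different version. The helper names `load_use_model` and `encode_lyrics` are hypothetical.

```python
import numpy as np

# TF Hub handle for Universal Sentence Encoder v4 (the version is an assumption).
USE_URL = "https://tfhub.dev/google/universal-sentence-encoder/4"


def load_use_model(url=USE_URL):
    """Load the encoder; needs `tensorflow` and `tensorflow_hub` installed."""
    import tensorflow_hub as hub  # imported lazily because it is a heavy dependency
    return hub.load(url)


def encode_lyrics(lyrics, model=None):
    """Return one embedding row per lyric as a 2-D NumPy array."""
    if model is None:
        model = load_use_model()
    # The loaded model is callable on a list of strings.
    return np.asarray(model(list(lyrics)))
```

Calling `encode_lyrics(["first song lyrics", "second song lyrics"])` downloads the model on first use and should return an array with one row per song.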

Reducing Dimensionality using PCA

We will apply Principal Component Analysis (PCA) to reduce these high-dimensional representations to two or three dimensions that can be plotted.
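To make the reduction concrete, here is a small sketch of PCA written directly with NumPy's singular value decomposition. This is an illustration of the technique, not necessarily how the project implemented it (a library such as scikit-learn would do the same job); the function name `pca_reduce` is hypothetical.

```python
import numpy as np


def pca_reduce(vectors, n_components=3):
    """Project the rows of `vectors` onto their top principal components."""
    X = np.asarray(vectors, dtype=float)
    # PCA works on mean-centred data.
    X_centered = X - X.mean(axis=0)
    # Rows of Vt are the principal axes, ordered by decreasing variance.
    _, _, Vt = np.linalg.svd(X_centered, full_matrices=False)
    return X_centered @ Vt[:n_components].T
```

Given an array of lyric embeddings with one row per song, `pca_reduce(embeddings, n_components=3)` returns three coordinates per song, ready for a 3D scatter plot; use `n_components=2` for a flat plot.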

Visualising in Plotly

We will use Plotly to build a combination of simple plots and more sophisticated 2D and 3D scatter plots that show where lyrics are similar and where they differ.

Show me the viz


The full article was originally a featured story on Medium

Take me to Medium