top | item 10112436

Using Spotify to measure the popularity of older music

88 points| bdr | 10 years ago |poly-graph.co | reply

51 comments

order
[+] wazoox|10 years ago|reply
What strikes me are the missing artists. For instance, no Beatles. I'm pretty sure I hear the Beatles on TV, radio, etc more than all other 60s music combined. Ditto AC/DC: 1980 "Back in black" is the best selling rock album ever, and I hear AC/DC really, really often in the background. I'm afraid Spotify isn't _that_ representative of all available (pop) music.
[+] ianbooker|10 years ago|reply
Keep in mind: Some artist did not consent to streaming, like the Beatles. Aren't they Apple-exclusive?
[+] sehugg|10 years ago|reply
I'm wondering how much this is skewed by playlists -- i.e. a popular song's stats are amplified by its inclusion on popular playlists and Spotify radio. It'd be fun to see if these stats would change if you included only songs that people actively seeked out.
[+] mfdaniels|10 years ago|reply
Yea, the plan was to remove counts via radio and playlists, but it would have doubled the amount of work from a data call/cleaning perspective. I'd bet that the results are still directionally accurate :)
[+] bkjelden|10 years ago|reply
Chet Faker released a cover of No Diggity a few years ago, which is responsible for all of my plays of the original on Spotify.

A popular cover can probably drive a lot of popularity for an older song on Spotify.

[+] mfdaniels|10 years ago|reply
Agree! In fact, I'd argue that a v2 of this project, where I look at the features of songs with high longevity (e.g., No Diggity) were driven by samples/covers/Glee/etc.
[+] scottkduncan|10 years ago|reply
Was thinking the same thing, but also for a current cover to catch on there is likely something about the original that still resonates.

One nitpick: "Iris" by Goo Goo Dolls was ineligible for the Billboard Hot 100 for a long time due to their rules that songs had to be sold as a "single" to be counted on that chart. It was #1 in terms of radio airplay for a huge chunk of 1998, so it's not really accurate to say it didn't chart highly at the time.

[+] iamdave|10 years ago|reply
Chet Faker released a cover of No Diggity a few years ago

This doesn't add a single thing to the discussion, but it's a great cover too.

[+] tribe|10 years ago|reply
If you liked this, you may be interested in Dorothy Gambrell's analysis from the other direction. She took the top songs from billboard over the past 100 years, and tried to find out which ones have remainied popular.

http://www.verysmallarray.com/?p=1752

[+] mfdaniels|10 years ago|reply
Yup. I emailed Dorothy at the beginning of the project to get the Whitburn database.
[+] mfdaniels|10 years ago|reply
author here, if there's any questions
[+] scottkduncan|10 years ago|reply
Thanks for this: http://poly-graph.co/vocabulary.html I've gotten hours of conversation out of it.

Also so happy to see that "You Got Me" by The Roots got 6 million streams in 2014. That is the definition of a future timeless track.

[+] exhuma|10 years ago|reply
Has there been any kind of analysis about potential biases? From the top of my head I can think of the fact that Spotify was not available in every country from the start. Also, not every person on the world streams their music. Does this data include record sales for examlpe? Does it normalise the data somehow taking into account the percentage of the population that streams music versus other means of consuming music?

Or is it just plain and simple data gathered from Spotify?

I'm not saying that there is anything wrong/bad about the results. But without knowing the details on how the data is collected, it's hard to read anything from the results.

[+] steadicat|10 years ago|reply
This is awesome. You kinda touch on this, but the main thing that jumps out at me is that there seems to be a very strong bias towards songs from the late 90s and early 00s. Do you think this is because Spotify’s main cohort is people in their late 20s to early 30s, who came of age, musically, during that time?
[+] wodenokoto|10 years ago|reply
I'm surprised that nirvana's smell like teen spirit wasn't a bigger hit. However, Nirvana was insanely big at the time, so they don't really fit the hypothesis of underground sleeper hits.

I'm sure Nirvana has many #1 hits, but why did Smell like teen spirit become the poster song for Nirvana?

The same can be said about Oasis, who at the time was insanely big and held several spots on top 10 lists for months. But maybe they were bigger in Europe than the U.S. And maybe European fans are driving the spotify listens?

[+] ap22213|10 years ago|reply
It's too bad that pearl jam isn't represented. I listened to some the other day and was taken with how good it was.

But, very interesting data nonetheless! I'm loving it!

Is it possible to filter the data by listener age? I wonder, because a lot of these songs are in play lists of mine from the 80s / 90s (I grew up in the 80s / 90s). Maybe spotify's user base is older than suspected?

Also, it would be interesting to plot billboard rankings vs spotify rankings. Possible?

[+] flurie|10 years ago|reply
Why "timeless"? When I use that word to talk about music, it means that the music has evaded its contemporary trappings. Its opposite is "dated".

Why lasting popularity as a measure of timelessness?

How do you account for longer trends? Some of Bach's children were more popular than he was for quite a while.

[+] marme|10 years ago|reply
I am more interested in what you use to make these graphs and charts. The data presentation in all the articles on poly-graph is always excellent. The interactive charts and graphs is perfect for letting the reader understand the data. Did you make all the graphs and charts?
[+] mattmanser|10 years ago|reply
Great job.

Few slight bugs on the Present-day Popularity of Five Decades of Music, Dream On appears twice in the 70s with the same listen count(73 & 76). Also Blink-182, 1999 is showing in the 00s. All I Want For Christmas Is You — Mariah Carey, 2000 is showing in the 90s.

[+] Garlef|10 years ago|reply
No question; But this is one of the coolest things I've seen on the internet yet.
[+] jacques_chester|10 years ago|reply
When are you coming back to do another tech talk at Pivotal Labs?
[+] gish|10 years ago|reply
This is impressive!

Did you manually retrieve the play count for each track or is there an automated way of doing it?

[+] juddlyon|10 years ago|reply
Very slick analysis!

No Diggity is a great song, but the song it samples might be even better: Grandma's Hands by Bill Withers.

[+] kpennell|10 years ago|reply
Crazy how much of the 90s rap sampled from the 70s and people don't realize it.
[+] sebcioz|10 years ago|reply
Were you thinking about creating spotify playlists from those top lists?
[+] mr_green_tea|10 years ago|reply
First of all I do dislike blog posts which lack a comment section to ask questions, criticize or praize.

Second of all I do dislike texts on data which lack information on where the data comes from.

I can think of ways to mine present day play counts from Spotify (while not working there) but I wonder where did he get the daily counts from he used in the last chart. Any ideas?

Furthermore I doubt that Spotify is necessarily a good indicator on how songs are being perceived in the long run. Especially b/c there are local platform-specific attractor dynamics at play.

[+] mfdaniels|10 years ago|reply
Sorry that there's so comments section.

The data is pretty clear in terms of source...Spotify in 2014...Billboard data via Whitburn.

The data was directly from one of Spotify's data partners.

Yea Spotify isn't a perfect indicator. This is the best proxy for present-day popularity that I can think of. I could have create an index that abstracted several data sources, but that would have killed the readability of the article.

[+] lcswi|10 years ago|reply
* in US pop culture
[+] realusername|10 years ago|reply
Why the downvotes ? It's actually true and really relevant to the article. Music was way more localized back then and a lot of mainstream US artists never actually reached a worldwide audience. (especially on the Rap section of the article).
[+] mfdaniels|10 years ago|reply
It's all of Spotify. So while it's international data, it certainly skews toward western music.
[+] flakmonkey|10 years ago|reply
While the author does refer to US music charts as a reference to past popularity, I don't see any indication that the Spotify data was US only.
[+] dexterdog|10 years ago|reply
Isn't it ironic that Ironic is on there largely because people go listen to it because people tell them that none of the examples are actually ironic?
[+] paublyrne|10 years ago|reply
No.

The lyrics are a little fluff, but Ironic has a great chorus and catchy hooks. It's a classic pop song.

[+] ewzimm|10 years ago|reply
Isn't it ironic that people listen to a hit song about irony to feel superior about their own understanding of irony, which only increases its popularity and entrenches its expression of situational irony?