What strikes me are the missing artists. For instance, no Beatles. I'm pretty sure I hear the Beatles on TV, radio, etc more than all other 60s music combined. Ditto AC/DC: 1980 "Back in black" is the best selling rock album ever, and I hear AC/DC really, really often in the background.
I'm afraid Spotify isn't _that_ representative of all available (pop) music.
I'm wondering how much this is skewed by playlists -- i.e. a popular song's stats are amplified by its inclusion on popular playlists and Spotify radio. It'd be fun to see if these stats would change if you included only songs that people actively seeked out.
Yea, the plan was to remove counts via radio and playlists, but it would have doubled the amount of work from a data call/cleaning perspective. I'd bet that the results are still directionally accurate :)
Agree! In fact, I'd argue that a v2 of this project, where I look at the features of songs with high longevity (e.g., No Diggity) were driven by samples/covers/Glee/etc.
Was thinking the same thing, but also for a current cover to catch on there is likely something about the original that still resonates.
One nitpick: "Iris" by Goo Goo Dolls was ineligible for the Billboard Hot 100 for a long time due to their rules that songs had to be sold as a "single" to be counted on that chart. It was #1 in terms of radio airplay for a huge chunk of 1998, so it's not really accurate to say it didn't chart highly at the time.
If you liked this, you may be interested in Dorothy Gambrell's analysis from the other direction. She took the top songs from billboard over the past 100 years, and tried to find out which ones have remainied popular.
Has there been any kind of analysis about potential biases? From the top of my head I can think of the fact that Spotify was not available in every country from the start. Also, not every person on the world streams their music. Does this data include record sales for examlpe? Does it normalise the data somehow taking into account the percentage of the population that streams music versus other means of consuming music?
Or is it just plain and simple data gathered from Spotify?
I'm not saying that there is anything wrong/bad about the results. But without knowing the details on how the data is collected, it's hard to read anything from the results.
This is awesome. You kinda touch on this, but the main thing that jumps out at me is that there seems to be a very strong bias towards songs from the late 90s and early 00s. Do you think this is because Spotify’s main cohort is people in their late 20s to early 30s, who came of age, musically, during that time?
I'm surprised that nirvana's smell like teen spirit wasn't a bigger hit. However, Nirvana was insanely big at the time, so they don't really fit the hypothesis of underground sleeper hits.
I'm sure Nirvana has many #1 hits, but why did Smell like teen spirit become the poster song for Nirvana?
The same can be said about Oasis, who at the time was insanely big and held several spots on top 10 lists for months. But maybe they were bigger in Europe than the U.S. And maybe European fans are driving the spotify listens?
It's too bad that pearl jam isn't represented. I listened to some the other day and was taken with how good it was.
But, very interesting data nonetheless! I'm loving it!
Is it possible to filter the data by listener age? I wonder, because a lot of these songs are in play lists of mine from the 80s / 90s (I grew up in the 80s / 90s). Maybe spotify's user base is older than suspected?
Also, it would be interesting to plot billboard rankings vs spotify rankings. Possible?
I am more interested in what you use to make these graphs and charts. The data presentation in all the articles on poly-graph is always excellent. The interactive charts and graphs is perfect for letting the reader understand the data. Did you make all the graphs and charts?
Few slight bugs on the Present-day Popularity of Five Decades of Music, Dream On appears twice in the 70s with the same listen count(73 & 76). Also Blink-182, 1999 is showing in the 00s. All I Want For Christmas Is You — Mariah Carey, 2000 is showing in the 90s.
First of all I do dislike blog posts which lack a comment section to ask questions, criticize or praize.
Second of all I do dislike texts on data which lack information on where the data comes from.
I can think of ways to mine present day play counts from Spotify (while not working there) but I wonder where did he get the daily counts from he used in the last chart. Any ideas?
Furthermore I doubt that Spotify is necessarily a good indicator on how songs are being perceived in the long run. Especially b/c there are local platform-specific attractor dynamics at play.
The data is pretty clear in terms of source...Spotify in 2014...Billboard data via Whitburn.
The data was directly from one of Spotify's data partners.
Yea Spotify isn't a perfect indicator. This is the best proxy for present-day popularity that I can think of. I could have create an index that abstracted several data sources, but that would have killed the readability of the article.
Why the downvotes ? It's actually true and really relevant to the article. Music was way more localized back then and a lot of mainstream US artists never actually reached a worldwide audience. (especially on the Rap section of the article).
Isn't it ironic that people listen to a hit song about irony to feel superior about their own understanding of irony, which only increases its popularity and entrenches its expression of situational irony?
[+] [-] wazoox|10 years ago|reply
[+] [-] ianbooker|10 years ago|reply
[+] [-] sehugg|10 years ago|reply
[+] [-] mfdaniels|10 years ago|reply
[+] [-] bkjelden|10 years ago|reply
A popular cover can probably drive a lot of popularity for an older song on Spotify.
[+] [-] mfdaniels|10 years ago|reply
[+] [-] scottkduncan|10 years ago|reply
One nitpick: "Iris" by Goo Goo Dolls was ineligible for the Billboard Hot 100 for a long time due to their rules that songs had to be sold as a "single" to be counted on that chart. It was #1 in terms of radio airplay for a huge chunk of 1998, so it's not really accurate to say it didn't chart highly at the time.
[+] [-] iamdave|10 years ago|reply
This doesn't add a single thing to the discussion, but it's a great cover too.
[+] [-] tribe|10 years ago|reply
http://www.verysmallarray.com/?p=1752
[+] [-] mfdaniels|10 years ago|reply
[+] [-] mfdaniels|10 years ago|reply
[+] [-] scottkduncan|10 years ago|reply
Also so happy to see that "You Got Me" by The Roots got 6 million streams in 2014. That is the definition of a future timeless track.
[+] [-] exhuma|10 years ago|reply
Or is it just plain and simple data gathered from Spotify?
I'm not saying that there is anything wrong/bad about the results. But without knowing the details on how the data is collected, it's hard to read anything from the results.
[+] [-] steadicat|10 years ago|reply
[+] [-] wodenokoto|10 years ago|reply
I'm sure Nirvana has many #1 hits, but why did Smell like teen spirit become the poster song for Nirvana?
The same can be said about Oasis, who at the time was insanely big and held several spots on top 10 lists for months. But maybe they were bigger in Europe than the U.S. And maybe European fans are driving the spotify listens?
[+] [-] ap22213|10 years ago|reply
But, very interesting data nonetheless! I'm loving it!
Is it possible to filter the data by listener age? I wonder, because a lot of these songs are in play lists of mine from the 80s / 90s (I grew up in the 80s / 90s). Maybe spotify's user base is older than suspected?
Also, it would be interesting to plot billboard rankings vs spotify rankings. Possible?
[+] [-] flurie|10 years ago|reply
Why lasting popularity as a measure of timelessness?
How do you account for longer trends? Some of Bach's children were more popular than he was for quite a while.
[+] [-] marme|10 years ago|reply
[+] [-] mattmanser|10 years ago|reply
Few slight bugs on the Present-day Popularity of Five Decades of Music, Dream On appears twice in the 70s with the same listen count(73 & 76). Also Blink-182, 1999 is showing in the 00s. All I Want For Christmas Is You — Mariah Carey, 2000 is showing in the 90s.
[+] [-] Garlef|10 years ago|reply
[+] [-] jacques_chester|10 years ago|reply
[+] [-] gish|10 years ago|reply
Did you manually retrieve the play count for each track or is there an automated way of doing it?
[+] [-] juddlyon|10 years ago|reply
No Diggity is a great song, but the song it samples might be even better: Grandma's Hands by Bill Withers.
[+] [-] kpennell|10 years ago|reply
[+] [-] sebcioz|10 years ago|reply
[+] [-] mfdaniels|10 years ago|reply
[+] [-] mr_green_tea|10 years ago|reply
Second of all I do dislike texts on data which lack information on where the data comes from.
I can think of ways to mine present day play counts from Spotify (while not working there) but I wonder where did he get the daily counts from he used in the last chart. Any ideas?
Furthermore I doubt that Spotify is necessarily a good indicator on how songs are being perceived in the long run. Especially b/c there are local platform-specific attractor dynamics at play.
[+] [-] mfdaniels|10 years ago|reply
The data is pretty clear in terms of source...Spotify in 2014...Billboard data via Whitburn.
The data was directly from one of Spotify's data partners.
Yea Spotify isn't a perfect indicator. This is the best proxy for present-day popularity that I can think of. I could have create an index that abstracted several data sources, but that would have killed the readability of the article.
[+] [-] lcswi|10 years ago|reply
[+] [-] realusername|10 years ago|reply
[+] [-] mfdaniels|10 years ago|reply
[+] [-] flakmonkey|10 years ago|reply
[+] [-] dexterdog|10 years ago|reply
[+] [-] paublyrne|10 years ago|reply
The lyrics are a little fluff, but Ironic has a great chorus and catchy hooks. It's a classic pop song.
[+] [-] ewzimm|10 years ago|reply