There was an open source audio fingerprinting system called echoprint, which actually implemented the shazam algorithm in a way that made it hard to claim it's the same approach as shazam, but in reality it was almost the same. The hardest part about these kind of services is designing the fingerprints so that you can search them effectively. The audio part is interesting and fun, but actually less critical.
what would you say is the difference between fingerprint and using something like OpenAI's whisper approach (visual spectrogram ML) for finding the music
tangent: I'm also thinking about some fast way to search text algo maybe related to Spotify damn that was a long time ago read that article
You should be careful with this. Last time I saw an article about reproducing Shazam's algorithm, their lawyers came after them and eventually the article was removed.
There were questions as to the validity of the threats their lawyers used, but even a bulletproof case is a costly endeavor when going up against the scale of companies.
Well sure, a songs database is important. But song databases like https://acoustid.org/ exist, which let you look up songs that share the same audio "fingerprint" (https://github.com/acoustid/chromaprint). You need the full track to make that fingerprint.
Shazam can take only a tiny snippet, and can guess quite accurately just from that snippet. By comparison to AcoustID, which is also a song database (with an entirely different purpose) we can say that the "main ingredient" is Shazam's system for identifying songs from short snippets.
lukaslalinsky|1 year ago
ge96|1 year ago
tangent: I'm also thinking about some fast way to search text algo maybe related to Spotify damn that was a long time ago read that article
jmholla|1 year ago
There were questions as to the validity of the threats their lawyers used, but even a bulletproof case is a costly endeavor when going up against the scale of companies.
ziddoap|1 year ago
mannyv|1 year ago
gnabgib|1 year ago
How Shazam Works (2003) [pdf] (117 points, 11 months ago, 29 comments) https://news.ycombinator.com/item?id=40029036 - there's a lot of links to past Shazam stories in comments
jurschreuder|1 year ago
Hmm they did seem to have gotten some more customers after I left but the website is all glitchy now so I guess it's abandoned.
https://spot-on.media/
alain_gilbert|1 year ago
https://github.com/cgzirim/seek-tune
gunian|1 year ago
amiga386|1 year ago
Shazam can take only a tiny snippet, and can guess quite accurately just from that snippet. By comparison to AcoustID, which is also a song database (with an entirely different purpose) we can say that the "main ingredient" is Shazam's system for identifying songs from short snippets.
ge96|1 year ago
hmcamp|1 year ago
megadata|1 year ago
postalrat|1 year ago
vcbvbrbdfg|1 year ago
[deleted]