How Shazam or Sound Hound works?

17,617

Solution 1

I came across a paper which explains how audio search algorithms work. Here is the link. It was written by one of the developers of Shazam, a rival application of SoundHound.

Solution 2

good links on the wikipedia page include: https://surdu.me/2011/01/20/how-does-shazam-work.html and the paper http://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf Sub_stantial noted earlier

Solution 3

Shazam Application is the one of the best Example for Application in Linked open data it takes the short sample music from the end user and identifies the song from datasets as well as link to purchase the album.

The user tags a song for 10 seconds and the application creates an audio fingerprint based on some of the anchors of the simplified spectrogram and the target area between them.

For each point of the target area, they create a hash value that is the combination of the frequency at which the anchor point is located, the frequency at which the point in the target zone is located, and the time difference between the point in the target zone and when the anchor point is located in the song.

Once the fingerprint of the audio is created, Shazam starts the search for matches in the database. If there is a match, the information is returned to the user; otherwise it returns a “song not known” dialogue.

Share:
17,617
VansFannel
Author by

VansFannel

I'm software architect, entrepreneur and self-taught passionate with new technologies. At this moment I am studying a master's degree in advanced artificial intelligence and (in my free time ) I'm developing an immersive VR application with Unreal Engine. I have also interested in home automation applying what I'm learning with Udacity's nanodegree course in object and voice recognition.

Updated on June 04, 2022

Comments

  • VansFannel
    VansFannel almost 2 years

    I'm developing an iOS application with SDK for iOS 5.0 and XCode 4.2.

    I want to develop an application that recognize sounds. I see there is an application called Sound Hound that recognize music and tells artist and title.

    How can I do something similar? I want to compare a sound to an existing sound database. How can I do that?

    Maybe I can use Fourier Transform. I don't know how to process sounds. Or it could be similar to speech recognition, isn't it?