Dr. Dobb's is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.


Channels ▼
RSS

Web Development

Music Search Engine: Identifying Musically Meaningful Words


Electrical engineers and computer scientists at the University of California, San Diego are working together on a computerized system that will make it easy for people who are not music experts to find the kind of music they want to listen to -" without knowing the names of artists or songs.

In A Game-Based Approach for Collecting Semantic Annotations of Music, Doug Turnbull, Luke Barrington, and Gert Lanckriet demonstrate that the online music game they created provides crucial data for building the back-end of a music search engine that allows users to type in words to find songs.

"When my mom gets up in the morning and is like, 'I need some energy to go jogging,' she has no clue what title or artist is going to help her with that," said Gert Lanckriet, the electrical engineering professor overseeing the project.

What Lanckriet's mom needs is a "Google for music" -" a search engine for music that lets you type in regular words like "high energy instrumental with piano, " "funky guitar solos" or "upbeat music with female vocals, " and get songs in return.

One option for creating this kind of natural-language music search engine is to manually annotate as many songs as possible -" but this is expensive and limits the depth and breadth of the search engine's reach. Another option is to train computers to do the song annotations.

The researchers have, in fact, built such a system over the last two years. They call it a "computer audition system." You feed it songs and it annotates them, thanks to a series of algorithms they created. Once a song is annotated, you can retrieve it using a text-based search engine. But before the system can start annotating songs, it has to be trained "- via a process of machine learning. Getting enough data to properly train the system to label a wide range of music accurately is difficult.

According to Lanckriet, an online music game they created, called Listen Game, is capable of capturing the crucial word-song combinations that are needed to train their system to label large numbers of songs automatically.

Listen Game is an online, multiplayer game prototype that gets players to label songs with words. Like the popular image labeling games, such as ESP game, Peekaboom and Phetch, Listen Game relies on people surfing the Web to generate valuable data while playing the game. For Listen Game, the "human computation" occurs when users listen to clips of songs and determine which words are most and least relevant to the song. Users earn points when they pick the same words as others who are playing at the same time and listening to the same song clips.

Researchers use these "semantic annotations of music" to train their home-grown computer system to label songs it has not previously encountered.

"We've shown "- in academic terms -" that our game works. We're close to the performance we get with comparable survey data from the music undergrads we paid to fill out music surveys," said Doug Turnbull, a computer science Ph.D. student who is working on the project.


Related Reading


More Insights






Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Dr. Dobb's encourages readers to engage in spirited, healthy debate, including taking us to task. However, Dr. Dobb's moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing or spam. Dr. Dobb's further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.