M-MUSICS

An intelligent mobile music retrieval system

Seungmin Rho, Een Jun Hwang, Jong Hyuk Park

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure, among all similarity metrics, are still unsatisfactory. In this paper, we propose a new voice query transcription scheme. It considers the following features: note onset detection using dynamic threshold methods, fundamental frequency (F0) acquisition of each frame, and frequency realignment using K-means. We use a popularity-adaptive indexing structure called frequently accessed index (FAI) based on frequently queried tunes for indexing purposes. In addition, we propose a semi-supervised relevance feedback and query reformulation scheme based on a genetic algorithm to improve retrieval efficiency. In this paper, we extend our efforts to mobile multimedia environments and develop a mobile audio retrieval system. Experiments show our system performs satisfactory in wireless mobile multimedia environments.

Original languageEnglish
Pages (from-to)313-326
Number of pages14
JournalMultimedia Systems
Volume17
Issue number4
DOIs
Publication statusPublished - 2011 Jul 1

Fingerprint

Transcription
Genetic algorithms
Feedback
Experiments

Keywords

  • Content-based audio retrieval
  • Mobile platform
  • Relevance feedback
  • Signal processing

ASJC Scopus subject areas

  • Media Technology
  • Hardware and Architecture
  • Information Systems
  • Software
  • Computer Networks and Communications

Cite this

M-MUSICS : An intelligent mobile music retrieval system. / Rho, Seungmin; Hwang, Een Jun; Park, Jong Hyuk.

In: Multimedia Systems, Vol. 17, No. 4, 01.07.2011, p. 313-326.

Research output: Contribution to journalArticle

Rho, Seungmin ; Hwang, Een Jun ; Park, Jong Hyuk. / M-MUSICS : An intelligent mobile music retrieval system. In: Multimedia Systems. 2011 ; Vol. 17, No. 4. pp. 313-326.
@article{acee59a15aae4fe4994c541ec56ed4bb,
title = "M-MUSICS: An intelligent mobile music retrieval system",
abstract = "Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure, among all similarity metrics, are still unsatisfactory. In this paper, we propose a new voice query transcription scheme. It considers the following features: note onset detection using dynamic threshold methods, fundamental frequency (F0) acquisition of each frame, and frequency realignment using K-means. We use a popularity-adaptive indexing structure called frequently accessed index (FAI) based on frequently queried tunes for indexing purposes. In addition, we propose a semi-supervised relevance feedback and query reformulation scheme based on a genetic algorithm to improve retrieval efficiency. In this paper, we extend our efforts to mobile multimedia environments and develop a mobile audio retrieval system. Experiments show our system performs satisfactory in wireless mobile multimedia environments.",
keywords = "Content-based audio retrieval, Mobile platform, Relevance feedback, Signal processing",
author = "Seungmin Rho and Hwang, {Een Jun} and Park, {Jong Hyuk}",
year = "2011",
month = "7",
day = "1",
doi = "10.1007/s00530-010-0212-y",
language = "English",
volume = "17",
pages = "313--326",
journal = "Multimedia Systems",
issn = "0942-4962",
publisher = "Springer Verlag",
number = "4",

}

TY - JOUR

T1 - M-MUSICS

T2 - An intelligent mobile music retrieval system

AU - Rho, Seungmin

AU - Hwang, Een Jun

AU - Park, Jong Hyuk

PY - 2011/7/1

Y1 - 2011/7/1

N2 - Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure, among all similarity metrics, are still unsatisfactory. In this paper, we propose a new voice query transcription scheme. It considers the following features: note onset detection using dynamic threshold methods, fundamental frequency (F0) acquisition of each frame, and frequency realignment using K-means. We use a popularity-adaptive indexing structure called frequently accessed index (FAI) based on frequently queried tunes for indexing purposes. In addition, we propose a semi-supervised relevance feedback and query reformulation scheme based on a genetic algorithm to improve retrieval efficiency. In this paper, we extend our efforts to mobile multimedia environments and develop a mobile audio retrieval system. Experiments show our system performs satisfactory in wireless mobile multimedia environments.

AB - Accurate voice humming transcription and efficient indexing and retrieval schemes are essential to a large-scale humming-based audio retrieval system. Although much research has been done to develop such schemes, their performance in terms of precision, recall, and F-measure, among all similarity metrics, are still unsatisfactory. In this paper, we propose a new voice query transcription scheme. It considers the following features: note onset detection using dynamic threshold methods, fundamental frequency (F0) acquisition of each frame, and frequency realignment using K-means. We use a popularity-adaptive indexing structure called frequently accessed index (FAI) based on frequently queried tunes for indexing purposes. In addition, we propose a semi-supervised relevance feedback and query reformulation scheme based on a genetic algorithm to improve retrieval efficiency. In this paper, we extend our efforts to mobile multimedia environments and develop a mobile audio retrieval system. Experiments show our system performs satisfactory in wireless mobile multimedia environments.

KW - Content-based audio retrieval

KW - Mobile platform

KW - Relevance feedback

KW - Signal processing

UR - http://www.scopus.com/inward/record.url?scp=79960073733&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960073733&partnerID=8YFLogxK

U2 - 10.1007/s00530-010-0212-y

DO - 10.1007/s00530-010-0212-y

M3 - Article

VL - 17

SP - 313

EP - 326

JO - Multimedia Systems

JF - Multimedia Systems

SN - 0942-4962

IS - 4

ER -