Visual voice activity detection via chaos based lip motion measure robust under illumination changes

Taeyup Song, Kyungsun Lee, Hanseok Ko

Research output: Contribution to journalArticlepeer-review

9 Citations (Scopus)

Abstract

In this paper, a vision based voice activity detection (VVAD) algorithm is proposed using chaos theory. In conventional VVAD algorithm, the movement measure of lip region is found by applying an optical flow algorithm to detect the visual speech frame using a motion based energy feature set. However, since motion based feature is unstable under illumination changes, a new form of robust feature set is desirable. It is propositioned that contextual changes such as lip opening or closing motion during speech utterances under illumination variation can be observed as chaos-like and the resultant complex fractal trajectories in phase space can be observed. The fractality is measured in phase space from two sequential video input frames and subsequently any visual speech frames are robustly detected. Representative experiments are performed in image sequence containing a driver scene undergoing illumination fluctuations in moving vehicle environment. Experimental results indicate that a substantial improvement is obtained in terms of achieving significantly lower false alarm rate over the conventional method.

Original languageEnglish
Article number6852001
Pages (from-to)251-257
Number of pages7
JournalIEEE Transactions on Consumer Electronics
Volume60
Issue number2
DOIs
Publication statusPublished - 2014 Jan 1

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Media Technology

Fingerprint

Dive into the research topics of 'Visual voice activity detection via chaos based lip motion measure robust under illumination changes'. Together they form a unique fingerprint.

Cite this