A VLSI processor is designed for the small-scale isolated speech recognition applications. It is a dedicated processor which detects endpoint, extracts LPC (Linear Predictive Coefficient) cepstral coefficients from the speech signal, and computes the spectral distances using a dynamic time warping(DTW) technique. The designed chip can recognize 1000 isolated words per second with an average recognition accuracy of 90.3%. It is designed in a 0.8 μm CMOS technology, includes 66,760 gates, and runs with a 10MHz clock.
ASJC Scopus subject areas
- Media Technology
- Electrical and Electronic Engineering