Downloads: 432

Files in This Item:
File Description SizeFormat 
TASL.2010.2052610.pdf695.39 kBAdobe PDFView/Open
Title: Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
Authors: Gomez, Randy
Kawahara, Tatsuya  kyouindb  KAKEN_id  orcid (unconfirmed)
Author's alias: 河原, 達也
Issue Date: Sep-2010
Publisher: IEEE
Journal title: IEEE Transactions on Audio, Speech, and Language Processing
Volume: 18
Issue: 7
Start page: 1708
End page: 1716
Abstract: Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.
Rights: © 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
DOI(Published Version): 10.1109/TASL.2010.2052610
Appears in Collections:Journal Articles

Show full item record

Export to RefWorks

Export Format: 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.