Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood

Gomez, Randy; Kawahara, Tatsuya

ダウンロード数: 514

http://hdl.handle.net/2433/128840

このアイテムのファイル:

ファイル	記述	サイズ	フォーマット
TASL.2010.2052610.pdf		695.39 kB	Adobe PDF	見る/開く

完全メタデータレコード

DCフィールド	値	言語
dc.contributor.author	Gomez, Randy	en
dc.contributor.author	Kawahara, Tatsuya	en
dc.contributor.alternative	河原, 達也	ja
dc.date.accessioned	2010-10-19T02:09:26Z	-
dc.date.available	2010-10-19T02:09:26Z	-
dc.date.issued	2010-09	-
dc.identifier.issn	1558-7916	-
dc.identifier.uri	http://hdl.handle.net/2433/128840	-
dc.description.abstract	Automatic speech recognition (ASR) in reverberant environments is a challenging task. Most dereverberation techniques address this problem through signal processing and enhances the reverberant waveform independent from the speech recognizer. In this paper, we propose a novel scheme to perform dereverberation in relation with the likelihood of the back-end ASR system. Our proposed approach effectively selects the dereverberation parameters, in the form of multiband scale factors, so that they improve the likelihood of the acoustic model. Then, the acoustic model is retrained using the optimal parameters. During the recognition phase, we implement additional optimization of the parameters. By using Gaussian mixture model (GMM), the process for selecting the scale factors become efficient. Moreover, we remove the dependency of the adopted dereverberation technique on the room impulse response (RIR) measurement, by using an artificial RIR generator and selecting based on the acoustic likelihood. Experimental results show significant improvement in recognition performance with the proposed method over the conventional approach.	en
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	-
dc.publisher	IEEE	en
dc.rights	© 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.	en
dc.title	Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood	en
dc.type	journal article	-
dc.type.niitype	Journal Article	-
dc.identifier.ncid	AA12103538	-
dc.identifier.jtitle	IEEE Transactions on Audio, Speech, and Language Processing	en
dc.identifier.volume	18	-
dc.identifier.issue	7	-
dc.identifier.spage	1708	-
dc.identifier.epage	1716	-
dc.relation.doi	10.1109/TASL.2010.2052610	-
dc.textversion	publisher	-
dcterms.accessRights	open access	-
出現コレクション:	学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks

このリポジトリに保管されているアイテムはすべて著作権により保護されています。