Improved Speaker Markov Modelling for Unsupervised Speaker Normalization

Fung, Pascale; Kuwahara, Tatsuya; Doshita, Shuji; Adda, Martine

このアイテムのアクセス数: 277

http://hdl.handle.net/2433/52475

このアイテムのファイル:

ファイル	記述	サイズ	フォーマット
soa025_049.pdf		611.85 kB	Adobe PDF	見る/開く

タイトル:	Improved Speaker Markov Modelling for Unsupervised Speaker Normalization
著者:	Fung, Pascale Kuwahara, Tatsuya Doshita, Shuji Adda, Martine
著者名の別形:	カワハラ, タツヤドウシタ, シュウジ
発行日:	1991
出版者:	INSTITUTION FOR PHONETIC SCIENCES UNIVERSITY OF KYOTO
誌名:	音声科学研究
巻:	25
開始ページ:	49
終了ページ:	58
抄録:	We propose new methods of improved speech recognition with speaker-variable information. Hidden Markov Model-based recognizers which are trained by reference speaker(s) (RS) are normalized by our two different approaches to give a better speaker-independent recognition rate. Our normalization methods are based on the same principle of inter-speaker Markov mapping. This mapping gives inter-speaker parameters which are used differently in our two approaches. The first Speaker Markov Model Converter (SMMC) converts new speaker spectral data into label data similar to that of the reference speaker utterance, which is passed directly to the recognizer. In the second Integrated Markov Model (IMM) approach, inter-speaker emission probabilities (ISE) are integrated as weights to the HMM emission probabilities. The recognizer in this case is modified according to interspeaker variable information whereas the normalization is done in context. The inter-speaker mapping in both cases are unsupervised to save new speaker (NS) effort. HMM score thresholding, template matching and DP thresholding techniques are applied to select suitable data for unsupervised mapping of NS and RS data. This mapping is done in parallel to the recognition process. Iterations are performed to improve the unsupervised mapping.
URI:	http://hdl.handle.net/2433/52475
出現コレクション:	Vol.25

アイテムの詳細レコードを表示する

Export to RefWorks

このリポジトリに保管されているアイテムはすべて著作権により保護されています。