ダウンロード数: 208

このアイテムのファイル:
ファイル 記述 サイズフォーマット 
1.4969503.pdf113.16 kBAdobe PDF見る/開く
タイトル: Shouted speech detection using hidden markov model with rahmonic and mel-frequency cepstrum coefficients
著者: Fukumori, Takahiro
Nakayama, Masato
Nishiura, Takanobu
Nanjo, Hiroaki
著者名の別形: 南條, 浩輝
キーワード: Speech recognition
Markov processes
Automatic speech recognition systems
Image detection systems
Cameras
発行日: Oct-2016
出版者: Acoustical Society of America (ASA)
誌名: The Journal of the Acoustical Society of America
巻: 140
号: 4
開始ページ: 3057
終了ページ: 3057
論文番号: 2aSPb7
抄録: In recent years, crime prevention systems have been developed to detect various hazardous situations. In general, the systems utilize the image information recorded by a camera to monitor the situations. It is however difficult to detect them in the blind area. To address the problem, it is required to utilize not only image information but also acoustic information occurred in such situations. Our previous study showed that two acoustic features including rahmonic and mel-frequency cepstrum coefficients (MFCCs) are effective for detecting the shouted speech. Rahmonic shows a subharmonic of fundamental frequency in the cepstrum domain, and MFCCs represent coefficients that collectively make up mel-frequency cepstrum. In this method, a shouted speech model is constructed from these features by using a gaussian mixture model (GMM). However, the previous method with GMM has difficulty in representing temporal changes of the speech features. In this study, we further expand the previous method using hidden Markov model (HMM) which has state transition to represent the temporal changes. Through objective experiments, the proposed method using HMM could achieve higher detection performance of the shouted speech than the conventional method using GMM.
著作権等: Copyright 2016 Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The following article appeared in 'The Journal of the Acoustical Society of America 140, 3057 (2016)' and may be found at https://doi.org/10.1121/1.4969503.
There are hidden parts depending on the permission condition of the publisher in this pdf.
URI: http://hdl.handle.net/2433/229399
DOI(出版社版): 10.1121/1.4969503
出現コレクション:学術雑誌掲載論文等

アイテムの詳細レコードを表示する

Export to RefWorks


出力フォーマット 


このリポジトリに保管されているアイテムはすべて著作権により保護されています。