Nonparametric Bayesina Sparse Factor Analysis for Frequency Doain Blind Source Separation without Pearmuation Ambiguity

Nagira, Kohei; Otsuka, Takuma; Okuno, Hiroshi G.

ダウンロード数: 131

http://hdl.handle.net/2433/187379

このアイテムのファイル:

ファイル	記述	サイズ	フォーマット
1687-4722-2013-4.pdf		3.84 MB	Adobe PDF	見る/開く

完全メタデータレコード

DCフィールド	値	言語
dc.contributor.author	Nagira, Kohei	en
dc.contributor.author	Otsuka, Takuma	en
dc.contributor.author	Okuno, Hiroshi G.	en
dc.contributor.alternative	奥乃, 博	ja
dc.date.accessioned	2014-05-30T07:48:21Z	-
dc.date.available	2014-05-30T07:48:21Z	-
dc.date.issued	2013-01-22	-
dc.identifier.issn	1687-4714	-
dc.identifier.uri	http://hdl.handle.net/2433/187379	-
dc.description.abstract	Blind source separation (BSS) and sound activity detection (SAD) from a sound source mixture with minimum prior information are two major requirements for computational auditory scene analysis that recognizes auditory events in many environments. In daily environments, BSS suffers from many problems such as reverberation, a permutation problem in frequency-domain processing, and uncertainty about the number of sources in the observed mixture. While many conventional BSS methods resort to a cascaded combination of subprocesses, e.g., frequency-wise separation and permutation resolution, to overcome these problems, their outcomes may be affected by the worst subprocess. Our aim is to develop a unified framework to cope with these problems. Our method, called permutation-free infinite sparse factor analysis (PF-ISFA), is based on a nonparametric Bayesian framework that enables inference without a pre-determined number of sources. It solves BSS, SAD and the permutation problem at the same time. Our method has two key ideas: unified source activities for all the frequency bins and the activation probabilities of all the frequency bins of all the sources. Experiments were carried out to evaluate the separation performance and the SAD performance under four reverberant conditions. For separation performance in the BSS_EVAL criteria, our method outperformed conventional complex ISFA under all conditions. For SAD performance, our method outperformed the conventional method by 5.9–0.5% in F-measure under the condition RT20 = 30–600 [ms], respectively.	en
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	-
dc.publisher	SpringerOpen	en
dc.rights	© 2013 Nagira et al.; licensee Springer.	en
dc.rights	This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.	en
dc.title	Nonparametric Bayesina Sparse Factor Analysis for Frequency Doain Blind Source Separation without Pearmuation Ambiguity	en
dc.type	journal article	-
dc.type.niitype	Journal Article	-
dc.identifier.jtitle	EURASIP Journal on Audio, Speech, and Music Processing	en
dc.identifier.volume	2013	-
dc.relation.doi	10.1186/1687-4722-2013-4	-
dc.textversion	publisher	-
dc.identifier.artnum	4	-
dcterms.accessRights	open access	-
出現コレクション:	学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks

このリポジトリに保管されているアイテムはすべて著作権により保護されています。