Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics

Lane, Ian; Kawahara, Tatsuya; Matsui, Tomoko; Nakamura, Satoshi

ダウンロード数: 808

http://hdl.handle.net/2433/128902

このアイテムのファイル:

ファイル	記述	サイズ	フォーマット
TASL.2006.876727.pdf		735.71 kB	Adobe PDF	見る/開く

完全メタデータレコード

DCフィールド	値	言語
dc.contributor.author	Lane, Ian	en
dc.contributor.author	Kawahara, Tatsuya	en
dc.contributor.author	Matsui, Tomoko	en
dc.contributor.author	Nakamura, Satoshi	en
dc.contributor.alternative	河原, 達也	ja
dc.date.accessioned	2010-10-21T01:45:25Z	-
dc.date.available	2010-10-21T01:45:25Z	-
dc.date.issued	2007-01	-
dc.identifier.issn	1558-7916	-
dc.identifier.uri	http://hdl.handle.net/2433/128902	-
dc.description.abstract	One significant problem for spoken language systems is how to cope with users' out-of-domain (OOD) utterances which cannot be handled by the back-end application system. In this paper, we propose a novel OOD detection framework, which makes use of the classification confidence scores of multiple topics and applies a linear discriminant model to perform in-domain verification. The verification model is trained using a combination of deleted interpolation of the in-domain data and minimum-classification-error training, and does not require actual OOD data during the training process, thus realizing high portability. When applied to the "phrasebook" system, a single utterance read-style speech task, the proposed approach achieves an absolute reduction in OOD detection errors of up to 8.1 points (40% relative) compared to a baseline method based on the maximum topic classification score. Furthermore, the proposed approach realizes comparable performance to an equivalent system trained on both in-domain and OOD data, while requiring no OOD data during training. We also apply this framework to the "machine-aided-dialogue" corpus, a spontaneous dialogue speech task, and extend the framework in two manners. First, we introduce topic clustering which enables reliable topic confidence scores to be generated even for indistinct utterances, and second, we implement methods to effectively incorporate dialogue context. Integration of these two methods into the proposed framework significantly improves OOD detection performance, achieving a further reduction in equal error rate (EER) of 7.9 points.	en
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	-
dc.publisher	IEEE	en
dc.rights	© 2007 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.	en
dc.title	Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics	en
dc.type	journal article	-
dc.type.niitype	Journal Article	-
dc.identifier.ncid	AA12103538	-
dc.identifier.jtitle	IEEE Transactions on Audio, Speech and Language Processing	en
dc.identifier.volume	15	-
dc.identifier.issue	1	-
dc.identifier.spage	150	-
dc.identifier.epage	161	-
dc.relation.doi	10.1109/TASL.2006.876727	-
dc.textversion	publisher	-
dcterms.accessRights	open access	-
出現コレクション:	学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks

このリポジトリに保管されているアイテムはすべて著作権により保護されています。