A monotonic statistical machine translation approach to speaking style transformation

Neubig, Graham; Akita, Yuya; Mori, Shinsuke; Kawahara, Tatsuya

このアイテムのアクセス数: 451

http://hdl.handle.net/2433/157359

このアイテムのファイル:

ファイル	記述	サイズ	フォーマット
j.csl.2012.02.003.pdf		517.86 kB	Adobe PDF	見る/開く

完全メタデータレコード

DCフィールド	値	言語
dc.contributor.author	Neubig, Graham	en
dc.contributor.author	Akita, Yuya	en
dc.contributor.author	Mori, Shinsuke	en
dc.contributor.author	Kawahara, Tatsuya	en
dc.date.accessioned	2012-07-05T02:17:55Z	-
dc.date.available	2012-07-05T02:17:55Z	-
dc.date.issued	2012-10	-
dc.identifier.issn	0885-2308	-
dc.identifier.uri	http://hdl.handle.net/2433/157359	-
dc.description.abstract	This paper presents a method for automatically transforming faithful transcripts or ASR results into clean transcripts for human consumption using a framework we label speaking style transformation (SST). We perform a detailed analysis of the types of corrections performed by human stenographers when creating clean transcripts, and propose a model that is able to handle the majority of the most common corrections. In particular, the proposed model uses a framework of monotonic statistical machine translation to perform not only the deletion of disfluencies and insertion of punctuation, but also correction of colloquial expressions, insertions of omitted words, and other transformations. We provide a detailed description of the model implementation in the weighted finite state transducer (WFST) framework. An evaluation of the proposed model on both faithful transcripts and speech recognition results of parliamentary and lecture speech demonstrates the effectiveness of the proposed model in performing the wide variety of corrections necessary for creating clean transcripts.	en
dc.format.mimetype	application/pdf	-
dc.language.iso	eng	-
dc.publisher	Elsevier Ltd.	en
dc.rights	© 2012 Elsevier Ltd.	en
dc.rights	この論文は出版社版でありません。引用の際には出版社版をご確認ご利用ください。	ja
dc.rights	This is not the published version. Please cite only the published version.	en
dc.subject	Rich transcription	en
dc.subject	Speaking style transformation	en
dc.subject	Disfluency detection	en
dc.subject	Weighted finite state transducers	en
dc.subject	Monotonic machine translation	en
dc.title	A monotonic statistical machine translation approach to speaking style transformation	en
dc.type	journal article	-
dc.type.niitype	Journal Article	-
dc.identifier.ncid	AA10677208	-
dc.identifier.jtitle	Computer Speech & Language	en
dc.identifier.volume	26	-
dc.identifier.issue	5	-
dc.identifier.spage	349	-
dc.identifier.epage	370	-
dc.relation.doi	10.1016/j.csl.2012.02.003	-
dc.textversion	author	-
dcterms.accessRights	open access	-
出現コレクション:	学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks

このリポジトリに保管されているアイテムはすべて著作権により保護されています。