このアイテムのアクセス数: 73

このアイテムのファイル:
ファイル 記述 サイズフォーマット 
jnlp.28.751.pdf370.1 kBAdobe PDF見る/開く
完全メタデータレコード
DCフィールド言語
dc.contributor.authorShirai, Keisukeen
dc.contributor.authorHashimoto, Kazumaen
dc.contributor.authorEriguchi, Akikoen
dc.contributor.authorNinomiya, Takashien
dc.contributor.authorMori, Shinsukeen
dc.contributor.alternative白井, 圭佑ja
dc.contributor.alternative森, 信介ja
dc.date.accessioned2022-09-30T08:09:28Z-
dc.date.available2022-09-30T08:09:28Z-
dc.date.issued2021-
dc.identifier.urihttp://hdl.handle.net/2433/276420-
dc.description.abstractNeural text generation models that are conditioned on a given input (e.g., machine translation and image captioning) are typically trained through maximum likelihood estimation of the target text. However, models trained in this manner often suffer from various types of errors when making subsequent inferences. In this study, we propose suppressing an arbitrary type of error by training the text generation model in a reinforcement learning framework; herein, we use a trainable reward function that can discriminate between references and sentences, containing the targeted type of errors. We create such negative examples by artificially injecting the targeted errors into the references. In the experiments, we focus on two error types; repeated and dropped tokens in model-generated text. The experimental results demonstrate that our method can suppress generation errors, and achieves significant improvements on two machine translation and two image captioning tasks.en
dc.language.isoeng-
dc.publisherAssociation for Natural Language Processingen
dc.publisher.alternative言語処理学会ja
dc.rights© 2021 The Association for Natural Language Processingen
dc.rightsLicensed under CC BY 4.0en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/-
dc.subjectMachine Translationen
dc.subjectImage Captioningen
dc.subjectDiscriminatoren
dc.subjectNegative Exampleen
dc.titleNeural Text Generation with Artificial Negative Examples to Address Repeating and Dropping Errorsen
dc.typejournal article-
dc.type.niitypeJournal Article-
dc.identifier.jtitleJournal of Natural Language Processingen
dc.identifier.volume28-
dc.identifier.issue3-
dc.identifier.spage751-
dc.identifier.epage777-
dc.relation.doi10.5715/jnlp.28.751-
dc.textversionpublisher-
dcterms.accessRightsopen access-
dc.identifier.pissn1340-7619-
dc.identifier.eissn2185-8314-
dc.identifier.jtitle-alternative自然言語処理ja
出現コレクション:学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks


出力フォーマット 


このアイテムは次のライセンスが設定されています: クリエイティブ・コモンズ・ライセンス Creative Commons