Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
2537128.pdf | | 798.78 kB | Adobe PDF | |
Title: | Distortion Model Based on Word Sequence Labeling for Statistical Machine Translation |
Authors: | Goto, Isao; Utiyama, Masao; Sumita, Eiichiro; Tamura, Akihiro; Kurohashi, Sadao |
Alternative author names: | 黒橋, 禎夫 |
Keywords: | Distortion model; machine translation; reordering |
Issue date: | 1-Feb-2014 |
Publisher: | Association for Computing Machinery (ACM) |
Journal title: | ACM Transactions on Asian Language Information Processing |
Volume: | 13 |
Issue: | 1 |
Article number: | 2 |
Abstract: | This article proposes a new distortion model for phrase-based statistical machine translation. In decoding, a distortion model estimates the source word position to be translated next (subsequent position; SP) given the last translated source word position (current position; CP). We propose a distortion model that can simultaneously consider the word at the CP, the word at an SP candidate, the context of the CP and an SP candidate, relative word order among the SP candidates, and the words between the CP and an SP candidate. These considered elements are called rich context. Our model considers rich context by discriminating label sequences that specify spans from the CP to each SP candidate. It enables our model to learn the effect of relative word order among SP candidates as well as to learn the effect of distances from the training data. In contrast to the learning strategy of existing methods, our learning strategy is that the model learns preference relations among SP candidates in each sentence of the training data. This learning strategy enables consideration of all of the rich context simultaneously. In our experiments, our model had higher BLEU and RIBES scores for Japanese-English, Chinese-English, and German-English translation compared to the lexical reordering models. |
Rights: | Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author. 2014 Copyright is held by the author/owner(s). |
URI: | http://hdl.handle.net/2433/187072 |
DOI (publisher version): | 10.1145/2537128 |
Appears in collections: | Journal articles, etc. |

All items stored in this repository are protected by copyright.