ダウンロード数: 82
このアイテムのファイル:
ファイル | 記述 | サイズ | フォーマット | |
---|---|---|---|---|
s10579-022-09615-2.pdf | 1.29 MB | Adobe PDF | 見る/開く |
完全メタデータレコード
DCフィールド | 値 | 言語 |
---|---|---|
dc.contributor.author | Chu, Chenhui | en |
dc.contributor.author | Mao, Zhuoyuan | en |
dc.contributor.author | Nakazawa, Toshiaki | en |
dc.contributor.author | Kawahara, Daisuke | en |
dc.contributor.author | Kurohashi, Sadao | en |
dc.contributor.alternative | 褚, 晨翚 | ja |
dc.contributor.alternative | 毛, 卓遠 | ja |
dc.contributor.alternative | 黒橋, 禎夫 | ja |
dc.date.accessioned | 2023-08-21T10:54:18Z | - |
dc.date.available | 2023-08-21T10:54:18Z | - |
dc.date.issued | 2023-09 | - |
dc.identifier.uri | http://hdl.handle.net/2433/284725 | - |
dc.description.abstract | Word segmentation, part-of-speech (POS) tagging, and syntactic parsing are three fundamental Chinese analysis tasks for Chinese language processing, which are also crucial for various downstream tasks such as machine translation and information extraction. To achieve high accuracy for these tasks, treebanks that contain sentences manually annotated with word segmentation, part-of-speech tags, and phrase structures are essential. Although there are large-scale Chinese treebanks in the news domain, such treebanks are unavailable in the scientific domain. This significantly limits the performance of Chinese language processing for scientific text. To address this problem, we annotate the 2nd version of the Chinese treebank in the scientific domain (SCTB-V2). SCTB-V2 contains 12, 175 sentences annotated with word segmentation, part-of-speech tags, and phrase structures. We conducted Chinese analyses and machine translation experiments on SCTB-V2. The results show the effectiveness of SCTB-V2. We release this treebank to promote scientific Chinese language processing research http://nlp.ist.i.kyoto-u.ac.jp/EN/index.php?A%20Chinese%20Treebank%20 in%20Scientific%20Domain%20%28SCTB%29. | en |
dc.language.iso | eng | - |
dc.publisher | Springer Nature | en |
dc.rights | This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s10579-022-09615-2 | en |
dc.rights | The full-text file will be made open to the public on 15 October 2023 in accordance with publisher's 'Terms and Conditions for Self-Archiving'. | en |
dc.rights | This is not the published version. Please cite only the published version. この論文は出版社版でありません。引用の際には出版社版をご確認ご利用ください。 | en |
dc.subject | Treebank | en |
dc.subject | Chinese | en |
dc.subject | Scientific domain | en |
dc.title | SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain | en |
dc.type | journal article | - |
dc.type.niitype | Journal Article | - |
dc.identifier.jtitle | Language Resources and Evaluation | en |
dc.identifier.volume | 57 | - |
dc.identifier.issue | 3 | - |
dc.identifier.spage | 1389 | - |
dc.identifier.epage | 1403 | - |
dc.relation.doi | 10.1007/s10579-022-09615-2 | - |
dc.textversion | author | - |
dcterms.accessRights | open access | - |
datacite.date.available | 2023-10-15 | - |
dc.identifier.pissn | 1574-020X | - |
dc.identifier.eissn | 1574-0218 | - |
出現コレクション: | 学術雑誌掲載論文等 |
このリポジトリに保管されているアイテムはすべて著作権により保護されています。