Downloads: 82

Files in this item:
File | Description | Size | Format
s10579-022-09615-2.pdf | - | 1.29 MB | Adobe PDF
Full metadata record
DC field | Value | Language
dc.contributor.author | Chu, Chenhui | en
dc.contributor.author | Mao, Zhuoyuan | en
dc.contributor.author | Nakazawa, Toshiaki | en
dc.contributor.author | Kawahara, Daisuke | en
dc.contributor.author | Kurohashi, Sadao | en
dc.contributor.alternative | 褚, 晨翚 | ja
dc.contributor.alternative | 毛, 卓遠 | ja
dc.contributor.alternative | 黒橋, 禎夫 | ja
dc.date.accessioned | 2023-08-21T10:54:18Z | -
dc.date.available | 2023-08-21T10:54:18Z | -
dc.date.issued | 2023-09 | -
dc.identifier.uri | http://hdl.handle.net/2433/284725 | -
dc.description.abstract | Word segmentation, part-of-speech (POS) tagging, and syntactic parsing are three fundamental Chinese analysis tasks for Chinese language processing, which are also crucial for various downstream tasks such as machine translation and information extraction. To achieve high accuracy for these tasks, treebanks that contain sentences manually annotated with word segmentation, part-of-speech tags, and phrase structures are essential. Although there are large-scale Chinese treebanks in the news domain, such treebanks are unavailable in the scientific domain. This significantly limits the performance of Chinese language processing for scientific text. To address this problem, we annotate the 2nd version of the Chinese treebank in the scientific domain (SCTB-V2). SCTB-V2 contains 12,175 sentences annotated with word segmentation, part-of-speech tags, and phrase structures. We conducted Chinese analyses and machine translation experiments on SCTB-V2. The results show the effectiveness of SCTB-V2. We release this treebank to promote scientific Chinese language processing research: http://nlp.ist.i.kyoto-u.ac.jp/EN/index.php?A%20Chinese%20Treebank%20in%20Scientific%20Domain%20%28SCTB%29. | en
dc.language.iso | eng | -
dc.publisher | Springer Nature | en
dc.rights | This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s10579-022-09615-2 | en
dc.rights | The full-text file will be made open to the public on 15 October 2023 in accordance with publisher's 'Terms and Conditions for Self-Archiving'. | en
dc.rights | This is not the published version. Please cite only the published version. | en
dc.subject | Treebank | en
dc.subject | Chinese | en
dc.subject | Scientific domain | en
dc.title | SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain | en
dc.type | journal article | -
dc.type.niitype | Journal Article | -
dc.identifier.jtitle | Language Resources and Evaluation | en
dc.identifier.volume | 57 | -
dc.identifier.issue | 3 | -
dc.identifier.spage | 1389 | -
dc.identifier.epage | 1403 | -
dc.relation.doi | 10.1007/s10579-022-09615-2 | -
dc.textversion | author | -
dcterms.accessRights | open access | -
datacite.date.available | 2023-10-15 | -
dc.identifier.pissn | 1574-020X | -
dc.identifier.eissn | 1574-0218 | -
Appears in Collections: Journal Articles

All items stored in this repository are protected by copyright.