Files in This Item:
File: jic094_320.pdf
Size: 4.24 MB
Format: Adobe PDF
Title: 漢字字體規範史データセット及びそのCHISEとの統合について
Other Titles: Reconstruction of the Hanzi Normative Glyphs Database as a Dataset and its Integration with CHISE
Authors: 守岡, 知彥
Author's alias: MORIOKA, Tomohiko
Issue Date: 20-Dec-2019
Publisher: 京都大學人文科學研究所
Journal title: 東方學報
Volume: 94
Start page: 320
End page: 284
Abstract: This report describes an attempt to integrate the "CHISE" (Character Information Service Environment) character ontology with the "HNG" (Hanzi Normative Glyphs) database/dataset. The CHISE character ontology is a large-scale character ontology comprising 365,000 character objects (1,460,000 triples), including Unicode characters, non-Unicode characters, and their glyphs. It was developed for CHISE, a character processing system that does not depend on character codes. The CHISE framework is based on a graph storage named "CONCORD". We developed a Web service called "EST" (or "CHISE-wiki") to display and edit CONCORD objects. The CHISE character ontology uses the "Multiple Granularity Hanzi Structure Model" to support various glyphs and multiple unification granularities of Chinese characters. This model works well for modern glyphs of Chinese characters. However, before we began the study to integrate CHISE and HNG, it was not clear whether the model is sufficient for premodern Chinese characters. In addition, designing reasonable unification rules for each unification granularity requires a variety of glyph examples of Chinese characters. For these reasons, the CHISE character ontology should be integrated with a glyph database and/or glyph corpus. We therefore tried to integrate HNG with the CHISE character ontology. From the HNG side, this integration has the following significance. The original HNG web service had been out of service since the spring of 2015. Applying our research on the integration of CHISE and HNG, we therefore provided an HNG search function and a data browsing function on the CHISE Web service. Although the difficulty of maintaining digital humanities databases over the long term has come to be recognized, a feasible method for restoring a database whose service has actually stopped, and for maintaining its data in the future, does not yet seem to be well established.
In this paper, we outline our efforts on the HNG dataset, such as publishing the dataset using a distributed version control system (Git), providing a Git hosting service independent of both the URL of a researcher's institution and platforms provided by commercial companies, and organizing a dataset preservation association. We also discuss issues related to the long-term preservation of databases.
DOI: 10.14989/250681
Appears in Collections: 第94册

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.