Downloads: 67

Files in This Item:
File Description SizeFormat 
abe.11.16.pdf4.65 MBAdobe PDFView/Open
Title: Integrating Preprocessing Operations into Deep Learning Model: Case Study of Posttreatment Visual Acuity Prediction
Authors: Otsuki, Ryo
Sugiyama, Osamu
Mori, Yuki
Miyake, Masahiro  kyouindb  KAKEN_id
Hiragi, Shusuke
Yamamoto, Goshiro  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0002-2014-7195 (unconfirmed)
Santos, Luciano
Nakanishi, Yuta
Hosoda, Yoshikatsu
Tamura, Hiroshi  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0002-7740-2732 (unconfirmed)
Matsumoto, Shigemi  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0001-6453-7489 (unconfirmed)
Tsujikawa, Akitaka  kyouindb  KAKEN_id
Kuroda, Tomohiro  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0003-1472-7203 (unconfirmed)
Author's alias: 大槻, 涼
杉山, 治
森, 雄貴
三宅, 正裕
平木, 秀輔
山本, 豪志朗
中西, 悠太
細田, 祥勝
田村, 寛
松本, 繁巳
辻川, 明孝
黒田, 知宏
Keywords: deep learning
automation of preprocessing
visual acuity prediction
Issue Date: 2022
Publisher: Japanese Society for Medical and Biological Engineering
Journal title: Advanced Biomedical Engineering
Volume: 11
Start page: 16
End page: 24
Abstract: Designing a deep neural network model that integrates clinical images with other electronic medical records entails various preprocessing operations. Preprocessing of clinical images often requires trimming of parts of the lesions shown in the images, whereas preprocessing of other electronic medical records requires vectorization of these records; for example, patient age is often converted into a categorical vector of 10-year intervals. Although these preprocessing operations are critical to the performance of the classification model, there is no guarantee that the preprocessing step chosen is appropriate for model training. The ability to integrate these preprocessing operations into a deep neural network model and to train the model, including the preprocessing operations, can help design a multi-modal medical classification model. This study proposes integration layers of preprocessing, both for clinical images and electronic medical records, in deep neural network models. Preprocessing of clinical images is realized by a vision transformer layer that selectively adopts the parts of the images requiring attention. The preprocessing of other medical electrical records is performed by adopting full-connection layers and normalizing these layers. These proposed preprocessing-integrated layers were verified using a posttreatment visual acuity prediction task in ophthalmology as a case study. This prediction task requires clinical images as well as patient profile data corresponding to each patient's posttreatment logMAR visual acuity. The performance of a heuristically designed prediction model was compared with the performance of the prediction model that includes the proposed preprocessing integration layers. The mean square errors between predicted and correct results were 0.051 for the heuristic model and 0.054 for the proposed model. Experimental results showed that the proposed model utilizing preprocessing integration layers achieved nearly the same performance as the heuristically designed model.
Rights: Copyright: ©2022 The Author(s).
This is an open access article distributed under the terms of the Creative Commons BY 4.0 International (Attribution) License, which permits the unrestricted distribution, reproduction and use of the article provided the original source and authors are credited.
URI: http://hdl.handle.net/2433/278464
DOI(Published Version): 10.14326/abe.11.16
Appears in Collections:Journal Articles

Show full item record

Export to RefWorks


Export Format: 


This item is licensed under a Creative Commons License Creative Commons