Downloads: 63

Files in this item:
File: ACCESS.2022.3156073.pdf
Size: 2.79 MB
Format: Adobe PDF
Title: Fine Grain Synthetic Educational Data: Challenges and Limitations of Collaborative Learning Analytics
Authors: Flanagan, Brendan (ORCID: https://orcid.org/0000-0001-7644-997X, unconfirmed)
Majumdar, Rwitajit (ORCID: https://orcid.org/0000-0003-4671-0238, unconfirmed)
Ogata, Hiroaki (ORCID: https://orcid.org/0000-0001-5216-1576, unconfirmed)
Alternative author name: 緒方, 広明
Keywords: Synthetic learner data
student modeling
data sharing
data challenge
Issue date: 2022
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Journal title: IEEE Access
Volume: 10
Start page: 26230
End page: 26241
Abstract: While data privacy is a key aspect of Learning Analytics, it often creates difficulty when promoting research into underexplored contexts as it limits data sharing. To overcome this problem, the generation of synthetic data has been proposed and discussed within the LA community. However, there has been little work that has explored the use of synthetic data in real-world situations. This research examines the effectiveness of using synthetic data for training academic performance prediction models, and the challenges and limitations of using the proposed data sharing method. To evaluate the effectiveness of the method, we generate synthetic data from a private dataset, and distribute it to the participants of a data challenge to train prediction models. Participants submitted their models as Docker containers for evaluation and ranking on holdout synthetic data. A post-hoc analysis was conducted on the top 10 participants' models by comparing the evaluation of their performance on synthetic and private validation datasets. Several models trained on synthetic data were found to perform significantly worse when applied to the non-synthetic private dataset. The main contribution of this research is to understand the challenges and limitations of applying predictive models trained on synthetic data in real-world situations. Due to these challenges, the paper recommends model designs that can inform future successful adoption of synthetic data in real-world educational data systems.
Rights: This work is licensed under a Creative Commons Attribution 4.0 License.
URI: http://hdl.handle.net/2433/279306
DOI (publisher version): 10.1109/ACCESS.2022.3156073
Appears in collections: Journal articles, etc.

This item is licensed under a Creative Commons license.