ダウンロード数: 341

このアイテムのファイル:
ファイル 記述 サイズフォーマット 
cpe.3522.pdf772.86 kBAdobe PDF見る/開く
タイトル: An analysis of the feasibility and benefits of GPU/multicore acceleration of the Weather Research and Forecasting model
著者: Vanderbauwhede, Wim
Takemi, Tetsuya  kyouindb  KAKEN_id  orcid https://orcid.org/0000-0002-7596-2373 (unconfirmed)
著者名の別形: 竹見, 哲也
キーワード: general-purpose computation on graphics processing units (GPGPU)
parallelization of simulation
large-scale scientific computing
発行日: May-2016
出版者: Wiley-Blackwell
誌名: Concurrency and Computation: Practice and Experience
巻: 28
号: 7
開始ページ: 2052
終了ページ: 2072
抄録: There is a growing need for ever more accurate climate and weather simulations to be delivered in shorter timescales, in particular, to guard against severe weather events such as hurricanes and heavy rainfall. Due to climate change, the severity and frequency of such events – and thus the economic impact – are set to rise dramatically. Hardware acceleration using graphics processing units (GPUs) or Field-Programmable Gate Arrays (FPGAs) could potentially result in much reduced run times or higher accuracy simulations. In this paper, we present the results of a study of the Weather Research and Forecasting (WRF) model undertaken in order to assess if GPU and multicore acceleration of this type of numerical weather prediction (NWP) code is both feasible and worthwhile. The focus of this paper is on acceleration of code running on a single compute node through offloading of parts of the code to an accelerator such as a GPU. The governing equations set of the WRF model is based on the compressible, non-hydrostatic atmospheric motion with multi-physics processes. We put this work into context by discussing its more general applicability to multi-physics fluid dynamics codes: in many fluid dynamics codes, the numerical schemes of the advection terms are based on finite differences between neighboring cells, similar to the WRF code. For fluid systems including multi-physics processes, there are many calls to these advection routines. This class of numerical codes will benefit from hardware acceleration. We studied the performance of the original code of the WRF model and proposed a simple model for comparing multicore CPU and GPU performance. Based on the results of extensive profiling of representative WRF runs, we focused on the acceleration of the scalar advection module. We discuss the implementation of this module as a data-parallel kernel in both OpenCL and OpenMP. We show that our data-parallel kernel version of the scalar advection module runs up to seven times faster on the GPU compared with the original code on the CPU. However, as the data transfer cost between GPU and CPU is very high (as shown by our analysis), there is only a small speed-up (two times) for the fully integrated code. We show that it would be possible to offset the data transfer cost through GPU acceleration of a larger portion of the dynamics code. In order to carry out this research, we also developed an extensible software system for integrating OpenCL code into large Fortran code bases such as WRF. This is one of the main contributions of our work. We discuss the system to show how it allows the replacement of the sections of the original codebase with their OpenCL counterparts with minimal changes – literally only a few lines – to the original code. Our final assessment is that, even with the current system architectures, accelerating WRF – and hence also other, similar types of multi-physics fluid dynamics codes – with a factor of up to five times is definitely an achievable goal. Accelerating multi-physics fluid dynamics codes including NWP codes is vital for its application to weather forecasting, environmental pollution warning, and emergency response to the dispersion of hazardous materials. Implementing hardware acceleration capability for fluid dynamics and NWP codes is a prerequisite for up-to-date and future computer architectures.
著作権等: This is the accepted version of the following article: [Vanderbauwhede, W., and Takemi, T. (2016) An analysis of the feasibility and benefits of GPU/multicore acceleration of the Weather Research and Forecasting model. Concurrency Computat.: Pract. Exper., 28: 2052–2072], which has been published in final form at http://doi.org/10.1002/cpe.3522. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving.
The full-text file will be made open to the public on 8 APR 2017 in accordance with publisher's 'Terms and Conditions for Self-Archiving'.
この論文は出版社版でありません。引用の際には出版社版をご確認ご利用ください。
This is not the published version. Please cite only the published version.
URI: http://hdl.handle.net/2433/218036
DOI(出版社版): 10.1002/cpe.3522
出現コレクション:学術雑誌掲載論文等

アイテムの詳細レコードを表示する

Export to RefWorks


出力フォーマット 


このリポジトリに保管されているアイテムはすべて著作権により保護されています。