典型文献
An MPI+OpenACC-Based PRM Scalar Advection Scheme in the GRAPES Model over a Cluster with Multiple CPUs and GPUs
文献摘要:
A moisture advection scheme is an essential module of a numerical weather/climate model representing the horizontal transport of water vapor.The Piecewise Rational Method (PRM) scalar advection scheme in the Global/Regional Assimilation and Prediction System (GRAPES) solves the moisture flux advection equation based on PRM.Computation of the scalar advection involves boundary exchange,and computation of higher bandwidth requirements is complicated and time-consuming in GRAPES.Recently,Graphics Processing Units (GPUs) have been widely used to solve scientific and engineering computing problems owing to advancements in GPU hardware and related programming models such as CUDA/OpenCL and Open Accelerator (OpenACC).Herein,we present an accelerated PRM scalar advection scheme with Message Passing Interface (MPI) and OpenACC to fully exploit GPUs' power over a cluster with multiple Central Processing Units (CPUs) and GPUs,together with optimization of various parameters such as minimizing data transfer,memory coalescing,exposing more parallelism,and overlapping computation with data transfers.Results show that about 3.5 times speedup is obtained for the entire model running at medium resolution with double precision when comparing the scheme's elapsed time on a node with two GPUs(NVIDIA P100) and two 16-core CPUs (Intel Gold 6142).Further,results obtained from experiments of a higher resolution model with multiple GPUs show excellent scalability.
文献关键词:
中图分类号:
作者姓名:
Huadong Xiao;Yang Lu;Jianqiang Huang;Wei Xue
作者机构:
Institute of Geodesy and Geophysics,Chinese Academy of Sciences,Wuhan 430074,China,University of Chinese Academy of Sciences,Beijing 100049,China;National Meteorological Information Center,Beijing 100081,China;Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;Department of Computer Technology and Application,Qinghai University,Xining 810016,China
文献出处:
引用格式:
[1]Huadong Xiao;Yang Lu;Jianqiang Huang;Wei Xue-.An MPI+OpenACC-Based PRM Scalar Advection Scheme in the GRAPES Model over a Cluster with Multiple CPUs and GPUs)[J].清华大学学报自然科学版(英文版),2022(01):164-173
A类:
MPI+OpenACC,Advection,OpenACC
B类:
An,Based,PRM,Scalar,Scheme,GRAPES,Model,Cluster,Multiple,CPUs,GPUs,moisture,advection,scheme,essential,module,numerical,weather,climate,representing,horizontal,transport,water,vapor,Piecewise,Rational,Method,scalar,Global,Regional,Assimilation,Prediction,System,solves,flux,equation,Computation,involves,boundary,exchange,computation,higher,bandwidth,requirements,complicated,consuming,Recently,Graphics,Processing,Units,have,been,widely,used,scientific,engineering,computing,problems,owing,advancements,hardware,related,programming,models,such,CUDA,OpenCL,Accelerator,Herein,accelerated,Message,Passing,Interface,fully,exploit,power,cluster,multiple,Central,together,optimization,various,parameters,minimizing,data,memory,coalescing,exposing,more,parallelism,overlapping,transfers,Results,show,that,about,times,speedup,obtained,entire,running,medium,resolution,double,precision,when,comparing,elapsed,node,two,NVIDIA,P100,core,Intel,Gold,Further,results,from,experiments,excellent,scalability
AB值:
0.602383
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。