首站-论文投稿智能助手
典型文献
A GPU accelerated Boussinesq-type model for coastal waves
文献摘要:
This study presents an efficient Boussinesq-type wave model accelerated by a single Graphics Processing Unit (GPU). The model uses the hybrid finite volume and finite difference method to solve weakly dispersive and nonlinear Boussinesq equations in the horizontal plane, enabling the model to have the shock-capturing ability to deal with breaking waves and moving shoreline properly. The code is written in CUDA C. To achieve better performance, the model uses cyclic reduction technique to solve massive tridiagonal linear systems and overlapped tiling/shared memory to reduce global memory access and enhance data reuse. Four numerical tests are conducted to validate the GPU implementation. The performance of the GPU model is evaluated by running a series of numerical simulations on two GPU platforms with different hardware configurations. Compared with the CPU version, the maximum speedup ratios for single-precision and double-precision calculations are 55.56 and 32.57, respectively.
文献关键词:
作者姓名:
Kezhao Fang;Jiawen Sun;Guangchun Song;Gang Wang;Hao Wu;Zhongbo Liu
作者机构:
State Key Laboratory of Coastal and Offshore Engineering,Dalian University of Technology,Dalian 116024,China;National Marine Environmental Monitoring Center,Dalian 116023,China;Marine Geological Resources Survey Center of Hebei Province,Qinhuangdao 066001,China;College of Transportation Engineering,Dalian Maritime University,Dalian 116026,China
引用格式:
[1]Kezhao Fang;Jiawen Sun;Guangchun Song;Gang Wang;Hao Wu;Zhongbo Liu-.A GPU accelerated Boussinesq-type model for coastal waves)[J].海洋学报(英文版),2022(09):158-168
A类:
tridiagonal
B类:
GPU,accelerated,Boussinesq,type,model,coastal,waves,This,study,presents,efficient,by,single,Graphics,Processing,Unit,uses,hybrid,finite,volume,difference,method,solve,weakly,dispersive,nonlinear,equations,horizontal,plane,enabling,have,shock,capturing,ability,deal,breaking,moving,shoreline,properly,code,written,CUDA,To,achieve,better,performance,cyclic,reduction,technique,massive,systems,overlapped,tiling,shared,memory,reduce,global,access,enhance,data,reuse,Four,numerical,tests,conducted,validate,implementation,evaluated,running,series,simulations,two,platforms,different,hardware,configurations,Compared,CPU,version,maximum,speedup,ratios,precision,double,calculations,respectively
AB值:
0.663217
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。