Chen Hongquan (陈红全)

Personal Information

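Professor

Admissions disciplines:
Mechanics -- [accepting doctoral and master's students] -- College of Aeronautics
Mechanical Engineering -- [accepting master's students] -- College of Aeronautics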

Gender: Male

Education: Nanjing Aeronautical Institute

Degree: Doctor of Engineering

Affiliation: College of Aeronautics

Office: C12-202

Contact: hqchenam@nuaa.edu.cn

Publications

A multi-layered point reordering study of GPU-based meshless method for compressible flow simulations

Affiliation: College of Aeronautics

Journal: J. Comput. Sci.

Abstract: In meshless methods, irregularly distributed clouds of points are widely used to discretize computational domains and are usually unavoidable when accommodating complex geometries. However, the irregularity of the points has been reported to degrade the GPU memory access pattern, which results in low performance in GPU computations. To remedy this, a multi-layered point reordering (MLPRO) approach is proposed in this paper for GPU-based meshless implementations. Layer structures based on the virtual connections between central and satellite points in meshless clouds are constructed and used to reorder the points in a layer-by-layer manner. In addition, point reordering inside each thread warp, which is rarely considered in GPU implementations, is addressed by a supplemental group satellite reordering, giving a modified MLPRO approach. Furthermore, by defining virtual connectivity matrices for the meshless clouds of points over the whole computational domain, the effect of the reordering on data locality can be visualized, giving a comprehensive view of the improvement in point locality. Supersonic flows in a rectangular channel are first used to test how strongly the irregularity of meshless points affects GPU performance, by increasing the percentage of irregular points in the computational domain. Flows over two- and three-dimensional aerodynamic configurations are then simulated to show the performance of the reordering approaches presented. Numerical results show that significant enhancements of GPU speedups can be achieved for all test cases, particularly for the three-dimensional M6 wing and RAE wing-body combination cases, with up to 2.5× further speedups, which is meaningful for simulations with large-scale irregular meshless clouds of points. © 2019
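The layer-by-layer idea in the abstract can be illustrated with a minimal sketch. This is not the paper's MLPRO algorithm: it is a plain breadth-first layering over the central-to-satellite connections, offered only to show why renumbering by layers tends to pull connected points closer together in memory. The function names `layered_reorder` and `bandwidth`, the seed choice, and the tiny example cloud are all hypothetical, and the intra-warp group satellite reordering is not reproduced here.

```python
from collections import deque

def layered_reorder(clouds, seed=0):
    """Breadth-first, layer-by-layer renumbering (illustrative only).

    clouds[i] lists the satellite points of central point i.
    Returns `order`, where order[k] is the old index of the point
    that is placed at new position k.
    """
    n = len(clouds)
    visited = [False] * n
    order = []
    # Try the seed first, then any point not yet reached (disconnected parts).
    for start in [seed] + [i for i in range(n) if i != seed]:
        if visited[start]:
            continue
        visited[start] = True
        queue = deque([start])
        while queue:
            p = queue.popleft()
            order.append(p)
            # Satellites of p form (part of) the next layer.
            for s in sorted(clouds[p]):
                if not visited[s]:
                    visited[s] = True
                    queue.append(s)
    return order

def bandwidth(clouds, order):
    """Largest index distance between a central point and any of its
    satellites under the numbering `order` (a simple locality measure,
    corresponding to the bandwidth of the connectivity matrix)."""
    pos = {old: new for new, old in enumerate(order)}
    return max(
        (abs(pos[c] - pos[s]) for c, sats in enumerate(clouds) for s in sats),
        default=0,
    )

# Hypothetical example: five points along a line, numbered in scrambled order,
# so that physical neighbours are 0-3, 3-1, 1-4, 4-2.
clouds = [[3], [3, 4], [4], [0, 1], [1, 2]]
original = list(range(len(clouds)))
reordered = layered_reorder(clouds, seed=0)

print("new order:", reordered)
print("bandwidth before:", bandwidth(clouds, original))
print("bandwidth after:", bandwidth(clouds, reordered))
```

In this toy case the largest index gap between a point and its satellites drops from 3 to 1 after reordering. A smaller gap means neighbouring threads are more likely to read nearby memory addresses, which is the kind of locality improvement the abstract attributes to reordering.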

ISSN: 1877-7503

Publication date: 2019-04-01

Co-authors: Cao, Cheng; 张加乐; Xu, Sheng-Guan

Corresponding authors: Cao, Cheng; 陈红全