Towards Scalable Unstructured Mesh Computations on Shared Memory Many-Cores
Due to data conflicts or data dependences, exploiting shared memory parallelism on unstructured mesh applications is highly challenging. The prior approaches are neither general nor scalable on emerging many-core processors. This paper presents a general and scalable shared memory approach for unstructured mesh computations. We recursively divide and reorder an unstructured mesh to construct a task dependency tree (TDT), where massive parallelism are exposed and data conflicts as well as data dependences are respected. We propose two recursion strategies to integrate our approach into popular programming models on CPUs and GPUs. We evaluate our approach by applying it to an industrial unstructured Computational Fluid Dynamics (CFD) software. Experimental results show that our approach significantly outperforms the prior shared memory approaches, delivering up to 8.1X performance improvement over the engineer-tuned implementations.
Mon 4 MarDisplayed time zone: London change
14:20 - 15:40 | High Performance ComputingMain Conference at Moorfoot Chair(s): Helen Xu Lawrence Berkeley National Laboratory | ||
14:20 20mTalk | OsirisBFT: Say No to Task Replication for Scalable Byzantine Fault Tolerant Analytics Main Conference Link to publication DOI | ||
14:40 20mTalk | Towards Scalable Unstructured Mesh Computations on Shared Memory Many-Cores Main Conference Haozhong Qiu , xuchuanfu National University of Defense Technology, Jianbin Fang National University of Defense Technology, Liang Deng China Aerodynamic Research and Development Center, Jian Zhang China Aerodynamic Research and Development Center, Qingsong Wang National University of Defense Technology, Yue Ding NOT_PROVIDED, Zhe Dai China Aerodynamic Research and Development Center, Yonggang Che National University of Defense Technology Link to publication DOI | ||
15:00 20mTalk | Extreme-scale Direct Numerical Simulation of Incompressible Turbulence on the Heterogeneous Many-core System Main Conference Jiabin Xie Sun Yat-sen University, Guangnan Feng Sun Yat-sen University, Han Huang Sun Yat-sen University, Junxuan Feng Sun Yat-sen University, Yutong Lu Sun Yat-sen University Link to publication DOI | ||
15:20 20mTalk | Pure: Evolving Message Passing To Better Leverage Shared Memory Within Nodes Main Conference James Psota Massachusetts Institute of Technology, Armando Solar-Lezama Massachusetts Institute of Technology Link to publication DOI |