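Performance Test and Analysis of Alltoall on Domestic Hundred-Teraflops-Class Cluster Systems (国产百万亿次机群系统Alltoall性能测试与分析)
Rao Li (饶立); Zhang Yunquan (张云泉); Li Yucheng (李玉成)
Journal: Computer Science (计算机科学)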
2010
Volume: 37; Issue: 8; Pages: 186-188, 207
Keywords: collective communication; Alltoall; Dawning 5000A; performance test and analysis
ISSN: 1002-137X
Alternative title: Performance test and analysis of Alltoall collective communication on domestic hundred trillion times cluster system
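Abstract (translated from Chinese): As high-performance computers are applied and developed, parallel applications use more and more processors and the volume of interprocess communication keeps growing, which strongly affects application performance. While benchmarking Dawning 5000A with a fast Fourier transform, HFFT, the authors found that the large communication overhead of the MPI collective communication function MPI_Alltoall is the bottleneck in the design of the parallel program. They therefore tested and analyzed the existing mainstream Alltoall algorithms on Dawning 5000A and Deepcomp 7000, in the hope of contributing to future Alltoall optimization work. Several Alltoall algorithms were measured with different message lengths and different process counts: the 2D mesh algorithm, the 3D mesh algorithm, the Bruck algorithm, the native algorithm, the pairwise exchange algorithm, the recursive doubling algorithm, the ring algorithm, and the simple algorithm in LAM/MPI. The experimental results show that for short messages the native and Bruck algorithms perform better on Dawning 5000A, while on Deepcomp 7000 the fastest algorithms are the simple algorithm and the Bruck algorithm; for long messages the best algorithm on Dawning 5000A is the ring algorithm, and on Deepcomp 7000 pairwise exchange performs best.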
Subject: Automation & Control Systems
Language: Chinese
Date available: 2011-05-23
Notes: With the rapid development of high-performance computers, more and more cores are used, which leads to more and more communication and greatly degrades the performance of parallel applications. While testing the performance of Dawning 5000A with a fast Fourier transform (HFFT), we found that the large overhead of MPI_Alltoall is the bottleneck of HFFT. This paper therefore tests and analyzes the leading Alltoall algorithms on Dawning 5000A and Deepcomp 7000, with the aim of contributing to further collective communication optimization. The leading Alltoall algorithms (2D_Mesh, 3D_Mesh, Bruck, MPICH native, Pair, recursive doubling, Ring, and LAM/MPI simple) are described and tested with different message sizes and core counts. The conclusion is that for short messages MPICH native and Bruck perform well on Dawning 5000A, while the least time-consuming algorithms on Deepcomp 7000 are LAM/MPI simple and Bruck; for medium and large messages, the best choice on Dawning 5000A is Ring, while the optimal algorithm on Deepcomp 7000 is Pair.
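To make the Alltoall data movement named in the abstract concrete, the sketch below simulates two exchange schedules in plain Python, without MPI. It is only an illustration under stated assumptions, not the paper's implementations: the function names (`alltoall_reference`, `alltoall_pairwise_xor`, `alltoall_shifted`) are hypothetical, the XOR pairing is one common formulation of pairwise exchange for power-of-two process counts, and the shifted schedule is a ring-like ordering that may differ from the Ring algorithm the authors measured.

```python
def alltoall_reference(send):
    """Definition of Alltoall: rank r ends up with send[s][r] from every rank s."""
    p = len(send)
    return [[send[s][r] for s in range(p)] for r in range(p)]


def alltoall_pairwise_xor(send):
    """Simulated pairwise exchange (assumed XOR pairing, power-of-two ranks only).

    In step k (1 <= k < p) rank r swaps one block with partner r XOR k,
    so every step consists of disjoint pairs exchanging data.
    """
    p = len(send)
    if p & (p - 1):
        raise ValueError("XOR pairing needs a power-of-two process count")
    recv = [[None] * p for _ in range(p)]
    for r in range(p):
        recv[r][r] = send[r][r]  # local block, no communication
    for step in range(1, p):
        for r in range(p):
            partner = r ^ step
            # rank r receives the block that its partner addressed to r
            recv[r][partner] = send[partner][r]
    return recv


def alltoall_shifted(send):
    """Simulated shifted schedule (ring-like ordering, any process count).

    In step k rank r sends to (r + k) mod p; equivalently, each rank
    receives from (r - k) mod p in that step.
    """
    p = len(send)
    recv = [[None] * p for _ in range(p)]
    for step in range(p):
        for r in range(p):
            dst = (r + step) % p
            recv[dst][r] = send[r][dst]
    return recv


if __name__ == "__main__":
    p = 4
    # Block (src, dst) is the data that rank src addresses to rank dst.
    send = [[(src, dst) for dst in range(p)] for src in range(p)]
    assert alltoall_pairwise_xor(send) == alltoall_reference(send)
    assert alltoall_shifted(send) == alltoall_reference(send)
```

Both schedules deliver the same final buffers; they differ only in which ranks communicate in each step, which is the kind of difference whose cost depends on message size and on the interconnect.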
Content type: Journal article
Source URL: http://124.16.136.157/handle/311060/9852
Collection: Institute of Software_Parallel Computing Laboratory_Journal Articles
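Recommended citation
GB/T 7714
Rao Li, Zhang Yunquan, Li Yucheng. Performance test and analysis of Alltoall on domestic hundred-teraflops-class cluster systems[J]. Computer Science, 2010, 37(8): 186-188, 207.
APA Rao Li, Zhang Yunquan, & Li Yucheng. (2010). Performance test and analysis of Alltoall on domestic hundred-teraflops-class cluster systems. Computer Science, 37(8), 186-188, 207.
MLA Rao Li, et al. "Performance test and analysis of Alltoall on domestic hundred-teraflops-class cluster systems". Computer Science 37.8 (2010): 186-188, 207.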