当前位置: X-MOL 学术Sci. Program. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
MPI Runtime Error Detection with MUST: Advances in Deadlock Detection
Scientific Programming ( IF 1.672 ) Pub Date : 2013 , DOI: 10.3233/spr-130368
Tobias Hilbrich, Joachim Protze, Martin Schulz, Bronis R. de Supinski, Matthias S. Müller

The widely used Message Passing Interface (MPI) is complex and rich. As a result, application developers require automated tools to avoid and to detect MPI programming errors. We present the Marmot Umpire Scalable Tool (MUST) that detects such errors with significantly increased scalability. We present improvements to our graph-based deadlock detection approach for MPI, which cover future MPI extensions. Our enhancements also check complex MPI constructs that no previous graph-based detection approach handled correctly. Finally, we present optimizations for the processing of MPI operations that reduce runtime deadlock detection overheads. Existing approaches often require 𝒪(p) analysis time per MPI operation, for p processes. We empirically observe that our improvements lead to sub-linear or better analysis time per operation for a wide range of real world applications.

中文翻译:

带有MUST的MPI运行时错误检测:死锁检测的进展

广泛使用的消息传递接口(MPI)既复杂又丰富。结果,应用程序开发人员需要自动工具来避免和检测MPI编程错误。我们提供了Marmot Umpire可扩展工具(MUST),该工具可以显着提高可扩展性来检测此类错误。我们介绍了针对MPI的基于图的死锁检测方法的改进,其中涵盖了将来的MPI扩展。我们的增强功能还可以检查复杂的MPI构造,而以前的基于图形的检测方法均无法正确处理。最后,我们提出了用于MPI操作处理的优化,可以减少运行时死锁检测的开销。现有方法通常每个MPI操作需要𝒪(p)个分析时间,对于p流程。我们凭经验观察到,对于许多实际应用,我们的改进导致每次操作的亚线性或更好的分析时间。
更新日期:2020-09-25
down
wechat
bug