Abstract:The large scale parallel electromagnetic computation based on the supercomputer is of great significance for solving complicate electromagnetic problems in practical engineering. However, the probability of the process crash event caused by node failure in the supercomputer is much higher than that in the regular computer. Considering the incapable action for traditional electromagnetic computation to overcome the process crash event, an efficient fault-tolerance algorithm for large scale parallel high order Method of Moments (MoM) is proposed in this paper. According to the parallel higher order method of moments algorithm available, a scene protection algorithm and a scene recovery algorithm with high efficiency and reliability are designed, based on the “disk cache” and “direct memory access” technique. The efficiency of this algorithm lies on the feature of the “fixed site protection”, which makes it possible for the algorithm to work normal and ordered even encountering crash failure, while the original algorithm can only restart from the beginning. The numerical simulations demonstrate the efficiency of the fault-tolerant algorithm in dealing with the process crash, which improves greatly the reliability of the large scale parallel high order MoM.
ZHANG Y, ZHAO X, CHEN Y, et al. Massively Parallel Method of Moment in Computational Electromagnetics[M]. Xi’an: Xidian University Press, 2016: 112-171.
XU X F, CAO X Y, GAO J, et al. Parallel out-of-core calculation of electrically large objects’ RCS based on MoM [J]. Journal of Electronics & Information Technology, 2011, 33(3): 758-762. doi: 10.3724/SP.J.1146.2010.00519.
[6]
Zhang Y and Sarkar T K. Parallel Solution of Integral Equation Based EM Problems in the Frequency Domain[M]. Hoboken, NJ: Wiley-IEEE, 2009: 107-136. doi: 10.1002/ 9780470495094.
LIN Z, CHEN Y, ZHANG Y, et al. Study of the parallel higher-order MoM on a domestically-made CPU platform[J]. Journal of Xidian University, 2015, 42(3): 43-47. doi: 10. 3969/j.issn.1001-2400.2015.03.008.
[12]
ZHANG Y, LIN Z, ZHAO X, et al. Performance of a massively parallel higher-order method of moment code using
thousands of CPUs and its applications[J]. IEEE Transactions on Antennas and Propagation, 2014, 62(12): 6317-6324. doi: 10.1109/TAP.2014.2361135.
LIN Z, CHEN Y, ZHANG Y, et al. Study of super electromagnetic computing for higher-order MoM[J]. e-Science Technology & Application, 2015, 6(4): 20-28. doi: 10.11871/j.issn.1674-9480.2015.04.003.
[14]
CHEN Y, ZHANG Y, ZHANG G, et al. Hybrid MIC/CPU parallel implementation of MoM on MIC cluster for electromagnetic problems[J]. IEICE Transactions on Electronics, 2016, 99(7): 735-743. doi: 10.1587/transele.E99. C.735.
Wang S, Guan X, Wang D, et al. Solution of the electric field integral equation using higher-order method of moments[J]. Journal of Electronics & Information Technology, 2007, 29(9): 2265-2268.