[MGI37-P15] Performance evaluation of inter-node MPI parallel TensorFlow for massive learning data
Keywords: machine learning, inter-node parallel
TensorFlow is a machine learning (ML) framework developed by Google and used worldwide. Google has developed an inter-node parallel version of TensorFlow; however, it is not well suited to supercomputers because, for example, the local computing nodes that participate in the calculation must be specified directly. To overcome this problem, Uber has released Horovod, which parallelizes TensorFlow over MPI and can therefore use the MPI library tuned for the supercomputer. Through a collaboration between Cray and Kyoto University, a more optimized MPI-based TensorFlow, called the CPE ML Plugin, has been introduced on the supercomputer of Kyoto University. In this talk, we present a performance evaluation of this MPI-based TensorFlow.