Benchmarking GPU acceleration of a parallel molecular dynamics simulation
Abstract
The parallel speedup of component subroutines in a molecular dynamics simulation is measured after implementing MPI-based CPU parallelization and GPU acceleration. GPU acceleration was observed to yield an order of magnitude reduction in simulation times (13.9×–23.6× relative to a serial run). Slightly less speedup is achieved by CPU parallelization (7.5×) over nine processors. Performance scaling is diminished by communication operations as the number of processes launched begin to exceed the number of available physical CPU cores.