Cases
- After enabling P2P on Arm, the LAMMPS application achieves a 100% performance improvement with two GPUs and a 249% improvement with four GPUs.
- After enabling P2P on Arm, the GROningen MAchine for Chemical Simulations (GROMACS) application achieves performance comparable to that on x86 with one or two GPUs, and delivers a 9.4% performance gain over x86 with four GPUs.

Table 1 shows the acceleration effect for AMBER and LAMMPS after P2P is enabled.
|
Number of GPUs |
GPU ID |
Amber |
LAMMPS |
||||
|---|---|---|---|---|---|---|---|
|
P2P Disabled |
P2P Enabled |
Performance Improvement |
P2P Disabled |
P2P Enabled |
Performance Improvement |
||
|
One |
134 |
134 |
+0% |
130 |
132 |
+0% |
|
|
Two |
0 and 1 |
27 |
125 |
+362% |
86 |
229 |
+166% |
|
1 and 3 |
26 |
120 |
+361% |
82 |
228 |
+178% |
|
|
2 and 3 |
27 |
125 |
+362% |
89 |
228 |
+156% |
|
|
Four |
22 |
89 |
+280% |
99 |
387 |
+290% |
|
Parent topic: P2P Communication Optimization