Introduction
In this practice, the Kunpeng Performance Boundary Analyzer is used to quickly identify performance issues. Preliminary analysis indicates that DDRC read bandwidth is extremely high, suggesting a performance bottleneck. The System Profiler is then used to examine memory access statistics and cache miss events, revealing a low cache hit ratio in the application. Further analysis of the source code shows that this issue is caused by extensive memory data replication. Block processing is applied to increase the cache hit ratio and enhance program performance.
Networking Environment
This practice uses CentOS 7.6 as an example. Perform similar operations for other OSs on the Kunpeng platform.
Tuning Strategy
- Use the Kunpeng Performance Boundary Analyzer to demarcate and locate application issues.
- Use the System Profiler to further analyze the memory access statistics and miss events of the application. The DDRC read bandwidth is abnormally high. After a detailed analysis, optimize the source file and verify the effect of the optimization.
Parent topic: Practice 1: Memory Access Statistics Analysis