Introduction

In this practice, the Kunpeng Performance Boundary Analyzer is used to quickly locate performance issues. Preliminary analysis shows that the DDRC (DDR controller) read bandwidth is extremely high, which points to a memory access bottleneck. The System Profiler is then used to examine memory access statistics and cache miss events, which reveal a low cache hit ratio in the application. Analysis of the source code traces the issue to extensive copying of data in memory. Block processing is then applied to increase the cache hit ratio and improve program performance.
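To illustrate what block processing can look like, the following is a minimal C sketch and not the code of the application analyzed in this practice. It assumes a hypothetical workload that copies a square matrix into its transpose. The function names and the BLOCK value of 32 are illustrative assumptions; a suitable block size depends on the cache sizes of the target processor.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical tile edge length; tune it to the cache size of the target CPU. */
#define BLOCK 32

/* Naive transposed copy: dst is written with a stride of n elements, so for
 * large n consecutive writes land on different cache lines and each line is
 * likely to be evicted before its neighbouring elements are written. */
void transpose_naive(const double *src, double *dst, size_t n)
{
    assert(src != dst); /* out-of-place copy only */
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            dst[j * n + i] = src[i * n + j];
}

/* Blocked transposed copy: the matrix is processed one BLOCK x BLOCK tile at
 * a time, so the source and destination lines of a tile can stay in the cache
 * while the tile is copied. */
void transpose_blocked(const double *src, double *dst, size_t n)
{
    assert(src != dst); /* out-of-place copy only */
    for (size_t ii = 0; ii < n; ii += BLOCK) {
        for (size_t jj = 0; jj < n; jj += BLOCK) {
            /* Clamp the tile bounds so that n need not be a multiple of BLOCK. */
            size_t i_end = (ii + BLOCK < n) ? ii + BLOCK : n;
            size_t j_end = (jj + BLOCK < n) ? jj + BLOCK : n;
            for (size_t i = ii; i < i_end; i++)
                for (size_t j = jj; j < j_end; j++)
                    dst[j * n + i] = src[i * n + j];
        }
    }
}
```

Both functions produce the same result; only the order in which memory is visited differs. Whether the blocked version reduces cache misses and DDRC read bandwidth for a given workload should be confirmed by profiling before and after the change.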

Networking Environment

This practice uses CentOS 7.6 as an example. Perform similar operations for other OSs on the Kunpeng platform.

Table 1 Networking environment

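| Item          | Description                                               |
|---------------|-----------------------------------------------------------|
| CPU           | Kunpeng processor                                         |
| OS            | CentOS 7.6                                                |
| Tool          | Kunpeng Performance Boundary Analyzer and System Profiler |
| Practice demo | Link                                                      |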

Tuning Strategy

  1. Use the Kunpeng Performance Boundary Analyzer to demarcate and locate application issues. The analysis shows that the DDRC read bandwidth is abnormally high.
  2. Use the System Profiler to further analyze the memory access statistics and cache miss events of the application.
  3. Based on the detailed analysis, optimize the source file and verify the effect of the optimization.