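Usage Examples for the Kunpeng System Performance Methodology Analysis Tool
All examples assume the tool was installed from the RPM package. They use a Kunpeng 920 series server to show how to collect and analyze server system performance data.
Figure 1 Overall workflow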


- Run the collection command.
ksys collect -d 10 -i 1 -o /home/test/
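- The collection duration is set to 10 seconds, the sampling interval to 1 second, and the JSON file is generated in the "/home/test/" directory.
- When collection finishes, the Summary data is printed directly but is not saved to the JSON file.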
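If the collection step needs to be scripted, the command above can be wrapped in a small helper. The following is a minimal sketch, not part of the tool: it uses only the `-d`, `-i`, and `-o` options shown in this example, and it assumes that report files follow the `*_report.json` naming seen in the sample output. The function names are hypothetical.

```python
import subprocess
from pathlib import Path


def build_collect_command(duration_s, interval_s, output_dir):
    """Return the argv list for `ksys collect` with the options used in this example."""
    return [
        "ksys", "collect",
        "-d", str(duration_s),   # collection duration in seconds
        "-i", str(interval_s),   # sampling interval in seconds
        "-o", str(output_dir),   # directory that receives the JSON file
    ]


def newest_report(output_dir):
    """Return the most recently modified *_report.json in output_dir, or None.

    The *_report.json pattern is an assumption based on the sample file name.
    """
    reports = sorted(Path(output_dir).glob("*_report.json"),
                     key=lambda p: p.stat().st_mtime)
    return reports[-1] if reports else None


def collect_and_locate(duration_s, interval_s, output_dir):
    """Run the collection (requires ksys to be installed) and return the new report path."""
    subprocess.run(build_collect_command(duration_s, interval_s, output_dir), check=True)
    return newest_report(output_dir)


# Example (not executed here): collect_and_locate(10, 1, "/home/test/")
```

Passing the arguments as a list avoids shell quoting problems when the output directory contains spaces.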
The output of the ksys collect command looks like this (excerpt):
Data collection starts. You can press Ctrl+C to stop the task.
Starting to parse the data. This may take some time. You can press Ctrl+C to forcibly stop the task.
======================================================================CPU Metrics======================================================================
Common Microarchitecture Metrics Summary Data
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
| IPC  | PATH LENGTH  | MPKI | BPKI | L1D MPKI | L1I MPKI | L2D MPKI | L2I MPKI | DTLB MPKI | ITLB MPKI | CPU-NUM |
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
| 0.33 | 101152422747 | 3.62 | 1.22 | 3.66     | 5.88     | 2.59     | 0.43     | 4.7       | 0.27      | 256     |
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
Topdown Summary Data
+----------------------------+-------+
| Metric                     | Value |
+----------------------------+-------+
| Retiring(%)                | 5.83  |
| Frontend Bound(%)          | 11.24 |
| Fetch Latency Bound(%)     | 10.86 |
| Fetch Bandwidth Bound(%)   | 0.38  |
| Bad Speculation(%)         | 0.73  |
| Branch Mispredicts(%)      | 0.58  |
| Machine Clears(%)          | 0.15  |
| Backend Bound(%)           | 82.2  |
| Core Bound(%)              | 39.48 |
| Memory Bound(%)            | 42.72 |
| CPU-NUM                    | 256   |
+----------------------------+-------+
OS Metrics Summary Data
+------------------+------------+-------------+---------+
| context-switches | migrations | page-faults | CPU-NUM |
+------------------+------------+-------------+---------+
| 309603           | 2277       | 200448      | 256     |
+------------------+------------+-------------+---------+
INSTRUCTION Summary Data
+----------------------------------+-------+
| Metric                           | Value |
+----------------------------------+-------+
| Memory(%)                        | 26.94 |
| Load(%)                          | 23.41 |
| Store(%)                         | 3.53  |
| Integer(%)                       | 49.53 |
| Floating Point(%)                | 0.02  |
| Advanced SIMD(%)                 | 0.13  |
| Crypto(%)                        | 0.0   |
| Branches(%)                      | 23.24 |
| Immediate(%)                     | 21.27 |
| Return(%)                        | 0.84  |
| Indirect(%)                      | 1.14  |
| Barriers(%)                      | 0.08  |
| Instruction Synchronization(%)   | 0.01  |
| Data Synchronization(%)          | 0.0   |
| Data Memory(%)                   | 0.06  |
| Not Retired(%)                   | 0.06  |
+----------------------------------+-------+
Load_avg Summary Data
+--------------+--------------+---------------+
| recent 1 min | recent 5 min | recent 15 min |
+--------------+--------------+---------------+
| 0.56         | 0.44         | 0.35          |
+--------------+--------------+---------------+
Softirqs Summary Data
+----------+----------+---------+---------+---------+
| NET_TX/s | NET_RX/s | BLOCK/s | SCHED/s | CPU-NUM |
+----------+----------+---------+---------+---------+
| 0        | 0        | 0       | 4       | 256     |
+----------+----------+---------+---------+---------+
CPU_stat Summary Data
+----------------+--------------+-------------------+
| ctx_switches/s | interrupts/s | soft_interrupts/s |
+----------------+--------------+-------------------+
| 24710.0        | 58114.0      | 2207.0            |
+----------------+--------------+-------------------+
CPU_freq Summary Data
+--------------+----------+----------+
| current(MHz) | min(MHz) | max(MHz) |
+--------------+----------+----------+
| 2899.65      | 400.0    | 2900.0   |
+--------------+----------+----------+
CPU_percent Summary Data
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
| %user | %nice | %system | %idle | %iowait | %irq | %softirq | %steal | %guest | %guest_nice |
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
| 0.12  | 0.0   | 0.16    | 98.74 | 0.01    | 0.13 | 0.01     | 0.0    | 0.0    | 0.0         |
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
=================================================================Memory Access Metrics=================================================================
DDRC Summary Data
+---------------------------------+--------+-----------------+-----------------+
| DEVICE                          | NUMA   | ddrc_rd_bw MB/s | ddrc_wr_bw MB/s |
+---------------------------------+--------+-----------------+-----------------+
| DDRC DEVICE hisi_sccl3_ddrc0_0  | Node 0 | 22.28           | 9.12            |
| DDRC DEVICE hisi_sccl3_ddrc0_1  | Node 0 | 22.67           | 9.46            |
| DDRC DEVICE hisi_sccl3_ddrc2_0  | Node 0 | 22.17           | 8.7             |
| DDRC DEVICE hisi_sccl3_ddrc2_1  | Node 0 | 22.59           | 9.55            |
| DDRC DEVICE hisi_sccl3_ddrc3_0  | Node 0 | 22.67           | 9.22            |
| DDRC DEVICE hisi_sccl3_ddrc3_1  | Node 0 | 22.53           | 9.48            |
| DDRC DEVICE hisi_sccl3_ddrc5_0  | Node 0 | 22.44           | 8.76            |
| DDRC DEVICE hisi_sccl3_ddrc5_1  | Node 0 | 22.3            | 9.09            |
| DDRC DEVICE hisi_sccl1_ddrc0_0  | Node 1 | 20.33           | 12.86           |
| DDRC DEVICE hisi_sccl1_ddrc0_1  | Node 1 | 20.02           | 13.66           |
| DDRC DEVICE hisi_sccl1_ddrc2_0  | Node 1 | 20.93           | 13.06           |
| DDRC DEVICE hisi_sccl1_ddrc2_1  | Node 1 | 22.33           | 20.36           |
| DDRC DEVICE hisi_sccl1_ddrc3_0  | Node 1 | 19.64           | 12.99           |
| DDRC DEVICE hisi_sccl1_ddrc3_1  | Node 1 | 19.87           | 13.06           |
| DDRC DEVICE hisi_sccl1_ddrc5_0  | Node 1 | 20.06           | 13.41           |
| DDRC DEVICE hisi_sccl1_ddrc5_1  | Node 1 | 20.19           | 14.26           |
| DDRC DEVICE hisi_sccl11_ddrc0_0 | Node 2 | 26.54           | 12.26           |
| DDRC DEVICE hisi_sccl11_ddrc0_1 | Node 2 | 26.55           | 11.85           |
| DDRC DEVICE hisi_sccl11_ddrc2_0 | Node 2 | 27.08           | 16.91           |
| DDRC DEVICE hisi_sccl11_ddrc2_1 | Node 2 | 27.01           | 12.28           |
| DDRC DEVICE hisi_sccl11_ddrc3_0 | Node 2 | 27.49           | 20.73           |
| DDRC DEVICE hisi_sccl11_ddrc3_1 | Node 2 | 26.63           | 12.12           |
| DDRC DEVICE hisi_sccl11_ddrc5_0 | Node 2 | 26.78           | 12.31           |
| DDRC DEVICE hisi_sccl11_ddrc5_1 | Node 2 | 26.88           | 11.87           |
| DDRC DEVICE hisi_sccl9_ddrc0_0  | Node 3 | 11.87           | 5.77            |
| DDRC DEVICE hisi_sccl9_ddrc0_1  | Node 3 | 11.57           | 6.0             |
| DDRC DEVICE hisi_sccl9_ddrc2_0  | Node 3 | 11.86           | 5.88            |
| DDRC DEVICE hisi_sccl9_ddrc2_1  | Node 3 | 11.45           | 5.78            |
| DDRC DEVICE hisi_sccl9_ddrc3_0  | Node 3 | 11.64           | 5.89            |
| DDRC DEVICE hisi_sccl9_ddrc3_1  | Node 3 | 11.91           | 6.17            |
| DDRC DEVICE hisi_sccl9_ddrc5_0  | Node 3 | 11.56           | 5.25            |
| DDRC DEVICE hisi_sccl9_ddrc5_1  | Node 3 | 11.85           | 5.96            |
+---------------------------------+--------+-----------------+-----------------+
...
=======================================================================IO Metrics======================================================================
PCIE Summary Data
------------------------------------------------------------------------------------------------
Note: The bandwidth on the PCIe device side differ from the commonly understood bandwidth. For more detailed descriptions, please refer to the README.md.
+--------------------------------------------------------------+---------------+---------------+
| PCIE DEVICE                                                  | rx_rd_bw MB/s | rx_wr_bw MB/s |
+--------------------------------------------------------------+---------------+---------------+
| PCIE DEVICE 03:00.0 Signal processing controller: Huawei     | 0.0           | 0.0           |
| Technologies Co., Ltd. iBMA Virtual Network Adapter (rev 01) |               |               |
+--------------------------------------------------------------+---------------+---------------+
| PCIE DEVICE 02:00.0 VGA compatible controller: Huawei        | 0.0           | 0.0           |
| Technologies Co., Ltd. Hi171x Series [iBMC Intelligent       |               |               |
| Management system chip w/VGA support] (rev 01)               |               |               |
+--------------------------------------------------------------+---------------+---------------+
PA Summary Data
-----------------------------------------------------------------
Note: PA (Protocol Adapter) can be used to collect CPU-CPU and CPU-GPU bandwidth. For more detailed descriptions, please refer to the README.md.
+---------------------------+-----------------+-----------------+
| PA DEVICE                 | PA2Ring_bw MB/s | Ring2PA_bw MB/s |
+---------------------------+-----------------+-----------------+
| PA DEVICE hisi_sicl8_pa0  | 0.0             | 0.0             |
| PA DEVICE hisi_sicl0_pa0  | 0.0             | 0.0             |
| PA DEVICE hisi_sicl10_pa0 | 234.47          | 102.51          |
| PA DEVICE hisi_sicl2_pa0  | 194.2           | 125.32          |
+---------------------------+-----------------+-----------------+
...
...
...
Data saved successfully at /home/test/2025_08_14_15_17_25_report.json
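After collection completes, the tool prints a terminal report and writes a JSON performance data file ("/home/test/2025_08_14_15_17_25_report.json"). The terminal report covers CPU, memory access, and other dimensions. In this run, the context switch rate ctx_switches/s is relatively high at 24710.0, while the bandwidth of each DDRC device is low, between 0 MB/s and 30 MB/s, which suggests that the environment was running a compute-intensive workload.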
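To judge memory bandwidth per NUMA node rather than per DDRC device, the rows of the DDRC Summary Data table can be summed. The sketch below is illustrative only: it parses the terminal table text (one row per line), the function names are hypothetical, and the sample holds just four rows copied from the output above.

```python
from collections import defaultdict

# Four rows copied from the DDRC Summary Data table (not the full table).
SAMPLE = """\
| DDRC DEVICE hisi_sccl3_ddrc0_0  | Node 0 | 22.28           | 9.12            |
| DDRC DEVICE hisi_sccl3_ddrc0_1  | Node 0 | 22.67           | 9.46            |
| DDRC DEVICE hisi_sccl1_ddrc0_0  | Node 1 | 20.33           | 12.86           |
| DDRC DEVICE hisi_sccl1_ddrc0_1  | Node 1 | 20.02           | 13.66           |
"""


def parse_rows(text):
    """Yield (device, numa, read_mb_s, write_mb_s) for each DDRC data row."""
    for line in text.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 4 or not cells[0].startswith("DDRC DEVICE"):
            continue  # skip borders, the header row, and unrelated lines
        yield cells[0], cells[1], float(cells[2]), float(cells[3])


def bandwidth_per_node(text):
    """Sum read and write bandwidth (MB/s) per NUMA node."""
    totals = defaultdict(lambda: [0.0, 0.0])
    for _device, numa, rd, wr in parse_rows(text):
        totals[numa][0] += rd
        totals[numa][1] += wr
    return {numa: (round(rd, 2), round(wr, 2)) for numa, (rd, wr) in totals.items()}


print(bandwidth_per_node(SAMPLE))
# {'Node 0': (44.95, 18.58), 'Node 1': (40.35, 26.52)}
```

Summing all 32 rows of the table by hand gives 651.69 MB/s read and 344.07 MB/s write, which matches the "Total ddrc_rd_bw MB/s" and "Total ddrc_wr_bw MB/s" Before values in the ksys diff output later on this page.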
- Analyze the generated performance data file and produce an Excel report.
ksys report -i /home/test/2025_08_14_15_17_25_report.json -o /home/test/
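- 2025_08_14_15_17_25_report.json is the JSON file generated by the ksys collect command.
- When the analysis finishes, the Summary data is printed directly and is also saved to the Excel file together with the time series data.
- The time series data is plotted as line charts or area charts, with the timelines of all charts aligned.
The output of the ksys report command looks like this (excerpt):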
Analyzing system data... Please wait.
======================================================================CPU Metrics======================================================================
Common Microarchitecture Metrics Summary Data
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
| IPC  | PATH LENGTH  | MPKI | BPKI | L1D MPKI | L1I MPKI | L2D MPKI | L2I MPKI | DTLB MPKI | ITLB MPKI | CPU-NUM |
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
| 0.33 | 101152422747 | 3.62 | 1.22 | 3.66     | 5.88     | 2.59     | 0.43     | 4.7       | 0.27      | 256     |
+------+--------------+------+------+----------+----------+----------+----------+-----------+-----------+---------+
Topdown Summary Data
+----------------------------+-------+
| Metric                     | Value |
+----------------------------+-------+
| Retiring(%)                | 5.83  |
| Frontend Bound(%)          | 11.24 |
| Fetch Latency Bound(%)     | 10.86 |
| Fetch Bandwidth Bound(%)   | 0.38  |
| Bad Speculation(%)         | 0.73  |
| Branch Mispredicts(%)      | 0.58  |
| Machine Clears(%)          | 0.15  |
| Backend Bound(%)           | 82.2  |
| Core Bound(%)              | 39.48 |
| Memory Bound(%)            | 42.72 |
| CPU-NUM                    | 256   |
+----------------------------+-------+
OS Metrics Summary Data
+------------------+------------+-------------+---------+
| context-switches | migrations | page-faults | CPU-NUM |
+------------------+------------+-------------+---------+
| 309603           | 2277       | 200448      | 256     |
+------------------+------------+-------------+---------+
INSTRUCTION Summary Data
+----------------------------------+-------+
| Metric                           | Value |
+----------------------------------+-------+
| Memory(%)                        | 26.94 |
| Load(%)                          | 23.41 |
| Store(%)                         | 3.53  |
| Integer(%)                       | 49.53 |
| Floating Point(%)                | 0.02  |
| Advanced SIMD(%)                 | 0.13  |
| Crypto(%)                        | 0.0   |
| Branches(%)                      | 23.24 |
| Immediate(%)                     | 21.27 |
| Return(%)                        | 0.84  |
| Indirect(%)                      | 1.14  |
| Barriers(%)                      | 0.08  |
| Instruction Synchronization(%)   | 0.01  |
| Data Synchronization(%)          | 0.0   |
| Data Memory(%)                   | 0.06  |
| Not Retired(%)                   | 0.06  |
+----------------------------------+-------+
Load_avg Summary Data
+--------------+--------------+---------------+
| recent 1 min | recent 5 min | recent 15 min |
+--------------+--------------+---------------+
| 0.56         | 0.44         | 0.35          |
+--------------+--------------+---------------+
Softirqs Summary Data
+----------+----------+---------+---------+---------+
| NET_TX/s | NET_RX/s | BLOCK/s | SCHED/s | CPU-NUM |
+----------+----------+---------+---------+---------+
| 0        | 0        | 0       | 4       | 256     |
+----------+----------+---------+---------+---------+
CPU_stat Summary Data
+----------------+--------------+-------------------+
| ctx_switches/s | interrupts/s | soft_interrupts/s |
+----------------+--------------+-------------------+
| 24710.0        | 58114.0      | 2207.0            |
+----------------+--------------+-------------------+
CPU_freq Summary Data
+--------------+----------+----------+
| current(MHz) | min(MHz) | max(MHz) |
+--------------+----------+----------+
| 2899.65      | 400.0    | 2900.0   |
+--------------+----------+----------+
CPU_percent Summary Data
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
| %user | %nice | %system | %idle | %iowait | %irq | %softirq | %steal | %guest | %guest_nice |
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
| 0.12  | 0.0   | 0.16    | 98.74 | 0.01    | 0.13 | 0.01     | 0.0    | 0.0    | 0.0         |
+-------+-------+---------+-------+---------+------+----------+--------+--------+-------------+
...
...
...
Save statistics and time series data to an Excel file. Please wait.
The report has been saved to /home/test/2025_08_14_15_58_18_report.xlsx
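After the analysis task completes, the tool prints a terminal report and writes an Excel file ("/home/test/2025_08_14_15_58_18_report.xlsx"). The Excel file contains time series data across multiple dimensions (CPU, device, and so on) together with the corresponding time series charts.
- Run the diff command to compare the performance of the workload between two collections and generate a comparison report.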
ksys diff -i /home/test/2025_08_14_15_17_25_report.json /home/test/2025_08_14_16_11_28_report.json -o /home/test
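2025_08_14_15_17_25_report.json and 2025_08_14_16_11_28_report.json are JSON files generated by the ksys collect command. The comparison data is saved to an Excel file in the "/home/test/" directory.
The output of the ksys diff command looks like this (excerpt):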
======================================================================System Info======================================================================
System Architecture diff:
+--------------+-------------------------------+-------------------------------+------+
| Metric       | Before                        | After                         | Diff |
+--------------+-------------------------------+-------------------------------+------+
| Cpu Type     | Kunpeng920 high-performance   | Kunpeng920 high-performance   | N/A  |
| Model Name   | HUAWEI Kunpeng 920 V200 7270Z | HUAWEI Kunpeng 920 V200 7270Z | N/A  |
| Vendor ID    | HiSilicon                     | HiSilicon                     | N/A  |
| Hyper Thread | True                          | True                          | N/A  |
| CPU-NUM      | 256                           | 256                           | N/A  |
+--------------+-------------------------------+-------------------------------+------+
======================================================================CPU Metrics======================================================================
Common Microarchitecture Metrics diff:
+-------------+--------------+--------------+---------+
| Metric      | Before       | After        | Diff    |
+-------------+--------------+--------------+---------+
| IPC         | 0.33         | 0.33         | +0.00%  |
| PATH LENGTH | 101152422747 | 126264627429 | +24.83% |
| MPKI        | 3.62         | 3.18         | -12.15% |
| BPKI        | 1.22         | 0.91         | -25.41% |
| L1D MPKI    | 3.66         | 3.46         | -5.46%  |
| L1I MPKI    | 5.88         | 4.49         | -23.64% |
| L2D MPKI    | 2.59         | 2.13         | -17.76% |
| L2I MPKI    | 0.43         | 0.25         | -41.86% |
| DTLB MPKI   | 4.7          | 3.37         | -28.30% |
| ITLB MPKI   | 0.27         | 0.16         | -40.74% |
+-------------+--------------+--------------+---------+
Topdown diff:
+----------------------------+--------+-------+---------+
| Metric                     | Before | After | Diff    |
+----------------------------+--------+-------+---------+
| Retiring(%)                | 5.83   | 5.06  | -13.21% |
| Frontend Bound(%)          | 11.24  | 8.31  | -26.07% |
| Fetch Bandwidth Bound(%)   | 0.38   | 0.49  | +28.95% |
| Fetch Latency Bound(%)     | 10.86  | 7.82  | -27.99% |
| Bad Speculation(%)         | 0.73   | 0.94  | +28.77% |
| Branch Mispredicts(%)      | 0.58   | 0.72  | +24.14% |
| Machine Clears(%)          | 0.15   | 0.22  | +46.67% |
| Backend Bound(%)           | 82.2   | 85.69 | +4.25%  |
| Core Bound(%)              | 39.48  | 38.03 | -3.67%  |
| Memory Bound(%)            | 42.72  | 47.66 | +11.56% |
+----------------------------+--------+-------+---------+
OS Metrics diff:
+------------------+--------+--------+---------+
| Metric           | Before | After  | Diff    |
+------------------+--------+--------+---------+
| context-switches | 309603 | 313974 | +1.41%  |
| migrations       | 2277   | 2652   | +16.47% |
| page-faults      | 200448 | 120183 | -40.04% |
+------------------+--------+--------+---------+
INSTRUCTION diff:
+----------------------------------+--------+-------+---------+
| Metric                           | Before | After | Diff    |
+----------------------------------+--------+-------+---------+
| Memory(%)                        | 26.94  | 26.86 | -0.30%  |
| Load(%)                          | 23.41  | 24.02 | +2.61%  |
| Store(%)                         | 3.53   | 2.83  | -19.83% |
| Integer(%)                       | 49.53  | 50.1  | +1.15%  |
| Floating Point(%)                | 0.02   | 0.03  | +50.00% |
| Advanced SIMD(%)                 | 0.13   | 0.13  | +0.00%  |
| Crypto(%)                        | 0.0    | 0.0   | +0.00%  |
| Branches(%)                      | 23.24  | 22.74 | -2.15%  |
| Immediate(%)                     | 21.27  | 21.24 | -0.14%  |
| Return(%)                        | 0.84   | 0.66  | -21.43% |
| Indirect(%)                      | 1.14   | 0.83  | -27.19% |
| Barriers(%)                      | 0.08   | 0.07  | -12.50% |
| Instruction Synchronization(%)   | 0.01   | 0.01  | +0.00%  |
| Data Synchronization(%)          | 0.0    | 0.0   | +0.00%  |
| Data Memory(%)                   | 0.06   | 0.06  | +0.00%  |
| Not Retired(%)                   | 0.06   | 0.06  | +0.00%  |
+----------------------------------+--------+-------+---------+
Load_avg diff:
+---------------+--------+-------+---------+
| Metric        | Before | After | Diff    |
+---------------+--------+-------+---------+
| recent 1 min  | 0.56   | 0.4   | -28.57% |
| recent 5 min  | 0.44   | 0.48  | +9.09%  |
| recent 15 min | 0.35   | 0.45  | +28.57% |
+---------------+--------+-------+---------+
Softirqs diff:
+----------+--------+-------+---------+
| Metric   | Before | After | Diff    |
+----------+--------+-------+---------+
| NET_TX/s | 0      | 0     | +0.00%  |
| NET_RX/s | 0      | 0     | +0.00%  |
| BLOCK/s  | 0      | 0     | +0.00%  |
| SCHED/s  | 4      | 3     | -25.00% |
+----------+--------+-------+---------+
CPU_stat diff:
+-------------------+---------+---------+---------+
| Metric            | Before  | After   | Diff    |
+-------------------+---------+---------+---------+
| ctx_switches/s    | 24710.0 | 23522.0 | -4.81%  |
| interrupts/s      | 58114.0 | 55823.0 | -3.94%  |
| soft_interrupts/s | 2207.0  | 1702.0  | -22.88% |
+-------------------+---------+---------+---------+
CPU_freq diff:
+--------------+---------+---------+--------+
| Metric       | Before  | After   | Diff   |
+--------------+---------+---------+--------+
| current(MHz) | 2899.65 | 2899.65 | +0.00% |
| min(MHz)     | 400.0   | 400.0   | +0.00% |
| max(MHz)     | 2900.0  | 2900.0  | +0.00% |
+--------------+---------+---------+--------+
CPU_percent diff:
+-------------+--------+-------+----------+
| Metric      | Before | After | Diff     |
+-------------+--------+-------+----------+
| %user       | 0.12   | 0.08  | -33.33%  |
| %nice       | 0.0    | 0.0   | +0.00%   |
| %system     | 0.16   | 0.17  | +6.25%   |
| %idle       | 98.74  | 98.51 | -0.23%   |
| %iowait     | 0.01   | 0.0   | -100.00% |
| %irq        | 0.13   | 0.14  | +7.69%   |
| %softirq    | 0.01   | 0.01  | +0.00%   |
| %steal      | 0.0    | 0.0   | +0.00%   |
| %guest      | 0.0    | 0.0   | +0.00%   |
| %guest_nice | 0.0    | 0.0   | +0.00%   |
+-------------+--------+-------+----------+
=================================================================Memory Access Metrics=================================================================
DDRC summary diff:
+-----------------------+--------+--------+---------+
| Metric                | Before | After  | Diff    |
+-----------------------+--------+--------+---------+
| Total ddrc_rd_bw MB/s | 651.69 | 540.23 | -17.10% |
| Total ddrc_wr_bw MB/s | 344.07 | 283.02 | -17.74% |
+-----------------------+--------+--------+---------+
NUMA NODE0 diff:
+----------+-----------+-----------+---------+
| Metric   | Before    | After     | Diff    |
+----------+-----------+-----------+---------+
| rx_outer | 740052.0  | 389902.5  | -47.31% |
| rx_sccl  | 1359007.2 | 1281564.6 | -5.70%  |
+----------+-----------+-----------+---------+
...
...
...
========================================================================Top diff=======================================================================
Top diff:
-----------------------------------------------------------------------------------------------------------------
Note: At most 20 Top diffs are listed, please check the generated xlsx file for the rest of report.
+-------------+------------------------------+-------------------+----------+----------+----------+-------------+
| Table Group | Metric Type/Metric Device    | Metric            | Before   | After    | Diff     | Diff(value) |
+-------------+------------------------------+-------------------+----------+----------+----------+-------------+
| NUMA        | NUMA NODE2                   | rx_sccl           | 695016.9 | 209223.9 | -69.90%  | 485793.0    |
| NUMA        | NUMA NODE3                   | rx_outer          | 266775.6 | 690445.5 | +158.81% | 423669.9    |
| Miss        | Miss Latency L2 Miss Latency | cycles_max        | 2507     | 11460    | +357.12% | 8953        |
| IO_info     | IO_info Summary              | Total rkB/s       | 1288.25  | 2723.15  | +111.38% | 1434.9      |
| IO_info     | IO_info Summary              | Total wkB/s       | 1395.23  | 681.08   | -51.19%  | 714.15      |
| IO_info     | IO_info IO Device sda3       | rkB/s             | 429.15   | 907.45   | +111.45% | 478.3       |
| IO_info     | IO_info IO Device dm-0       | rkB/s             | 429.15   | 907.45   | +111.45% | 478.3       |
| IO_info     | IO_info IO Device sda        | rkB/s             | 429.55   | 907.85   | +111.35% | 478.3       |
| Net_info    | Net_info Summary             | Total txpck/s     | 33.4     | 3.4      | -89.82%  | 30.0        |
| Net_info    | Net_info Network Device eno1 | txpck/s           | 33.4     | 3.4      | -89.82%  | 30.0        |
| Net_info    | Net_info Summary             | Total txkB/s      | 26.31    | 0.25     | -99.05%  | 26.06       |
| Net_info    | Net_info Network Device eno1 | txkB/s            | 26.31    | 0.25     | -99.05%  | 26.06       |
| Net_info    | Net_info Summary             | Total rxpck/s     | 30.6     | 5.71     | -81.34%  | 24.89       |
| CPU_percent | CPU_percent                  | %iowait           | 0.01     | 0.0      | -100.00% | 0.01        |
| INSTRUCTION | INSTRUCTION                  | Floating Point(%) | 0.02     | 0.03     | +50.00%  | 0.01        |
+-------------+------------------------------+-------------------+----------+----------+----------+-------------+
Data has been saved to /home/test/2025_08_14_16_12_42_diff.xlsx
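The comparison data is saved to "/home/test/2025_08_14_16_12_42_diff.xlsx". The comparison covers the Summary data of the two collections and ends with a Top diff report that lists the metrics with the largest differences. In this comparison, the Miss latency differs markedly between the two collections: the L2 Miss Latency cycles_max value rises from 2507 to 11460 (+357.12%).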
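The Diff column in the sample output is consistent with the relative change (After - Before) / Before, and the Top diff rows appear to be ordered by the absolute difference in the Diff(value) column. The sketch below reproduces both under those assumptions. This behavior is inferred from the sample output only, not from the tool's source, and the function names are hypothetical.

```python
def diff_percent(before, after):
    """Relative change of `after` versus `before`, formatted like the Diff column."""
    if before == 0:
        # The sample output shows +0.00% when both values are 0. A nonzero value
        # after a zero baseline has no defined percentage; returning N/A for that
        # case is an assumption, since the sample output does not show it.
        return "+0.00%" if after == 0 else "N/A"
    return f"{(after - before) / before * 100:+.2f}%"


def top_diffs(rows, limit=20):
    """Rank (name, before, after) rows by absolute difference, largest first."""
    ranked = sorted(rows, key=lambda r: abs(r[2] - r[1]), reverse=True)
    return [(name, diff_percent(b, a), round(abs(a - b), 2))
            for name, b, a in ranked[:limit]]


# Three rows taken from the Top diff table above, deliberately out of order.
rows = [
    ("Total rkB/s", 1288.25, 2723.15),
    ("L2 Miss Latency cycles_max", 2507, 11460),
    ("NUMA NODE2 rx_sccl", 695016.9, 209223.9),
]
for row in top_diffs(rows):
    print(row)
```

Because the ranking uses absolute values rather than percentages, metrics with large magnitudes such as the NUMA counters come first even when their relative change is smaller than that of cycles_max. It is worth scanning the Diff column as well as the row order.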