VM Single-Core and Single-Page Exception Handling
To keep Internet systems running stably over the long term, Reliability, Availability, and Serviceability (RAS) capabilities are required to identify faulty hardware on the physical machine and process the corresponding error information, minimizing the scope of impact. After the single-core and single-page exception handling feature is enabled, a single core that reports corrected errors (CEs) can be isolated online on Kunpeng servers without affecting running services, and an uncorrected error (UE) in a single memory page affects only one process in a VM, so the VM does not go offline.
Specifications
Supported VM specifications include but are not limited to 2 vCPUs with 8 GB memory, 4 vCPUs with 8 GB memory, 4 vCPUs with 16 GB memory, 8 vCPUs with 16 GB memory, 16 vCPUs with 32 GB memory, and 32 vCPUs with 64 GB memory.
Version Requirements
- Versions: openEuler 24.03 LTS SP3, QEMU 8.2.0, and libvirt 9.10.0-26.oe2403sp3 or later.
- License: none.
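The version requirement above can be checked with a short script. The following is a minimal sketch, assuming versions follow the `X.Y.Z-release` pattern shown above. The helper names `parse_version` and `meets_minimum` are hypothetical, and the comparison is a simplification that does not implement full RPM version ordering.

```python
import re


def parse_version(text):
    """Split 'X.Y.Z[-N...]' into ((X, Y, Z), N); N defaults to 0 when absent."""
    match = re.match(r"(\d+(?:\.\d+)*)(?:-(\d+))?", text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    upstream = tuple(int(part) for part in match.group(1).split("."))
    release = int(match.group(2)) if match.group(2) else 0
    return upstream, release


def meets_minimum(installed, minimum):
    """Return True if the installed version is at least the minimum version."""
    return parse_version(installed) >= parse_version(minimum)


if __name__ == "__main__":
    # Minimum versions taken from the list above.
    print(meets_minimum("9.10.0-26.oe2403sp3", "9.10.0-26.oe2403sp3"))
    print(meets_minimum("8.2.0", "8.2.0"))
```

The installed version strings would normally come from the package manager on the host; they are hard-coded here only to keep the sketch self-contained.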
Constraints
- The application environment must meet the hardware and software requirements.
- Single-page memory error handling is supported only when the physical machine uses 4 KB pages.
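The 4 KB page constraint can be verified on the physical machine before the feature is enabled. The following is a minimal sketch that reads the base page size through the standard `os.sysconf` call. The function names and the `REQUIRED_PAGE_SIZE` constant are hypothetical and are not part of the feature itself.

```python
import os

# 4 KB base page size, as stated in the constraint above.
REQUIRED_PAGE_SIZE = 4096


def host_page_size():
    """Return the base memory page size of the current machine, in bytes."""
    return os.sysconf("SC_PAGE_SIZE")


def supports_single_page_handling(page_size):
    """Return True if the page size satisfies the 4 KB constraint."""
    return page_size == REQUIRED_PAGE_SIZE


if __name__ == "__main__":
    size = host_page_size()
    if supports_single_page_handling(size):
        print(f"Page size is {size} bytes: the 4 KB constraint is met.")
    else:
        print(f"Page size is {size} bytes: the 4 KB constraint is not met.")
```

Run the script on the physical machine rather than inside a VM, because the constraint applies to the host page size.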
Application Scenarios
In typical cloud computing scenarios, when a memory hardware error occurs, the affected VM can be recovered without interrupting other services.