Kunpeng Topology Affinity Plugin

Currently, most cloud service providers need to deploy a large number of containers on a server, and the number of deployed containers even exceeds the number of physical cores of the server. In this scenario, the Kunpeng Topology Affinity Plugin (Kunpeng TAP) can automatically adjust the pod deployment range based on the node resource usage to ensure NUMA affinity for container resources like CPUs and GPUs and prevent cross-NUMA memory access of service containers.

The Kunpeng TAP runs at the node layer of the Kubernetes cluster. It is deployed through pods that use DaemonSet (Daemon pods), which ensures a plugin instance on each node. This plugin interconnects with the NRI of containerd to dynamically adjust the CPU scheduling range of a pod, securing effective NUMA affinity.

The following figure shows how the NUMA affinity scheduling plugin works based on the NRI function of the containerd runtime.

Figure 1 Working principle of the Kunpeng TAP in NRI mode

In addition, to be compatible with Docker and containerd in earlier versions of Kubernetes clusters, the Kunpeng TAP can obtain container requests and return responses as a proxy, and optimize and adjust resource parameters during request delivery to implement NUMA affinity. The following figure shows how the Kunpeng TAP works.

Figure 2 Working principle of the Kunpeng TAP

Constraints

The Kunpeng TAP has different usage restrictions and version requirements for the containerd and Docker scenarios. For details, see Table 1.

**Table 1** Constraints
Plugin Type	Version Requirement
Plugin Type	Kubernetes	containerd	Docker	Remarks
Kunpeng TAP (containerd, with NRI enabled)	1.28.4	1.7.0 or later	-	In the containerd scenario, use containerd as the container runtime.
Kunpeng TAP (Docker/containerd)	1.23.6	1.6.8+	20.10.14	In the Docker scenario, use Dockershim as the runtime communication component. containerd is also supported.

Parent topic: Cloud Native Resource Affinity and Isolation