Cache Drive Fault
The BoostIO distributed cache layer uses NVMe SSDs as tier-2 cache media to persist data in the write and read caches; it needs to effectively handle cache drive faults.
Scenario |
Impact |
Handling Method |
Remarks |
|---|---|---|---|
Adding a new drive |
During drive adding, the front-end I/O performance decreases temporarily, and the service interruption duration does not exceed 60 seconds. |
Adds and identifies the newly added drive, updates the configuration file, reports the drive addition event, triggers view rebalancing, evicts cache data, and initiates the cache. |
|
Faulty drive removal |
During fault detection and removal, the front-end I/O performance decreases, and the service interruption duration does not exceed 60 seconds. |
Reports the drive fault to the cluster management module, completes data eviction from affected partitions, reports the completion, and triggers partition view recalculation and release (during which I/Os for the failed partitions are automatically retried). |
|