Storage Capacity Balance Strategy¶
As of now, Watcher optimizes only compute nodes. Storage optimization is also an important feature for centralized storage (non distributed storage).
This spec will add Storage Capacity Balance Strategy to balance the storage capacity, which can be also considered a way to balance the storage workload. And we can use existing goal(workload_balancing) and action(volume_migrate) for this storage capacity balance.
In current Data Center, the capacity of storage back-end may be not balanced, some are extremely high, some are idle. This situation will degrade the performance of I/O Read/Write, which will finally affect the QoS. This problem can be solved by storage capacity balance strategy.
This strategy migrates volumes based on the capacity utilization of the cinder pools. It makes decision to migrate a volume whenever a pool’s capacity utilization % is higher than the specified threshold. The migration of a volume should make the capacity utilization of the pool where it locates lower than the storage capacity utilization threshold.
As an administrator, I want to be able to trigger an audit that controls the storage capacity utilization below a certain threshold.
Extend base strategy classes to add one new strategy - ” Storage Capacity Balance Strategy”
Use Cinder client to get all volumes with status in available or in-use and no snapshots, and to get all pools except the pools listed as exclude_pools in the configuration file.
Group volume pools into two categories: underload or overload pools according to threshold:
under_pools = list(filter(lambda p: float(p.total_capacity_gb) - float(p.free_capacity_gb) < float(p.total_capacity_gb) * threshold, pools)) over_pools = list(filter(lambda p: float(p.total_capacity_gb) - float(p.free_capacity_gb) >= float(p.total_capacity_gb) * threshold, pools))
Determine migrate_volumes, source pools and destination pools based on some factors:
whether a volume is mounted to a VM
whether a volume is a mirrored volume
Data model impact¶
REST API impact¶
Other end user impact¶
Other deployer impact¶
- Primary assignee:
Define proper threshold
Write the execute function to locate pool overloaded
Function to generate actions:volume_migrate or volume_retype.
Unit and functional test are needed.
Add docs on how to use this strategy.