
Scale a CelerData cluster

CelerData supports scaling clusters with no downtime. As your workloads grow or shrink, you can review a cluster's details and then decide whether to scale it to maintain the necessary performance levels at minimum cost.

Both classic and elastic clusters support vertical scaling, horizontal scaling, and storage scaling.

NOTE

CelerData supports deploying classic clusters on both AWS Cloud and Azure Cloud, but it supports deploying elastic clusters only on AWS Cloud.

Introduction to scaling operations

Vertical scaling

You can vertically scale your cluster up or down by upgrading or downgrading the instance type of cluster nodes to increase or decrease computing power and storage capacity. Consider a scale-up in the following scenarios:

  • Your workloads are hitting CPU or I/O limits, which increase query latency and decrease concurrency, but storage capacity is sufficient.

  • You need to react quickly to performance issues that cannot be resolved through conventional optimization techniques.

Horizontal scaling

You can horizontally scale your cluster out or in by adding or removing cluster nodes to increase or decrease computing power and storage capacity. Consider a scale-out in the following scenarios:

  • Your workloads are hitting CPU, I/O, or storage limits, which increase query latency and decrease concurrency.

  • You cannot meet your performance requirements even in the highest performance tier of your service.

  • Your data cannot fit into the current number of nodes.

Storage scaling

You can scale the storage of your cluster up or down to accommodate spikes and dips in cluster activity as your data volume changes.

CelerData also supports automatic storage scale-up for FE and BE nodes in classic clusters and for Coordinator Nodes in elastic clusters. If your workload is unpredictable and you cannot allocate a fixed amount of storage at cluster creation time, you can enable storage autoscaling for nodes in your CelerData cluster. With this feature enabled, CelerData automatically scales up the storage size when it detects that the preset storage space is about to run out.

Scale a classic cluster

For a classic cluster, CelerData supports vertical scaling, horizontal scaling, and storage scaling.

Vertical scaling

Take note of the following points:

  • If your cluster uses EBS volumes as storage, the cluster nodes will restart on a rolling basis during a scale-up, and you may experience query or data loading failures. Therefore, we recommend that you perform a scale-up during off-peak hours (see the retry sketch after this list).
  • If your cluster uses instance store volumes as storage, the amount of time taken by a scale-up varies depending on the volume of data in your cluster.
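
Because queries and data loads can fail transiently while nodes restart on a rolling basis, it can help to wrap client calls in a simple retry loop. The sketch below is a minimal Python illustration, assuming a MySQL-protocol client (CelerData clusters are StarRocks-based and accept MySQL-protocol connections); the endpoint, port, and retry budget are illustrative, not prescribed by CelerData.

import time
import pymysql  # assumption: any MySQL-protocol client works; pymysql is one choice

def run_with_retry(sql, host, user, password, retries=5, backoff_s=2.0):
    """Run a query, retrying the transient failures that can occur
    while cluster nodes restart during a scale-up."""
    for attempt in range(retries):
        try:
            # Port 9030 is a typical StarRocks FE query port; use your
            # cluster's actual endpoint and port.
            conn = pymysql.connect(host=host, port=9030, user=user, password=password)
            try:
                with conn.cursor() as cur:
                    cur.execute(sql)
                    return cur.fetchall()
            finally:
                conn.close()
        except pymysql.err.OperationalError:
            if attempt == retries - 1:
                raise  # out of retries; surface the failure
            time.sleep(backoff_s * (attempt + 1))  # back off before retrying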

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    • You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.
    • If you are scaling a cluster created in your Free Developer Tier, a dialog box is displayed, prompting you to unlock the cluster before you can continue. For more information, see Use Free Developer Tier.
  4. On the page that appears, select the type of node that you want to scale from the Node type drop-down list, select Scale up/down from the Operation type drop-down list, select the instance type that you want to scale to, and then click Subscribe.

  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to launch instances of the new instance type and migrate your data and workloads from the original instances to the new instances. During this period, you are still charged based on the original instance type.

    When the scaling operation is complete, the cluster returns to the Running state.

Horizontal scaling

For a scale-out or scale-in, you can set the number of FE nodes only to 1, 3, or 5. For BE nodes, we recommend a count of at least 3 in production environments. A quick validation sketch of these constraints follows this paragraph.
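
As a pre-flight check, the following Python sketch validates a planned topology against these rules before you submit a change. The function name and the production flag are hypothetical helpers, not part of any CelerData API.

VALID_FE_COUNTS = {1, 3, 5}  # the FE node count can only be 1, 3, or 5

def validate_node_counts(fe_count, be_count, production=True):
    """Check a planned classic-cluster topology against the documented rules."""
    if fe_count not in VALID_FE_COUNTS:
        raise ValueError(f"FE node count must be one of {sorted(VALID_FE_COUNTS)}, got {fe_count}")
    if production and be_count < 3:
        # Fewer than 3 BE nodes is allowed, but not recommended for production.
        print(f"warning: {be_count} BE node(s) is below the recommended minimum of 3 for production")

validate_node_counts(fe_count=3, be_count=3)  # a typical production topology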

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    • You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.
    • If you are scaling a cluster created in your Free Developer Tier, a dialog box is displayed, prompting you to unlock the cluster before you can continue. For more information, see Use Free Developer Tier.
  4. On the page that appears, select the type of node that you want to scale from the Node type drop-down list, select Scale in/out from the Operation type drop-down list, specify the number of nodes that you want to have, and then click Subscribe.

  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to launch or release instances of the current instance type. During this period, you are still charged based on the original number of nodes.

    When the scaling operation is complete, the cluster returns to the Running state.

Storage scaling

Manual scaling

For BE nodes, you can only scale the disk size. For FE nodes, you can edit the disk IOPS and throughput of the disks in addition to the disk size.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    • You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.
    • If you are scaling a cluster created in your Free Developer Tier, a dialog box is displayed, prompting you to unlock the cluster before you can continue. For more information, see Use Free Developer Tier.
  4. On the page that appears, select the type of node whose storage you want to scale from the Node type drop-down list, and select Edit storage from the Operation type drop-down list. Then, specify the Disk size for the storage you want to scale (and, if you have selected FE node as the Node type, also the Disk IOPS and Disk throughput), and click Subscribe. A validation sketch encoding the minimums in the note below follows this procedure.

    NOTE

    • The minimum disk IOPS per FE node is 3000.
    • The minimum disk throughput per FE node is 150 MB/s.
    • The minimum disk size per FE node is 30 GB.
    • The minimum storage size per BE node is 500 GB.
  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to provision or release storage resources. During this period, you are still charged based on the original storage size.

    When the scaling operation is complete, the cluster returns to the Running state.
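
The sketch below encodes the minimums from the note above as a hypothetical pre-flight check; the constant and function names are illustrative, and the console performs its own validation regardless.

# Documented per-node minimums for classic-cluster storage scaling.
FE_MIN_IOPS = 3000             # minimum disk IOPS per FE node
FE_MIN_THROUGHPUT_MBPS = 150   # minimum disk throughput per FE node, in MB/s
FE_MIN_DISK_GB = 30            # minimum disk size per FE node, in GB
BE_MIN_DISK_GB = 500           # minimum storage size per BE node, in GB

def validate_storage_request(node_type, disk_gb, iops=None, throughput_mbps=None):
    """Validate an Edit storage request against the documented minimums."""
    if node_type == "FE":
        if iops is not None and iops < FE_MIN_IOPS:
            raise ValueError(f"FE disk IOPS must be at least {FE_MIN_IOPS}")
        if throughput_mbps is not None and throughput_mbps < FE_MIN_THROUGHPUT_MBPS:
            raise ValueError(f"FE disk throughput must be at least {FE_MIN_THROUGHPUT_MBPS} MB/s")
        if disk_gb < FE_MIN_DISK_GB:
            raise ValueError(f"FE disk size must be at least {FE_MIN_DISK_GB} GB")
    elif node_type == "BE":
        # For BE nodes, only the disk size can be scaled; IOPS and
        # throughput are not editable.
        if disk_gb < BE_MIN_DISK_GB:
            raise ValueError(f"BE storage size must be at least {BE_MIN_DISK_GB} GB")
    else:
        raise ValueError(f"unknown node type: {node_type!r}")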

Storage Autoscaling

You can define a storage autoscaling policy for FE and BE nodes. CelerData monitors the storage usage of the nodes and automatically scales up the storage when it detects that the usage has reached a predefined threshold.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to set a storage autoscaling policy for.

  3. On the cluster details page, click the Resource Scheduling tab.

  4. In the Storage autoscaling policy section, click Edit.

  5. In the dialog box that appears, configure the storage autoscaling policy as follows:

    1. Turn on the switch next to FE storage or BE storage to enable storage autoscaling for FE or BE nodes, respectively.

    2. Set the storage usage threshold (as a percentage) that triggers an autoscaling operation. You can set this threshold between 80% and 90%. When the storage usage of a node stays at or above this threshold for more than five minutes, CelerData scales up the node's storage by the step size you define in the next substep (see the sketch after the notes below).

    3. Set the step size of each autoscaling operation. You can define the step size as a fixed size (in GB) or as a percentage of the original storage size, for example, 50 GB or 15%.

    4. Set the maximum storage size of each node. CelerData stops scaling up the storage once its size reaches this maximum.

  6. Click Submit to save the policy.

NOTE

  • A minimum interval of six hours is required between two consecutive scaling operations (including manual scaling and autoscaling).
  • Currently, Azure-based clusters do not support storage autoscaling.
  • The maximum size of each storage volume is 16 TB.
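
To make the interaction of the threshold, step size, maximum size, and cooldown concrete, here is a minimal Python sketch of the scale-up decision implied by the policy and notes above. The function and parameter names are hypothetical and do not correspond to any CelerData API.

MAX_VOLUME_GB = 16 * 1024   # each storage volume is capped at 16 TB
COOLDOWN_S = 6 * 3600       # at least six hours between two scaling operations
SUSTAIN_S = 5 * 60          # usage must stay at the threshold for over five minutes

def next_storage_size(current_gb, original_gb, usage_pct, sustained_s,
                      since_last_scale_s, threshold_pct,
                      step_gb=None, step_pct=None, max_gb=MAX_VOLUME_GB):
    """Return the new storage size in GB, or None if no scale-up should occur."""
    if not 80 <= threshold_pct <= 90:
        raise ValueError("the threshold must be between 80% and 90%")
    if usage_pct < threshold_pct or sustained_s <= SUSTAIN_S:
        return None  # threshold not reached, or not sustained long enough
    if since_last_scale_s < COOLDOWN_S:
        return None  # still within the six-hour cooldown
    if current_gb >= max_gb:
        return None  # already at the configured per-node maximum
    # The step is either a fixed size or a percentage of the original size.
    step = step_gb if step_gb is not None else original_gb * step_pct / 100
    return min(current_gb + step, max_gb, MAX_VOLUME_GB)

# Example: a 1000 GB volume at 85% usage for 10 minutes, scaled in 15% steps.
print(next_storage_size(1000, 1000, 85, 600, 7 * 3600, 85, step_pct=15))  # 1150.0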

Scale an elastic cluster

For an elastic cluster, CelerData supports vertical scaling, horizontal scaling, and Coordinator Node storage scaling. You can also enable autoscaling for each warehouse to allow the system to automatically scale the number of Compute Nodes based on the CPU utilization of the warehouse.

Vertical scaling

Take note of the following points:

  • If your cluster uses EBS volumes as storage, the cluster nodes will restart on a rolling basis during a scale-up and you may experience query or data loading failures. Therefore, we recommend that you perform a scale-up during off-peak hours.
  • If your cluster uses instance store volumes as storage, the amount of time taken by a scale-up varies depending on the volume of data in your cluster.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.

  4. On the page that appears, select the type of node that you want to scale from the Node type drop-down list, select Scale up/down from the Operation type drop-down list, and select the name of the warehouse if you have selected Compute Node as the Node type. Then, select the instance type that you want to scale to, and click Subscribe.

  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to launch instances of the new instance type and migrate your data and workloads from the original instances to the new instances. During this period, you are still charged based on the original instance type.

    When the scaling operation is complete, the cluster returns to the Running state.

Horizontal scaling

For a scale-out or scale-in, you can set the number of Coordinator Nodes only to 1, 3, or 5. You can edit the number of Compute Nodes in a warehouse.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.

  4. On the page that appears, select the type of node that you want to scale from the Node type drop-down list, select Scale in/out from the Operation type drop-down list, and select the name of the warehouse if you have selected Compute Node as the Node type. Then, specify the number of nodes that you want, and click Subscribe.

  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to launch or release instances of the current instance type. During this period, you are still charged based on the original number of nodes.

    When the scaling operation is complete, the cluster returns to the Running state.

Storage scaling

Manual Scaling

You can scale the storage only for Coordinator Nodes. In addition to the disk size, you can edit the disk IOPS and throughput of the disks.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to scale.

  3. On the cluster details page, click Manage and choose Edit cluster.

    NOTE

    You can only scale clusters that are in the Running state. If a cluster is not in the Running state, the Edit cluster menu item is disabled.

  4. On the page that appears, select Coordinator Node from the Node type drop-down list, select Edit storage from the Operation type drop-down list, specify the Disk IOPS, Disk throughput, and Disk size for the storage you want to scale, and then click Subscribe.

    NOTE

    • The minimum disk IOPS per Coordinator Node is 3000.
    • The minimum disk throughput per Coordinator Node is 150 MB/s.
    • The minimum disk size per Coordinator Node is 30 GB.
  5. In the message that appears, confirm your scaling settings and click Subscribe.

    The cluster enters the Updating state.

    CelerData requires some time to provision or release storage resources. During this period, you are still charged based on the original storage size.

    When the scaling operation is complete, the cluster returns to the Running state.

Storage Autoscaling

You can define a storage autoscaling policy for Coordinator Nodes. CelerData monitors the storage usage of the nodes and automatically scales up the storage when it detects that the usage has reached a predefined threshold.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the cluster that you want to set a storage autoscaling policy for.

  3. On the cluster details page, click the Resource Scheduling tab.

  4. In the Storage autoscaling policy section, click Edit.

  5. In the dialog box that appears, configure the storage autoscaling policy as follows:

    1. Turn on the switch next to Coordinator Storage to enable storage autoscaling.

    2. Set the storage usage threshold (as a percentage) that triggers an autoscaling operation. You can set this threshold between 80% and 90%. When the storage usage of a node stays at or above this threshold for more than five minutes, CelerData scales up the node's storage by the step size you define in the next substep.

    3. Set the step size of each autoscaling operation. You can define the step size as a fixed size (in GB) or as a percentage of the original storage size, for example, 50 GB or 15%.

    4. Set the maximum storage size of each node. CelerData stops scaling up the storage once its size reaches this maximum.

  6. Click Submit to save the policy.

NOTE

  • A minimum interval of six hours is required between two consecutive scaling operations (including manual scaling and autoscaling).
  • Currently, Azure-based clusters do not support storage autoscaling.
  • The maximum size of each storage volume is 16 TB.
  • Compute Nodes do not support storage autoscaling.

Compute Autoscaling

You can define autoscaling policies for each warehouse to allow it to adaptively adjust the number of Compute Nodes it contains. CelerData assesses the CPU utilization of the warehouse in real time and scales the Compute Nodes in or out based on the policies you have defined, helping you maintain steady, predictable performance at the lowest possible cost. A sketch of this decision logic follows the steps below.

Follow these steps:

  1. Sign in to the CelerData Cloud BYOC console.

  2. On the Clusters page, click the elastic cluster where the warehouse that you want to enable autoscaling resides.

  3. On the Warehouses tab of the cluster details page, hover over the lower-right corner of the warehouse card to display the View more details button, and then click it.

  4. Click the Resource Scheduling tab. Then, click Edit in the Autoscaling Policy section.

  5. In the Edit autoscaling policy dialog box, turn on the switch next to Autoscaling, and specify the Scaling range (the minimum and maximum number of Compute Nodes in the warehouse). Then, set the Scale out and Scale in policies as follows:

    a. In the Scale out policy section, set the CPU utilization upper limit, the time threshold, and the number of Compute Nodes to be scaled out each step.

    b. In the Scale in policy section, set the CPU utilization lower limit, the time threshold, and the number of Compute Nodes to be scaled in each step.

    NOTE

    • The lower bound of the Scaling range is 1, and the upper bound is 100.
    • Autoscaling policies take effect only when the warehouse is running.
    • The Scale out policy takes effect only when the current Compute Node count is less than the upper bound of the Scaling range you have defined.
    • The Scale in policy takes effect only when the current Compute Node count is greater than the lower bound of the Scaling range you have defined.
    • The CPU utilization upper limit in the Scale out policy must be greater than the CPU utilization lower limit in the Scale in policy.
    • To avoid significant fluctuations in cluster performance, only a maximum of two Compute Nodes can be scaled in per step.
  6. Click Save changes to save the policies.
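
To summarize how these rules combine, the following Python sketch models one evaluation of a warehouse's autoscaling policies. The policy dict fields mirror the console settings described above but are hypothetical names, not a CelerData API.

MAX_SCALE_IN_STEP = 2  # at most two Compute Nodes can be scaled in per step

def next_node_count(current, cpu_pct, sustained_s, policy):
    """Return the new Compute Node count for a warehouse, or None for no change."""
    lo, hi = policy["min_nodes"], policy["max_nodes"]
    if not 1 <= lo <= hi <= 100:
        raise ValueError("the Scaling range must lie between 1 and 100")
    if policy["out_cpu_upper"] <= policy["in_cpu_lower"]:
        raise ValueError("the scale-out CPU upper limit must exceed the scale-in CPU lower limit")
    # Scale out: CPU has stayed above the upper limit for the time threshold,
    # and the node count is still below the upper bound of the Scaling range.
    if (cpu_pct > policy["out_cpu_upper"] and sustained_s >= policy["out_time_s"]
            and current < hi):
        return min(current + policy["out_step"], hi)
    # Scale in: CPU has stayed below the lower limit for the time threshold,
    # and the node count is still above the lower bound of the Scaling range.
    if (cpu_pct < policy["in_cpu_lower"] and sustained_s >= policy["in_time_s"]
            and current > lo):
        return max(current - min(policy["in_step"], MAX_SCALE_IN_STEP), lo)
    return None  # within limits; leave the warehouse unchanged

policy = {"min_nodes": 1, "max_nodes": 10, "out_cpu_upper": 80, "out_time_s": 300,
          "out_step": 2, "in_cpu_lower": 30, "in_time_s": 300, "in_step": 2}
print(next_node_count(4, 90, 600, policy))  # 6: sustained high CPU triggers a scale-out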