Connect from a client application to a CelerData cluster
You can connect to CelerData clusters from your client application to query and visualize the data in your data lakes and relational databases.
The connection methods vary depending on your cloud provider. Currently, CelerData supports AWS and Azure.
AWS
Connect from a client application on a public network
If you want to connect your client applications on a public network to your CelerData cluster, you must specify the public subnet ID, and configure the security group with inbound rules that allow client access.
Follow these steps to add an inbound rule to the security group of the VPC that hosts your CelerData cluster:
Sign in to the AWS VPC console as a user with administrator privileges.
In the upper-right corner of the page, select your AWS region.
In the left-side navigation pane, choose Security > Security groups.
On the Security groups page, click the security group ID of the VPC that hosts your CelerData cluster.
On the detail page of the security group, click Edit inbound rules on the Inbound rules tab.
On the Edit inbound rules page, click Add rule to add an inbound rule.
Edit the inbound rule:
If you want to expose your CelerData cluster to all applications on public networks, you can edit it as follows:
a. Set Type to All traffic.
b. Set Source to Anywhere-IPv4.
If you want to expose your CelerData cluster to a specific application on public networks, you can edit it as follows:
a. Set Type to All traffic.
b. Set Source to Custom and specify the IP address of your application.
Click Save rules.
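The same inbound rule can also be expressed programmatically. The following is a minimal sketch, not an official CelerData procedure: it only builds the request parameters that the boto3 EC2 client's `authorize_security_group_ingress` call accepts, and does not contact AWS. The helper name `build_ingress_rule`, the security group ID, and the client IP address are hypothetical placeholders.

```python
import ipaddress


def build_ingress_rule(group_id, source_ip=None):
    """Build keyword arguments for an 'All traffic' inbound rule.

    source_ip=None mirrors the 'Anywhere-IPv4' source; otherwise the rule
    is limited to the single address given (the 'Custom' source).
    """
    if source_ip is None:
        cidr = "0.0.0.0/0"
    else:
        # Validate the address and express it as a /32 CIDR block.
        cidr = f"{ipaddress.IPv4Address(source_ip)}/32"
    return {
        "GroupId": group_id,
        "IpPermissions": [
            # IpProtocol "-1" means all protocols and ports ("All traffic").
            {"IpProtocol": "-1", "IpRanges": [{"CidrIp": cidr}]}
        ],
    }


# Hypothetical security group ID and client address, for illustration only.
rule = build_ingress_rule("sg-0123456789abcdef0", source_ip="203.0.113.10")
print(rule)

# To apply the rule (assumes boto3 is installed and AWS credentials are set):
#   import boto3
#   boto3.client("ec2").authorize_security_group_ingress(**rule)
```

Restricting the source to a single address, as in the second option above, exposes the cluster to far fewer hosts than Anywhere-IPv4.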
Connect from a client application on a private subnet
If you want to connect to your CelerData cluster from a client application that is in a custom VPC, you must configure the VPC that hosts the FEs in your CelerData cluster to allow the client application to request port 9030.
You can connect to your CelerData cluster from a client application in the same VPC that hosts the FE nodes, or from another custom VPC through a VPC peering connection.
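Before configuring a client, it can help to confirm that port 9030 on an FE is reachable from the client's network. The sketch below uses only the Python standard library to attempt a TCP connection; the function name `can_reach` and the default timeout are illustrative assumptions, and a successful connection shows only that the port is open, not that authentication will succeed.

```python
import socket


def can_reach(host, port=9030, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False
```

For example, `can_reach("10.0.1.25")` (a hypothetical FE private IP address) returning `False` suggests that the security group or peering configuration still blocks the client.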
Create a VPC peering connection
Follow these steps to create a VPC peering connection:
- Sign in to the AWS VPC console as a user with administrator privileges.
- In the navigation pane, choose Virtual private cloud > Peering connections.
- Choose Create peering connection.
- Configure the information as follows, and choose Create Peering Connection when you are done:
- Peering connection name tag: You can optionally name your VPC peering connection.
- VPC (Requester): Select the VPC in your account with which to create the VPC peering connection.
- Account: Choose My account to create a peering connection with a VPC within this account, or choose Another account to create a peering connection with a VPC from another account.
- Account ID: Enter the AWS account ID of the owner of the accepter VPC if you choose Another account.
- Region: Choose This Region if the accepter VPC resides in the same region. If it resides in a different region, choose Another region, and select the region in which the accepter VPC resides.
- VPC (Accepter): Enter the ID of the VPC with which to create the VPC peering connection.
- In the confirmation dialog box, choose OK.
Accept a peering connection
Follow these steps to accept a VPC peering connection:
- Sign in to the AWS VPC console as a user with administrator privileges.
- In the upper-right corner of the page, select the region of the accepter VPC.
- In the navigation pane, choose Virtual private cloud > Peering connections.
- Select the pending VPC peering connection (the status is pending-acceptance), and choose Actions, Accept Request.
- In the confirmation dialog box, choose Yes, Accept.
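AWS does not allow a peering connection between VPCs whose CIDR blocks overlap, so it is worth checking the two address ranges before you create the request. The following sketch uses Python's standard `ipaddress` module; the function name `cidrs_overlap` and the CIDR blocks are hypothetical examples.

```python
import ipaddress


def cidrs_overlap(cidr_a, cidr_b):
    """Return True if the two CIDR blocks share any addresses."""
    a = ipaddress.ip_network(cidr_a)
    b = ipaddress.ip_network(cidr_b)
    return a.overlaps(b)


# Hypothetical requester and accepter VPC CIDR blocks.
print(cidrs_overlap("10.0.0.0/16", "10.1.0.0/16"))    # False: peering is possible
print(cidrs_overlap("10.0.0.0/16", "10.0.128.0/20"))  # True: ranges collide
```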
Azure
Connect from a client application on a public network
To allow your client application on a public network to access your CelerData cluster deployed on Azure, follow these steps to add an inbound security rule to the network security group of the virtual network that hosts your CelerData cluster:
Sign in to the Microsoft Azure portal.
Navigate to the Network security groups service page, and click the network security group that your CelerData cluster belongs to.
On the page that appears, choose Inbound security rules in the left-side pane, and click Add to add a new inbound security rule.
In the right-side pane that appears, configure the inbound security rule:
If you want to expose your CelerData cluster to all applications on public networks, you can edit it as follows:
a. Select Any from the Source drop-down list.
b. Enter * in the Source port ranges field.
c. Enter 9030 in the Destination port ranges field.
If you want to expose your CelerData cluster to a specific application on public networks, you can edit it as follows:
a. Select IP Address from the Source drop-down list.
b. Specify the IP address of your application in the Source IP addresses/CIDR ranges field.
c. Enter * in the Source port ranges field.
d. Enter 9030 in the Destination port ranges field.
For other fields, you can use the default settings.
Click Add to add the inbound security rule.
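For reference, the portal fields above correspond to properties of an Azure network security rule resource. The sketch below builds those properties as a plain Python dictionary and does not call any Azure API. The helper name `build_inbound_rule` is hypothetical, and the `protocol`, `access`, `destinationAddressPrefix`, and `priority` values are assumptions standing in for the portal defaults, which may differ in your environment.

```python
def build_inbound_rule(source="*", priority=100):
    """Return security rule properties that allow inbound access to port 9030.

    source="*" mirrors selecting Any as the source; pass an IP address or
    CIDR range to mirror the IP Addresses option.
    """
    return {
        "direction": "Inbound",
        "access": "Allow",                # assumed default action
        "protocol": "*",                  # assumed default: any protocol
        "sourceAddressPrefix": source,
        "sourcePortRange": "*",           # Source port ranges: *
        "destinationAddressPrefix": "*",  # assumed default destination
        "destinationPortRange": "9030",   # Destination port ranges: 9030
        "priority": priority,             # assumed value; must be unique per group
    }


# Hypothetical client address, for illustration only.
print(build_inbound_rule(source="203.0.113.10"))
```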
Connect from a client application on a private subnet
If you want to connect to your CelerData cluster from a client application that is in a custom virtual network, you must configure the virtual network that hosts the FEs in your CelerData cluster to allow the client application to request port 9030.
You can connect to your CelerData cluster from a client application in the same virtual network that hosts the FE nodes, or from another custom virtual network through virtual network peering.
Create a virtual network peering
Follow these steps to create a virtual network peering:
Sign in to the Microsoft Azure portal.
Navigate to the Virtual networks service page, and click the virtual network that hosts your CelerData cluster.
On the page that appears, choose Peerings in the left-side pane, and click Add to add a virtual network peering.
On the Add peering page, configure the virtual network peering:
a. Enter a name for the peering link in the Peering link name field for This virtual network.
b. Choose Allow '<your_virtual_network_name>' to access the peered virtual network.
c. Enter a name for the peering link in the Peering link name field for Remote virtual network.
d. Select the deployment model of the virtual network you want to peer with in the Virtual network deployment model section.
e. Select the subscription of the virtual network you want to peer with from the Subscription drop-down list.
f. Select the virtual network you want to peer with from the Virtual network drop-down list.
NOTE
If you do not have read access to the virtual network or subscription you want to peer with, but you know the resource ID of the virtual network, choose I know my resource ID and specify the resource ID of the virtual network you want to peer with in the Resource ID field.
g. Choose Allow the peered virtual network to access '<your_virtual_network_name>'.
Click Add to add the virtual network peering.