CREATE MATERIALIZED VIEW
CREATE MATERIALIZED VIEW creates a materialized view. For usage information about materialized views, see Synchronous materialized view and Asynchronous materialized view.
CAUTION
- Only users with the CREATE MATERIALIZED VIEW privilege in the database where the base table resides can create a materialized view.
- From v3.4.0 onwards, StarRocks supports creating synchronous materialized views in shared-data clusters.
Creating a materialized view is an asynchronous operation. Running this command successfully indicates that the task of creating the materialized view is submitted successfully. You can view the building status of a synchronous materialized view in a database via SHOW ALTER MATERIALIZED VIEW command, and view that of an asynchronous materialized view by querying the metadata views tasks and task_runs in Information Schema.
StarRocks supports asynchronous materialized views from v2.4. The major differences between asynchronous materialized views and synchronous materialized views in previous versions are as follows:
| Single-table aggregation | Multi-table join | Query rewrite | Refresh strategy | Base table | |
|---|---|---|---|---|---|
| ASYNC MV | Yes | Yes | Yes | 
 | Multiple tables from: 
 | 
| SYNC MV (Rollup) | Limited choices of aggregate functions | No | Yes | Synchronous refresh during data loading | Single table in the default catalog | 
Synchronous materialized view
Syntax
CREATE MATERIALIZED VIEW [IF NOT EXISTS] [database.]<mv_name>
[COMMENT ""]
[PROPERTIES ("key"="value", ...)]
AS 
<query_statement>
Parameters in brackets [] are optional.
Parameters
mv_name (required)
The name of the materialized view. The naming requirements are as follows:
- The name must consist of letters (a-z or A-Z), digits (0-9), or underscores (_), and it can only start with a letter.
- The length of the name cannot exceed 64 characters.
- The name is case-sensitive.
COMMENT (optional)
Comment on the materialized view. Note that COMMENT must be placed after mv_name. Otherwise, the materialized view cannot be created.
query_statement (required)
The query statement to create the materialized view. Its result is the data in the materialized view. The syntax is as follows:
SELECT select_expr[, select_expr ...]
[WHERE where_expr]
[GROUP BY column_name[, column_name ...]]
[ORDER BY column_name[, column_name ...]]
- 
select_expr (required) All columns in the query statement, that is, all columns in the materialized view schema. This parameter supports the following values: - 
Simple columns or aggregate columns such as SELECT a, abs(b), min(c) FROM table_a, wherea,b, andcare the names of columns in the base table. If you do not specify column names for the materialized view, StarRocks automatically assigns names to the columns.
- 
Expressions such as SELECT a+1 AS x, b+2 AS y, c*c AS z FROM table_a, wherea+1,b+2andc*care the expressions that reference the columns in the base tables, andx,yandzare the aliases assigned to the columns in the materialized view.
 NOTE - You must specify at least one column in select_expr.
- When creating a synchronous materialized view with an aggregate function, you must specify the GROUP BY clause, and specify at least one GROUP BY column in select_expr.
- Synchronous materialized views do not support clauses such as JOIN and the HAVING clause of GROUP BY.
- From v3.1 onwards, each synchronous materialized view can support more than one aggregate function for each column of the base table, for example, query statements such as select b, sum(a), min(a) from table group by b.
- From v3.1 onwards, synchronous materialized views support complex expressions for SELECT and aggregate functions, for example, query statements such as select b, sum(a + 1) as sum_a1, min(cast (a as bigint)) as min_a from table group by borselect abs(b) as col1, a + 1 as col2, cast(a as bigint) as col3 from table. The following restrictions are imposed on the complex expression used for synchronous materialized views:- Each complex expression must have an alias and different aliases must be assigned to different complex expressions among all the synchronous materialized views of a base table. For example, query statements select b, sum(a + 1) as sum_a from table group by bandselect b, sum(a) as sum_a from table group by bcannot be used to create synchronous materialized views for a same base table. You can set different aliases for a complex expression.
- You can check whether your queries are rewritten by the synchronous materialized views created with complex expressions by executing EXPLAIN <sql_statement>. For more information, see Query analysis.
 
- Each complex expression must have an alias and different aliases must be assigned to different complex expressions among all the synchronous materialized views of a base table. For example, query statements 
 
- 
- 
WHERE (optional) From v3.1.8 onwards, synchronous materialized views support the WHERE clause which can filter rows used for the materialized view. 
- 
GROUP BY (optional) The GROUP BY column of the query. If this parameter is not specified, the data will not be grouped by default. 
- 
ORDER BY (optional) The ORDER BY column of the query. - Columns in the ORDER BY clause must be declared in the same order as the columns in select_expr.
- If the query statement contains a GROUP BY clause, the ORDER BY columns must be identical to the GROUP BY columns.
- If this parameter is not specified, the system will automatically supplement the ORDER BY column according to the following rules:
- If the materialized view is the AGGREGATE type, all GROUP BY columns are automatically used as sort keys.
- If the materialized view is not the AGGREGATE type, StarRocks automatically selects sort keys based on the prefix columns.
 
 
- Columns in the ORDER BY clause must be declared in the same order as the columns in 
Query a synchronous materialized view
Because a synchronous materialized view is essentially an index of the base table rather than a physical table, you can only query a synchronous materialized view using the hint [_SYNC_MV_]:
-- Do not omit the brackets [] in the hint.
SELECT * FROM <mv_name> [_SYNC_MV_];
CAUTION
Currently, StarRocks automatically generates names for columns in a synchronous materialized view even if you have specified aliases for them.
Automatic query rewrite with synchronous materialized view
When a query that follows the pattern of a synchronous materialized view is executed, the original query statement is automatically rewritten and the intermediate results stored in the materialized view are used.
The following table shows the correspondence between the aggregate function in the original query and the aggregate function used to construct the materialized view. You can select the corresponding aggregate function to build a materialized view according to your business scenario.
| aggregate function in the original query | aggregate function of the materialized view | 
|---|---|
| sum | sum | 
| min | min | 
| max | max | 
| count | count | 
| bitmap_union, bitmap_union_count, count(distinct) | bitmap_union | 
| hll_raw_agg, hll_union_agg, ndv, approx_count_distinct | hll_union | 
| percentile_approx, percentile_union | percentile_union | 
In addition to the above functions, starting from StarRocks v3.4.0, synchronous materialized views also support generic aggregate functions. For more information about generic aggregate functions, see Generic aggregate function states.
-- Create a synchronous materialized view test_mv1 to store aggregate states.
CREATE MATERIALIZED VIEW test_mv1 
AS
SELECT 
    dt,
    -- Original aggregate functions.
    min(id) AS min_id,
    max(id) AS max_id,
    sum(id) AS sum_id,
    bitmap_union(to_bitmap(id)) AS bitmap_union_id,
    hll_union(hll_hash(id)) AS hll_union_id,
    percentile_union(percentile_hash(id)) AS percentile_union_id,
    -- Generic aggregate state functions.
    ds_hll_count_distinct_union(ds_hll_count_distinct_state(id)) AS hll_id,
    avg_union(avg_state(id)) AS avg_id,
    array_agg_union(array_agg_state(id)) AS array_agg_id,
    min_by_union(min_by_state(province, id)) AS min_by_province_id
FROM t1
GROUP BY dt;
Asynchronous materialized view
Syntax
CREATE MATERIALIZED VIEW [IF NOT EXISTS] [database.]<mv_name>
[COMMENT ""]
-- You must specify either `distribution_desc` or `refresh_scheme`, or both.
-- distribution_desc
[DISTRIBUTED BY HASH(<bucket_key>[,<bucket_key2> ...]) [BUCKETS <bucket_number>]]
-- refresh_desc
[REFRESH 
-- refresh_moment
    [IMMEDIATE | DEFERRED]
-- refresh_scheme
    [ASYNC | ASYNC [START (<start_time>)] EVERY (INTERVAL <refresh_interval>) | MANUAL]
]
-- partition_expression
[PARTITION BY 
    {<date_column> | date_trunc(fmt, <date_column>)}
]
-- order_by_expression
[ORDER BY (<sort_key>)]
[PROPERTIES ("key"="value", ...)]
AS 
<query_statement>
Parameters in brackets [] are optional.
Parameters
mv_name (required)
The name of the materialized view. The naming requirements are as follows:
- The name must consist of letters (a-z or A-Z), digits (0-9), or underscores (_), and it can only start with a letter.
- The length of the name cannot exceed 64 characters.
- The name is case-sensitive.
CAUTION
Multiple materialized views can be created on the same base table, but the names of the materialized views in the same database cannot be duplicated.
COMMENT (optional)
Comment on the materialized view. Note that COMMENT must be placed after mv_name. Otherwise, the materialized view cannot be created.
distribution_desc (optional)
The bucketing strategy of the asynchronous materialized view. StarRocks supports hash bucketing and random bucketing (from v3.1 onwards). If you do not specify this parameter, StarRocks uses the random bucketing strategy and automatically sets the number of buckets.
NOTE
While creating an asynchronous materialized view, you must specify either
distribution_descorrefresh_scheme, or both.
- 
Hash bucketing: Syntax DISTRIBUTED BY HASH (<bucket_key1>[,<bucket_key2> ...]) [BUCKETS <bucket_number>]For more information, see Data distribution. NOTE Since v2.5.7, StarRocks can automatically set the number of buckets (BUCKETS) when you create a table or add a partition. You no longer need to manually set the number of buckets. For detailed information, see set the number of buckets. 
- 
Random bucketing: If you choose the random bucketing strategy and allow StarRocks to set the number of buckets automatically, you do not need to specify distribution_desc. However, if you want to set the number of buckets manually, you can refer to the following syntax:DISTRIBUTED BY RANDOM BUCKETS <bucket_number>CAUTION Asynchronous materialized views with a random bucketing strategy cannot be assigned to a colocation group. For more information, see Random bucketing 
refresh_moment (optional)
The refresh moment of the materialized view. Default value: IMMEDIATE. Valid values:
- IMMEDIATE: Refresh the asynchronous materialized view immediately after it is created.
- DEFERRED: The asynchronous materialized view is not refreshed after it is created. You can manually refresh the materialized view or schedule regular refresh tasks.
refresh_scheme (optional)
NOTE
While creating an asynchronous materialized view, you must specify either
distribution_descorrefresh_scheme, or both.
The refresh strategy of the asynchronous materialized view. Valid values:
- ASYNC: Automatic refresh mode. Each time the base table data changes, the materialized view is automatically refreshed.
- ASYNC [START (<start_time>)] EVERY(INTERVAL <interval>): Regular refresh mode. The materialized view is refreshed regularly at the interval defined. You can specify the interval as- EVERY (interval n day/hour/minute/second)using the following units:- DAY,- HOUR,- MINUTE, and- SECOND. The default value is- 10 MINUTE. You can further specify the refresh start time as- START('yyyy-MM-dd hh:mm:ss'). If the start time is not specified, the current time is used. Example:- ASYNC START ('2023-09-12 16:30:25') EVERY (INTERVAL 5 MINUTE).
- MANUAL: Manual refresh mode. The materialized view will not be refreshed unless you trigger a refresh task manually.
If this parameter is not specified, the default value MANUAL is used.
partition_expression (optional)
The partitioning strategy of the asynchronous materialized view. As for the current version of StarRocks, only one partition expression is supported when creating an asynchronous materialized view.
CAUTION
From v3.3.3 onwards, StarRocks supports creating asynchronous materialized views with the List Partitioning strategy.
- You can create list-partitioned materialized views based on tables that are created with the List Partitioning or Expression partitioning strategy.
- Currently, you can only specify one Partition Key when creating materialized views with the List Partitioning strategy. You must choose one Partition Key if the base table has more than one Partition Key.
- The refresh behavior and query rewrite logic of materialized views with the List Partitioning strategy are consistent with those with the Range Partitioning strategy.
Valid values:
- column_name: The name of the column used for partitioning. The expression- PARTITION BY dtmeans to partition the materialized view according to the- dtcolumn.
- date_truncfunction: The function used to truncate the time unit.- PARTITION BY date_trunc("MONTH", dt)means that the- dtcolumn is truncated to month as the unit for partitioning. The- date_truncfunction supports truncating time to units including- YEAR,- MONTH,- DAY,- HOUR, and- MINUTE.
- str2datefunction: The function used to partition string type parititions of base table into materialized view's partition.- PARTITION BY str2date(dt, "%Y%m%d")means that the- dtcolumn is a string date type whose date format is- "%Y%m%d". The- str2datefunction supports a lot of date formats, you can refer to str2date for more information. Supported from v3.1.4.
- time_slicefunction: From v3.1 onwards, you can further use these functions to convert the given time into the beginning or end of a time interval based on the specified time granularity, for example,- PARTITION BY date_trunc("MONTH", time_slice(dt, INTERVAL 7 DAY))where time_slice must have a finer granularity than date_trunc. You can use them to specify a GROUP BY column with a finer granularity than that of the partitioning key, for example,- GROUP BY time_slice(dt, INTERVAL 1 MINUTE) PARTITION BY date_trunc('DAY', ts).
If this parameter is not specified, no partitioning strategy is adopted by default.
order_by_expression (optional)
The sort key of the asynchronous materialized view. If you do not specify the sort key, StarRocks chooses some of the prefix columns from SELECT columns as the sort keys. For example, in select a, b, c, d, sort keys can be a and b. This parameter is supported from StarRocks v3.0 onwards.
NOTE There are two different uses of
ORDER BYin materialized views:
ORDER BYin the CREATE MATERIALIZED VIEW statement defines the sort key of the materialized view, which helps accelerate queries based on the sort key. This does not affect the materialized view's SPJG-based transparent acceleration capability but does not guarantee global ordering of the materialized view's query results.
ORDER BYin the materialized view's query definition guarantees global ordering of the query results, but prevents the materialized view from being used for SPJG-based transparent query rewrite. Therefore,ORDER BYshould not be used in the materialized view's query definition if the MV is used for query rewrite usage.
INDEX (optional)
Asynchronous materialized views support Bitmap and BloomFilter indexes to accelerate query performance, and their usage is the same as in regular tables. For details on the use cases and information about Bitmap and BloomFilter indexes, please refer to:Bitmap Index and Bloom filter Index.
Using Bitmap Indexes:
-- Create an index  
CREATE INDEX <index_name> ON <mv_name>(<column_name>) USING BITMAP COMMENT '<comment>';  
-- Check index creation progress  
SHOW ALTER TABLE COLUMN;  
-- View indexes  
SHOW INDEXES FROM <mv_name>;  
-- Drop an index  
DROP INDEX <index_name> ON <mv_name>;  
Using BloomFilter Indexes:
-- Create an index  
ALTER MATERIALIZED VIEW <mv_name> SET ("bloom_filter_columns" = "<col1,col2,col3,...>");  
-- View indexes  
SHOW CREATE MATERIALIZED VIEW <mv_name>;  
-- Drop an index  
ALTER MATERIALIZED VIEW <mv_name> SET ("bloom_filter_columns" = "");  
PROPERTIES (optional)
Properties of the asynchronous materialized view. You can modify the properties of an existing materialized view using ALTER MATERIALIZED VIEW.
- 
session.: If you want to alter a session variable-related property of the materialized view, you must add asession.prefix to the property, for example,session.insert_timeout. You do not need to specify the prefix for non-session properties, for example,mv_rewrite_staleness_second.
- 
replication_num: The number of materialized view replicas to create.
- 
storage_medium: Storage medium type. Valid values:HDDandSSD.
- 
storage_cooldown_time: the storage cooldown time for a partition. If both HDD and SSD storage mediums are used, data in the SSD storage is moved to the HDD storage after the time specified by this property. Format: "yyyy-MM-dd HH:mm:ss". The specified time must be later than the current time. If this property is not explicitly specified, the storage cooldown is not performed by default.
- 
bloom_filter_columns: An array of column names that enable Bloom filter indexing. For details about Bloom filter indexes, see Bloom filter Index.
- 
partition_ttl: The time-to-live (TTL) for partitions. Partitions whose data is within the specified time range are retained. Expired partitions are deleted automatically. Unit:YEAR,MONTH,DAY,HOUR, andMINUTE. For example, you can specify this property as2 MONTH. This property is recommended overpartition_ttl_number. It is supported from v3.1.5 onwards.
- 
partition_ttl_number: The number of most recent materialized view partitions to retain. For the partitions with a start time earlier than the current time, after the number of these partitions exceeds this value, less recent partitions will be deleted. StarRocks will periodically check materialized view partitions according to the time interval specified in the FE configuration itemdynamic_partition_check_interval_seconds, and automatically delete expired partitions. If you enabled the dynamic partitioning strategy, the partitions created in advance are not counted in. When the value is-1, all partitions of the materialized view will be preserved. Default:-1.
- 
partition_refresh_number: In a single refresh, the maximum number of partitions to refresh. If the number of partitions to be refreshed exceeds this value, StarRocks will split the refresh task and complete it in batches. Only when the previous batch of partitions is refreshed successfully, StarRocks will continue to refresh the next batch of partitions until all partitions are refreshed. If any of the partitions fail to be refreshed, no subsequent refresh tasks will be generated. When the value is-1, the refresh task will not be split. The default value is changed from-1to1since v3.3, meaning StarRocks refeshes partitions one by one.
- 
excluded_trigger_tables: If a base table of the materialized view is listed here, the automatic refresh task will not be triggered when the data in the base table is changed. This parameter only applies to load-triggered refresh strategy, and is usually used together with the propertyauto_refresh_partitions_limit. Format:[db_name.]table_name. When the value is an empty string, any data change in all base tables triggers the refresh of the corresponding materialized view. The default value is an empty string.
- 
excluded_refresh_tables: The base tables listed in this property will not be updated to the materialized view when their data changes. Format:[db_name.]table_name. The default value is an empty string. When the value is an empty string, any base table data change will trigger the corresponding materialized view refresh.tipThe difference between excluded_trigger_tablesandexcluded_refresh_tablesis:- excluded_trigger_tablescontrols whether to trigger a refresh, not whether to participate in the refresh. For example, a partitioned materialized view is obtained by joining two partitioned tables A and B, and the partitions of the two tables A and B correspond one to one.- excluded_trigger_tablecontains table A. During a period of time, table A updated partitions- [1,2,3], but because it is an- excluded_trigger_table, the refresh of the materialized view is not triggered. At this time, table B updates partition- [3], and the materialized view triggers a refresh, which will refresh the three partitions- [1, 2, 3]. Here you can see that- excluded_trigger_tableonly controls whether to trigger a refresh. Although the update of table A cannot trigger a materialized view refresh, when the update of table B triggers a materialized view refresh, the partition updated by table A will also be added to the refresh task.
- excluded_refresh_tablescontrols whether to participate in the refresh. In the above example, if table A exists in both- excluded_trigger_tableand- excluded_refresh_tables, when the update of table B triggers a materialized view refresh, only partition- [3]will be refreshed.
 
- 
auto_refresh_partitions_limit: The number of most recent materialized view partitions that need to be refreshed when a materialized view refresh is triggered. You can use this property to limit the refresh range and reduce the refresh cost. However, because not all the partitions are refreshed, the data in the materialized view may not be consistent with the base table. Default:-1. When the value is-1, all partitions will be refreshed. When the value is a positive integer N, StarRocks sorts the existing partitions in chronological order, and refreshes the current partition and N-1 most recent partitions. If the number of partitions is less than N, StarRocks refreshes all existing partitions. If there are dynamic partitions created in advance in your materialized view, StarRocks refreshes all pre-created partitions.
- 
mv_rewrite_staleness_second: If the materialized view's last refresh is within the time interval specified in this property, this materialized view can be used directly for query rewrite, regardless of whether the data in the base tables changes. If the last refresh is before this time interval, StarRocks checks whether the base tables have been updated to determine whether the materialized view can be used for query rewrite. Unit: Second. This property is supported from v3.0.
- 
colocate_with: The colocation group of the asynchronous materialized view. See Colocate Join for further information. This property is supported from v3.0.
- 
unique_constraintsandforeign_key_constraints: The Unique Key constraints and Foreign Key constraints when you create an asynchronous materialized view for query rewrite in the View Delta Join scenario. See Asynchronous materialized view - Rewrite queries in View Delta Join scenario for further information. This property is supported from v3.0.
- 
excluded_refresh_tables:The base tables listed in this property will not trigger data refresh to the materialized view when their data changes. This property is usually used together with theexcluded_trigger_tablesproperty. Format:[db_name.]table_name. The default value is an empty string. When the value is an empty string, any data change in all base tables will trigger the corresponding materialized view refresh.CAUTION The Unique Key constraints and Foreign Key constraints are only used for query rewrite. The Foreign Key constraint checks are not guaranteed when data is loaded into the table. You must ensure the data loaded into the table meets the constraints. 
- 
resource_group: The resource group to which the refresh tasks of the materialized view belong. The default value of this property isdefault_mv_wg, which is a system-defined resource group specifically used for materialized view refresh. Thecpu_core_limitofdefault_mv_wgis1,mem_limitis0.8. For more about resource groups, see Resource group.
- 
query_rewrite_consistency: The query rewrite rule for the asynchronous materialized views. This property is supported from v3.2. Valid values:- disable: Disable automatic query rewrite of the asynchronous materialized view.
- checked(Default value): Enable automatic query rewrite only when the materialized view meets the timeliness requirement, which means:- If mv_rewrite_staleness_secondis not specified, the materialized view can be used for query rewrite only when its data is consistent with the data in all base tables.
- If mv_rewrite_staleness_secondis specified, the materialized view can be used for query rewrite when its last refresh is within the staleness time interval.
 
- If 
- loose: Enable automatic query rewrite directly, and no consistency check is required.
 
- 
storage_volume: The name of the storage volume used to store the asynchronous materialized view you want to create if you are using a shared-data cluster. This property is supported from v3.1 onwards. If this property is not specified, the default storage volume is used. Example:"storage_volume" = "def_volume".
- 
force_external_table_query_rewrite: Whether to enable query rewrite for external catalog-based materialized views. This property is supported from v3.2. Valid values:- true(Default value since v3.3): Enable query rewrite for external catalog-based materialized views.
- false: Disable query rewrite for external catalog-based materialized views.
 Because strong data consistency is not guaranteed between base tables and external catalog-based materialized views, this feature is set to falseby default. When this feature is enabled, the materialized view is used for query rewrite in accordance with the rule specified inquery_rewrite_consistency.
- 
enable_query_rewrite: Whether to use the materialized view for query rewrite. When there are many materialized views, query rewrite based on materialized views can impact the optimizer's time consumption. With this property, you can control whether the materialized view can be used for query rewrite. This feature is supported from v3.3.0 onwards. Valid values:- default(Default): The system will not perform semantic checks on the materialized view, but only the SPJG-type materialized views can be used for query rewrite. Note that if the text-based query rewrite is enabled, non-SPJG-type materialized views can also be used for query rewrite.
- true: The system will perform semantic checks when creating or modifying the materialized view. If the materialized view is not eligible for query rewrite (that is, the definition of the materialized view is not an SPJG-type query), a failure will be returned.
- false: The materialized view will not be used for query rewrite.
 
- 
[Preview] transparent_mv_rewrite_mode: Specifies the transparent rewrite mode for queries directly against the materialized view. This feature is supported from v3.3.0 onwards. Valid values:- false(Default, compatible with the behavior in earlier versions): Queries directly against the materialized view will not be rewritten, and are only returned with the existing data in the materialized view. Their query results may be different from those of queries based on the definition of the materialized view according to the refresh status (data consistency) of the materialized view.
- true: Queries directly against the materialized view will be rewritten and returned with the most updated data, which is consistent with the result of the materialized view definition query. Please note that when the materialized view is inactive or does not support transparent query rewrite, these queries will be executed as the materialized view definition query.
- transparent_or_error: Queries directly against the materialized view will be rewritten whenever they are eligible. If the materialized view is inactive or does not support transparent query rewrite, these queries will be returned with an error.
- transparent_or_defaultQueries directly against the materialized view will be rewritten whenever they are eligible. If the materialized view is inactive or does not support transparent query rewrite, these queries will be returned with the existing data in the materialized view.
 
query_statement (required)
The query statement to create the asynchronous materialized view. From v3.1.6 onwards, StarRocks supports creating asynchronous materialized views with Common Table Expression (CTE).
Query an asynchronous materialized view
An asynchronous materialized view is a physical table. You can operate it as any regular table except that you cannot directly load data into an asynchronous materialized view.
Automatic query rewrite with asynchronous materialized view
StarRocks v2.5 supports automatic and transparent query rewrite based on the SPJG-type asynchronous materialized views. The SPJG-type materialized views refer to materialized views whose plan only includes Scan, Filter, Project, and Aggregate types of operators. The SPJG-type materialized views query rewrite includes single table query rewrite, Join query rewrite, aggregation query rewrite, Union query rewrite and query rewrite based on nested materialized views.
See Asynchronous materialized view - Rewrite queries with the asynchronous materialized view for further information.
Supported data types
- 
Asynchronous materialized views created based on the StarRocks default catalog support the following data types: - Date: DATE, DATETIME
- String: CHAR, VARCHAR
- Numeric: BOOLEAN, TINYINT, SMALLINT, INT, BIGINT, LARGEINT, FLOAT, DOUBLE, DECIMAL, PERCENTILE
- Semi-structured: ARRAY, JSON, MAP (from v3.1 onwards), STRUCT (from v3.1 onwards)
- Other: BITMAP, HLL
 
NOTE
BITMAP, HLL, and PERCENTILE have been supported since v2.4.5.
- 
Asynchronous materialized views created based on the StarRocks external catalogs support the following data types: - 
Hive Catalog - Numeric: INT/INTEGER, BIGINT, DOUBLE, FLOAT, DECIMAL
- Date: TIMESTAMP
- String: STRING, VARCHAR, CHAR
- Semi-structured: ARRAY
 
- 
Hudi Catalog - Numeric: BOOLEAN, INT, LONG, FLOAT, DOUBLE, DECIMAL
- Date: DATE, TimeMillis/TimeMicros, TimestampMillis/TimestampMicros
- String: STRING
- Semi-structured: ARRAY
 
- 
Iceberg Catalog - Numeric: BOOLEAN, INT, LONG, FLOAT, DOUBLE, DECIMAL(P, S)
- Date: DATE, TIME, TIMESTAMP
- String: STRING, UUID, FIXED(L), BINARY
- Semi-structured: LIST
 
 
- 
Usage notes
- 
The current version of StarRocks does not support creating multiple materialized views at the same time. A new materialized view can only be created when the one before is completed. 
- 
About synchronous materialized views: - Synchronous materialized views only support aggregate functions on a single column. Query statements in the form of sum(a+b)are not supported.
- Synchronous materialized views support only one aggregate function for each column of the base table. Query statements such as select sum(a), min(a) from tableare not supported.
- When creating a synchronous materialized view with an aggregate function, you must specify the GROUP BY clause, and specify at least one GROUP BY column in SELECT.
- Synchronous materialized views do not support clauses such as JOIN, and the HAVING clause of GROUP BY.
- When using ALTER TABLE DROP COLUMN to drop a specific column in a base table, you must ensure that all synchronous materialized views of the base table do not contain the dropped column, otherwise, the drop operation will fail. Before you drop the column, you must first drop all synchronous materialized views that contain the column.
- Creating too many synchronous materialized views for a table will affect the data load efficiency. When data is being loaded to the base table, the data in the synchronous materialized view and the base table will be updated synchronously. If a base table contains nsynchronous materialized views, the efficiency of loading data into the base table is about the same as the efficiency of loading data intontables.
 
- Synchronous materialized views only support aggregate functions on a single column. Query statements in the form of 
- 
About nested asynchronous materialized views: - The refresh strategy for each materialized view only applies to the corresponding materialized view.
- Currently, StarRocks does not limit the number of nesting levels. In a production environment, we recommend that the number of nesting layers not exceed THREE.
 
- 
About external catalog asynchronous materialized views: - External catalog materialized view only support async fixed-interval refresh and manual refresh.
- Strict consistency is not guaranteed between the materialized view and the base tables in the external catalog.
- Currently, building materialized views based on external resources is not supported.
- Currently, StarRocks cannot perceive if the base table data in the external catalog has changed, so all partitions will be refreshed by default every time the base table is refreshed. You can manually refresh only some of partitions using REFRESH MATERIALIZED VIEW.
 
Examples
Examples of synchronous materialized views
The schema of the base table is as follows:
mysql> desc duplicate_table;
+-------+--------+------+------+---------+-------+
| Field | Type   | Null | Key  | Default | Extra |
+-------+--------+------+------+---------+-------+
| k1    | INT    | Yes  | true | N/A     |       |
| k2    | INT    | Yes  | true | N/A     |       |
| k3    | BIGINT | Yes  | true | N/A     |       |
| k4    | BIGINT | Yes  | true | N/A     |       |
+-------+--------+------+------+---------+-------+
Example 1: Create a synchronous materialized view that only contains the columns of the original table (k1, k2).
create materialized view k1_k2 as
select k1, k2 from duplicate_table;
The materialized view contains only two columns k1 and k2 without any aggregation.
+-----------------+-------+--------+------+------+---------+-------+
| IndexName       | Field | Type   | Null | Key  | Default | Extra |
+-----------------+-------+--------+------+------+---------+-------+
| k1_k2           | k1    | INT    | Yes  | true | N/A     |       |
|                 | k2    | INT    | Yes  | true | N/A     |       |
+-----------------+-------+--------+------+------+---------+-------+
Example 2: Create a synchronous materialized view sorted by k2.
create materialized view k2_order as
select k2, k1 from duplicate_table order by k2;
The materialized view's schema is shown below. The materialized view contains only two columns k2 and k1, where column k2 is a sort column without any aggregation.
+-----------------+-------+--------+------+-------+---------+-------+
| IndexName       | Field | Type   | Null | Key   | Default | Extra |
+-----------------+-------+--------+------+-------+---------+-------+
| k2_order        | k2    | INT    | Yes  | true  | N/A     |       |
|                 | k1    | INT    | Yes  | false | N/A     | NONE  |
+-----------------+-------+--------+------+-------+---------+-------+
Example 3: Create a synchronous materialized view grouped by k1 and k2, and a SUM aggregation on k3.
create materialized view k1_k2_sumk3 as
select k1, k2, sum(k3) from duplicate_table group by k1, k2;
The materialized view's schema is shown below. The materialized view contains three columns k1, k2 and sum (k3), where k1, k2 are grouped columns, and sum (k3) is the sum of the k3 columns grouped according to k1 and k2.
+-----------------+-------+--------+------+-------+---------+-------+
| IndexName       | Field | Type   | Null | Key   | Default | Extra |
+-----------------+-------+--------+------+-------+---------+-------+
| k1_k2_sumk3     | k1    | INT    | Yes  | true  | N/A     |       |
|                 | k2    | INT    | Yes  | true  | N/A     |       |
|                 | k3    | BIGINT | Yes  | false | N/A     | SUM   |
+-----------------+-------+--------+------+-------+---------+-------+
Because the materialized view does not declare a sort column, and it adopts an aggregation function, StarRocks supplements the grouped columns k1 and k2 by default.
Example 4: Create a synchronous materialized view to remove duplicate rows.
create materialized view deduplicate as
select k1, k2, k3, k4 from duplicate_table group by k1, k2, k3, k4;
The materialized view's schema is shown below. The materialized view contains k1, k2, k3, and k4 columns, and there are no duplicate rows.
+-----------------+-------+--------+------+-------+---------+-------+
| IndexName       | Field | Type   | Null | Key   | Default | Extra |
+-----------------+-------+--------+------+-------+---------+-------+
| deduplicate     | k1    | INT    | Yes  | true  | N/A     |       |
|                 | k2    | INT    | Yes  | true  | N/A     |       |
|                 | k3    | BIGINT | Yes  | true  | N/A     |       |
|                 | k4    | BIGINT | Yes  | true  | N/A     |       |
+-----------------+-------+--------+------+-------+---------+-------+
Example 5: Create a non-aggregated synchronous materialized view that does not declare a sort column.
The schema of the base table is shown below:
+-------+--------------+------+-------+---------+-------+
| Field | Type         | Null | Key   | Default | Extra |
+-------+--------------+------+-------+---------+-------+
| k1    | TINYINT      | Yes  | true  | N/A     |       |
| k2    | SMALLINT     | Yes  | true  | N/A     |       |
| k3    | INT          | Yes  | true  | N/A     |       |
| k4    | BIGINT       | Yes  | true  | N/A     |       |
| k5    | DECIMAL(9,0) | Yes  | true  | N/A     |       |
| k6    | DOUBLE       | Yes  | false | N/A     | NONE  |
| k7    | VARCHAR(20)  | Yes  | false | N/A     | NONE  |
+-------+--------------+------+-------+---------+-------+
The materialized view contains k3, k4, k5, k6, and k7 columns, and no sort column is declared. Create the materialized view with the following statement:
create materialized view mv_1 as
select k3, k4, k5, k6, k7 from all_type_table;
StarRocks automatically uses k3, k4, and k5 as the sort columns by default. The sum of the bytes occupied by these three column types is 4 (INT) + 8 (BIGINT) + 16 (DECIMAL) = 28 < 36. So these three columns are added as sort columns.
The materialized view's schema is as follows.
+----------------+-------+--------------+------+-------+---------+-------+
| IndexName      | Field | Type         | Null | Key   | Default | Extra |
+----------------+-------+--------------+------+-------+---------+-------+
| mv_1           | k3    | INT          | Yes  | true  | N/A     |       |
|                | k4    | BIGINT       | Yes  | true  | N/A     |       |
|                | k5    | DECIMAL(9,0) | Yes  | true  | N/A     |       |
|                | k6    | DOUBLE       | Yes  | false | N/A     | NONE  |
|                | k7    | VARCHAR(20)  | Yes  | false | N/A     | NONE  |
+----------------+-------+--------------+------+-------+---------+-------+
It can be observed that the key field of the k3, k4, and k5 columns is true, which indicates that they are the sort keys. The key field of the k6, and k7 columns is false, which indicates that they are not the sort keys.
Example 6: Create a synchronous materialized view that contains the WHERE clause and complex expressions.
-- Create the base table: user_event
CREATE TABLE user_event (
      ds date   NOT NULL,
      id  varchar(256)    NOT NULL,
      user_id int DEFAULT NULL,
      user_id1    varchar(256)    DEFAULT NULL,
      user_id2    varchar(256)    DEFAULT NULL,
      column_01   int DEFAULT NULL,
      column_02   int DEFAULT NULL,
      column_03   int DEFAULT NULL,
      column_04   int DEFAULT NULL,
      column_05   int DEFAULT NULL,
      column_06   DECIMAL(12,2)   DEFAULT NULL,
      column_07   DECIMAL(12,3)   DEFAULT NULL,
      column_08   JSON   DEFAULT NULL,
      column_09   DATETIME    DEFAULT NULL,
      column_10   DATETIME    DEFAULT NULL,
      column_11   DATE    DEFAULT NULL,
      column_12   varchar(256)    DEFAULT NULL,
      column_13   varchar(256)    DEFAULT NULL,
      column_14   varchar(256)    DEFAULT NULL,
      column_15   varchar(256)    DEFAULT NULL,
      column_16   varchar(256)    DEFAULT NULL,
      column_17   varchar(256)    DEFAULT NULL,
      column_18   varchar(256)    DEFAULT NULL,
      column_19   varchar(256)    DEFAULT NULL,
      column_20   varchar(256)    DEFAULT NULL,
      column_21   varchar(256)    DEFAULT NULL,
      column_22   varchar(256)    DEFAULT NULL,
      column_23   varchar(256)    DEFAULT NULL,
      column_24   varchar(256)    DEFAULT NULL,
      column_25   varchar(256)    DEFAULT NULL,
      column_26   varchar(256)    DEFAULT NULL,
      column_27   varchar(256)    DEFAULT NULL,
      column_28   varchar(256)    DEFAULT NULL,
      column_29   varchar(256)    DEFAULT NULL,
      column_30   varchar(256)    DEFAULT NULL,
      column_31   varchar(256)    DEFAULT NULL,
      column_32   varchar(256)    DEFAULT NULL,
      column_33   varchar(256)    DEFAULT NULL,
      column_34   varchar(256)    DEFAULT NULL,
      column_35   varchar(256)    DEFAULT NULL,
      column_36   varchar(256)    DEFAULT NULL,
      column_37   varchar(256)    DEFAULT NULL
  )
  PARTITION BY date_trunc("day", ds)
  DISTRIBUTED BY hash(id);
  -- Create the materialized view with the WHERE clause and complex expresssions.
  CREATE MATERIALIZED VIEW test_mv1
  AS 
  SELECT
  ds,
  column_19,
  column_36,
  sum(column_01) as column_01_sum,
  bitmap_union(to_bitmap( user_id)) as user_id_dist_cnt,
  bitmap_union(to_bitmap(case when column_01 > 1 and column_34 IN ('1','34')   then user_id2 else null end)) as filter_dist_cnt_1,
  bitmap_union(to_bitmap( case when column_02 > 60 and column_35 IN ('11','13') then  user_id2 else null end)) as filter_dist_cnt_2,
  bitmap_union(to_bitmap(case when column_03 > 70 and column_36 IN ('21','23') then  user_id2 else null end)) as filter_dist_cnt_3,
  bitmap_union(to_bitmap(case when column_04 > 20 and column_27 IN ('31','27') then  user_id2 else null end)) as filter_dist_cnt_4,
  bitmap_union(to_bitmap( case when column_05 > 90 and column_28 IN ('41','43') then  user_id2 else null end)) as filter_dist_cnt_5
  FROM user_event
  WHERE ds >= '2023-11-02'
  GROUP BY
  ds,
  column_19,
  column_36;
Examples of asynchronous materialized views
The following examples are based on the base tables below:
CREATE TABLE `lineorder` (
  `lo_orderkey` int(11) NOT NULL COMMENT "",
  `lo_linenumber` int(11) NOT NULL COMMENT "",
  `lo_custkey` int(11) NOT NULL COMMENT "",
  `lo_partkey` int(11) NOT NULL COMMENT "",
  `lo_suppkey` int(11) NOT NULL COMMENT "",
  `lo_orderdate` int(11) NOT NULL COMMENT "",
  `lo_orderpriority` varchar(16) NOT NULL COMMENT "",
  `lo_shippriority` int(11) NOT NULL COMMENT "",
  `lo_quantity` int(11) NOT NULL COMMENT "",
  `lo_extendedprice` int(11) NOT NULL COMMENT "",
  `lo_ordtotalprice` int(11) NOT NULL COMMENT "",
  `lo_discount` int(11) NOT NULL COMMENT "",
  `lo_revenue` int(11) NOT NULL COMMENT "",
  `lo_supplycost` int(11) NOT NULL COMMENT "",
  `lo_tax` int(11) NOT NULL COMMENT "",
  `lo_commitdate` int(11) NOT NULL COMMENT "",
  `lo_shipmode` varchar(11) NOT NULL COMMENT ""
) ENGINE=OLAP
DUPLICATE KEY(`lo_orderkey`)
COMMENT "OLAP"
PARTITION BY RANGE(`lo_orderdate`)
(PARTITION p1 VALUES [("-2147483648"), ("19930101")),
PARTITION p2 VALUES [("19930101"), ("19940101")),
PARTITION p3 VALUES [("19940101"), ("19950101")),
PARTITION p4 VALUES [("19950101"), ("19960101")),
PARTITION p5 VALUES [("19960101"), ("19970101")),
PARTITION p6 VALUES [("19970101"), ("19980101")),
PARTITION p7 VALUES [("19980101"), ("19990101")))
DISTRIBUTED BY HASH(`lo_orderkey`);
CREATE TABLE IF NOT EXISTS `customer` (
  `c_custkey` int(11) NOT NULL COMMENT "",
  `c_name` varchar(26) NOT NULL COMMENT "",
  `c_address` varchar(41) NOT NULL COMMENT "",
  `c_city` varchar(11) NOT NULL COMMENT "",
  `c_nation` varchar(16) NOT NULL COMMENT "",
  `c_region` varchar(13) NOT NULL COMMENT "",
  `c_phone` varchar(16) NOT NULL COMMENT "",
  `c_mktsegment` varchar(11) NOT NULL COMMENT ""
) ENGINE=OLAP
DUPLICATE KEY(`c_custkey`)
COMMENT "OLAP"
DISTRIBUTED BY HASH(`c_custkey`);
CREATE TABLE IF NOT EXISTS `dates` (
  `d_datekey` int(11) NOT NULL COMMENT "",
  `d_date` varchar(20) NOT NULL COMMENT "",
  `d_dayofweek` varchar(10) NOT NULL COMMENT "",
  `d_month` varchar(11) NOT NULL COMMENT "",
  `d_year` int(11) NOT NULL COMMENT "",
  `d_yearmonthnum` int(11) NOT NULL COMMENT "",
  `d_yearmonth` varchar(9) NOT NULL COMMENT "",
  `d_daynuminweek` int(11) NOT NULL COMMENT "",
  `d_daynuminmonth` int(11) NOT NULL COMMENT "",
  `d_daynuminyear` int(11) NOT NULL COMMENT "",
  `d_monthnuminyear` int(11) NOT NULL COMMENT "",
  `d_weeknuminyear` int(11) NOT NULL COMMENT "",
  `d_sellingseason` varchar(14) NOT NULL COMMENT "",
  `d_lastdayinweekfl` int(11) NOT NULL COMMENT "",
  `d_lastdayinmonthfl` int(11) NOT NULL COMMENT "",
  `d_holidayfl` int(11) NOT NULL COMMENT "",
  `d_weekdayfl` int(11) NOT NULL COMMENT ""
) ENGINE=OLAP
DUPLICATE KEY(`d_datekey`)
COMMENT "OLAP"
DISTRIBUTED BY HASH(`d_datekey`);
CREATE TABLE IF NOT EXISTS `supplier` (
  `s_suppkey` int(11) NOT NULL COMMENT "",
  `s_name` varchar(26) NOT NULL COMMENT "",
  `s_address` varchar(26) NOT NULL COMMENT "",
  `s_city` varchar(11) NOT NULL COMMENT "",
  `s_nation` varchar(16) NOT NULL COMMENT "",
  `s_region` varchar(13) NOT NULL COMMENT "",
  `s_phone` varchar(16) NOT NULL COMMENT ""
) ENGINE=OLAP
DUPLICATE KEY(`s_suppkey`)
COMMENT "OLAP"
DISTRIBUTED BY HASH(`s_suppkey`);
CREATE TABLE IF NOT EXISTS `part` (
  `p_partkey` int(11) NOT NULL COMMENT "",
  `p_name` varchar(23) NOT NULL COMMENT "",
  `p_mfgr` varchar(7) NOT NULL COMMENT "",
  `p_category` varchar(8) NOT NULL COMMENT "",
  `p_brand` varchar(10) NOT NULL COMMENT "",
  `p_color` varchar(12) NOT NULL COMMENT "",
  `p_type` varchar(26) NOT NULL COMMENT "",
  `p_size` int(11) NOT NULL COMMENT "",
  `p_container` varchar(11) NOT NULL COMMENT ""
) ENGINE=OLAP
DUPLICATE KEY(`p_partkey`)
COMMENT "OLAP"
DISTRIBUTED BY HASH(`p_partkey`);
create table orders ( 
    dt date NOT NULL, 
    order_id bigint NOT NULL, 
    user_id int NOT NULL, 
    merchant_id int NOT NULL, 
    good_id int NOT NULL, 
    good_name string NOT NULL, 
    price int NOT NULL, 
    cnt int NOT NULL, 
    revenue int NOT NULL, 
    state tinyint NOT NULL 
) 
PRIMARY KEY (dt, order_id) 
PARTITION BY RANGE(`dt`) 
( PARTITION p20210820 VALUES [('2021-08-20'), ('2021-08-21')), 
PARTITION p20210821 VALUES [('2021-08-21'), ('2021-08-22')) ) 
DISTRIBUTED BY HASH(order_id)
PROPERTIES (
    "replication_num" = "3", 
    "enable_persistent_index" = "true"
);
Example 1: Create a non-partitioned materialized view.
-- create an unpartitioned materialized view sorted by lo_custkey
CREATE MATERIALIZED VIEW lo_mv1
DISTRIBUTED BY HASH(`lo_orderkey`)
ORDER BY `lo_custkey`
REFRESH ASYNC
AS
select
    lo_orderkey, 
    lo_custkey, 
    sum(lo_quantity) as total_quantity, 
    sum(lo_revenue) as total_revenue, 
    count(lo_shipmode) as shipmode_count
from lineorder 
group by lo_orderkey, lo_custkey;
Example 2: Create a partitioned materialized view.
-- create a partitioned materialized view partitioned by `lo_orderdate` and sorted by `lo_custkey`.
CREATE MATERIALIZED VIEW lo_mv2
PARTITION BY `lo_orderdate`
DISTRIBUTED BY HASH(`lo_orderkey`)
ORDER BY `lo_custkey`
REFRESH ASYNC START('2023-07-01 10:00:00') EVERY (interval 1 day)
AS
select
    lo_orderkey,
    lo_orderdate,
    lo_custkey, 
    sum(lo_quantity) as total_quantity, 
    sum(lo_revenue) as total_revenue, 
    count(lo_shipmode) as shipmode_count
from lineorder 
group by lo_orderkey, lo_orderdate, lo_custkey;
-- Use the date_trunc() function to partition the materialized view by month.
CREATE MATERIALIZED VIEW order_mv1
PARTITION BY date_trunc('month', `dt`)
DISTRIBUTED BY HASH(`order_id`)
REFRESH ASYNC START('2023-07-01 10:00:00') EVERY (interval 1 day)
AS
select
    dt,
    order_id,
    user_id,
    sum(cnt) as total_cnt,
    sum(revenue) as total_revenue, 
    count(state) as state_count
from orders
group by dt, order_id, user_id;
Example 3: Create an asynchronous materialized view.
CREATE MATERIALIZED VIEW flat_lineorder
DISTRIBUTED BY HASH(`lo_orderkey`)
REFRESH MANUAL
AS
SELECT
    l.LO_ORDERKEY AS LO_ORDERKEY,
    l.LO_LINENUMBER AS LO_LINENUMBER,
    l.LO_CUSTKEY AS LO_CUSTKEY,
    l.LO_PARTKEY AS LO_PARTKEY,
    l.LO_SUPPKEY AS LO_SUPPKEY,
    l.LO_ORDERDATE AS LO_ORDERDATE,
    l.LO_ORDERPRIORITY AS LO_ORDERPRIORITY,
    l.LO_SHIPPRIORITY AS LO_SHIPPRIORITY,
    l.LO_QUANTITY AS LO_QUANTITY,
    l.LO_EXTENDEDPRICE AS LO_EXTENDEDPRICE,
    l.LO_ORDTOTALPRICE AS LO_ORDTOTALPRICE,
    l.LO_DISCOUNT AS LO_DISCOUNT,
    l.LO_REVENUE AS LO_REVENUE,
    l.LO_SUPPLYCOST AS LO_SUPPLYCOST,
    l.LO_TAX AS LO_TAX,
    l.LO_COMMITDATE AS LO_COMMITDATE,
    l.LO_SHIPMODE AS LO_SHIPMODE,
    c.C_NAME AS C_NAME,
    c.C_ADDRESS AS C_ADDRESS,
    c.C_CITY AS C_CITY,
    c.C_NATION AS C_NATION,
    c.C_REGION AS C_REGION,
    c.C_PHONE AS C_PHONE,
    c.C_MKTSEGMENT AS C_MKTSEGMENT,
    s.S_NAME AS S_NAME,
    s.S_ADDRESS AS S_ADDRESS,
    s.S_CITY AS S_CITY,
    s.S_NATION AS S_NATION,
    s.S_REGION AS S_REGION,
    s.S_PHONE AS S_PHONE,
    p.P_NAME AS P_NAME,
    p.P_MFGR AS P_MFGR,
    p.P_CATEGORY AS P_CATEGORY,
    p.P_BRAND AS P_BRAND,
    p.P_COLOR AS P_COLOR,
    p.P_TYPE AS P_TYPE,
    p.P_SIZE AS P_SIZE,
    p.P_CONTAINER AS P_CONTAINER FROM lineorder AS l 
INNER JOIN customer AS c ON c.C_CUSTKEY = l.LO_CUSTKEY
INNER JOIN supplier AS s ON s.S_SUPPKEY = l.LO_SUPPKEY
INNER JOIN part AS p ON p.P_PARTKEY = l.LO_PARTKEY;
Example 4: Create a partitioned materialized view and use str2date to transform the STRING type partition key of the base table into the date type as for the materialized view.
-- Hive Table with string partition column.
CREATE TABLE `part_dates` (
  `d_date` varchar(20) DEFAULT NULL,
  `d_dayofweek` varchar(10) DEFAULT NULL,
  `d_month` varchar(11) DEFAULT NULL,
  `d_year` int(11) DEFAULT NULL,
  `d_yearmonthnum` int(11) DEFAULT NULL,
  `d_yearmonth` varchar(9) DEFAULT NULL,
  `d_daynuminweek` int(11) DEFAULT NULL,
  `d_daynuminmonth` int(11) DEFAULT NULL,
  `d_daynuminyear` int(11) DEFAULT NULL,
  `d_monthnuminyear` int(11) DEFAULT NULL,
  `d_weeknuminyear` int(11) DEFAULT NULL,
  `d_sellingseason` varchar(14) DEFAULT NULL,
  `d_lastdayinweekfl` int(11) DEFAULT NULL,
  `d_lastdayinmonthfl` int(11) DEFAULT NULL,
  `d_holidayfl` int(11) DEFAULT NULL,
  `d_weekdayfl` int(11) DEFAULT NULL,
  `d_datekey` varchar(11) DEFAULT NULL
) partition by (d_datekey);
-- Create the materialied view  with `str2date`.
CREATE MATERIALIZED VIEW IF NOT EXISTS `test_mv` 
PARTITION BY str2date(`d_datekey`,'%Y%m%d')
DISTRIBUTED BY HASH(`d_date`, `d_month`, `d_month`) 
REFRESH MANUAL 
AS
SELECT
`d_date` ,
  `d_dayofweek`,
  `d_month` ,
  `d_yearmonthnum` ,
  `d_yearmonth` ,
  `d_daynuminweek`,
  `d_daynuminmonth`,
  `d_daynuminyear` ,
  `d_monthnuminyear` ,
  `d_weeknuminyear` ,
  `d_sellingseason`,
  `d_lastdayinweekfl`,
  `d_lastdayinmonthfl`,
  `d_holidayfl` ,
  `d_weekdayfl`,
   `d_datekey`
FROM
 `hive_catalog`.`ssb_1g_orc`.`part_dates` ;
Example 5: Create a partition materialized view with a specific sort key:
CREATE MATERIALIZED VIEW lo_mv2
PARTITION BY `lo_orderdate`
DISTRIBUTED BY HASH(`lo_orderkey`)
ORDER BY `lo_custkey`
REFRESH ASYNC START('2023-07-01 10:00:00') EVERY (interval 1 day)
AS
select
    lo_orderkey,
    lo_orderdate,
    lo_custkey, 
    sum(lo_quantity) as total_quantity, 
    sum(lo_revenue) as total_revenue, 
    count(lo_shipmode) as shipmode_count
from lineorder 
group by lo_orderkey, lo_orderdate, lo_custkey;