🗃️ Catalog
10 items
📄️ File external table
This topic describes how to use file external tables to directly query Parquet and ORC data files in AWS S3.
📄️ Data Cache
This topic describes the working principles of Data Cache and how to enable Data Cache to improve query performance on external data. From v3.3.0, Data Cache is enabled by default.
📄️ Data cache warmup
Some data lake analytics and shared-data cluster scenarios have high performance requirements for queries, such as BI reports and proof of concept (PoC) performance testing. Loading remote data into local data cache can avoid the need to fetch the same data multiple times, significantly speeding up query execution and minimizing resource usage.