Cloudera Data Platform support

Use the Starburst Hive connector to query Cloudera Data Platform (CDP) version 7.1 or higher.

Note

Cloudera Data Platform support requires a valid Starburst Enterprise license.

Requirements

The Starburst Hive connector can query Cloudera Data Platform (CDP) 7.x. This support is available in SEP 345-e and higher. Cloudera's Distribution including Apache Hadoop (CDH) versions 6.x and earlier are not supported.

Configuration

  • Edit the properties file of a catalog that uses the Hive connector.

  • Set the metastore type to thrift-cdp7.

  • Set the metastore URI to point to your Hive metastore Thrift service.

connector.name=hive
hive.metastore=thrift-cdp7
hive.metastore.uri=thrift://cdp-master:9083
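
The three settings above can be sanity-checked before deploying a catalog file. The following is a minimal Python sketch, not part of SEP: the helper names parse_properties and check_cdp_catalog are hypothetical, and the check only covers the properties shown in this section.

```python
# Hypothetical helper: checks a catalog properties file for the CDP settings
# described above. Function names and the sample text are illustrative only.

REQUIRED = {
    "connector.name": "hive",
    "hive.metastore": "thrift-cdp7",
}


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def check_cdp_catalog(props):
    """Return a list of problems; an empty list means the basic settings match."""
    problems = []
    for key, expected in REQUIRED.items():
        actual = props.get(key)
        if actual != expected:
            problems.append(f"{key} should be {expected!r}, found {actual!r}")
    # Assumption: the metastore is reached over Thrift, as in the example above.
    uri = props.get("hive.metastore.uri", "")
    if not uri.startswith("thrift://"):
        problems.append("hive.metastore.uri should start with thrift://")
    return problems


sample = """
connector.name=hive
hive.metastore=thrift-cdp7
hive.metastore.uri=thrift://cdp-master:9083
"""
print(check_cdp_catalog(parse_properties(sample)))  # prints []
```

An empty list means the file contains the three properties with the expected values. The check does not contact the metastore or verify that the host is reachable.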

SQL support

When you use the Starburst Hive connector with Cloudera Data Platform, the following details apply in addition to the SQL support available in the connector:

Reading data

CDP support includes read operations on the following tables:

  • compacted tables

  • bucketed tables

  • partitioned tables

  • unpartitioned tables

The following file formats can be read:

  • Avro

  • CSV

  • ORC ACID

  • Parquet

  • RCFile

Writing data

Write operations, such as INSERT, DELETE and UPDATE, on ORC ACID tables are not supported.
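
A client application can avoid sending such statements in the first place. The following Python sketch shows one possible client-side guard. It is an illustration only: the function name is_unsupported_write is hypothetical, the caller is assumed to already know whether the target table is an ORC ACID table, and only the three statement types named above are checked.

```python
import re

# Assumption: only the statement types named in the text are treated as writes;
# other write statements are outside the scope of this sketch.
WRITE_KEYWORDS = {"INSERT", "DELETE", "UPDATE"}


def is_unsupported_write(sql, table_is_orc_acid):
    """Return True if sql starts with a write keyword and targets an ORC ACID table."""
    match = re.match(r"\s*(\w+)", sql)
    keyword = match.group(1).upper() if match else ""
    return table_is_orc_acid and keyword in WRITE_KEYWORDS


# Hypothetical statements; the table name is illustrative.
print(is_unsupported_write("INSERT INTO orders VALUES (1)", True))
print(is_unsupported_write("SELECT * FROM orders", True))
```

The guard inspects only the first keyword of the statement, so it is a coarse pre-check and not a SQL parser.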