9.4. Starburst Hive Connector#

The Starburst Hive Connector is an extended version of the Hive Connector with configuration and usage identical.

The following improvements are included:

HDFS Permissions#

Before running any CREATE TABLE or CREATE TABLE ... AS statements for Hive tables in Presto, you need to check that the operating system user running the Presto server has access to the Hive warehouse directory on HDFS.

The Hive warehouse directory is specified by the configuration variable hive.metastore.warehouse.dir in hive-site.xml, and the default value is /user/hive/warehouse. If that is not the case, either add the following to jvm.config on all of the nodes: -DHADOOP_USER_NAME=USER, where USER is an operating system user that has proper permissions for the Hive warehouse directory, or start the Presto server as a user with similar permissions. The hive user generally works as USER, since Hive is often started with the hive user. If you run into HDFS permissions problems on CREATE TABLE ... AS, remove /tmp/presto-* on HDFS, fix the user as described above, then restart all of the Presto servers.

Apache Sentry-Based Authorization#

The connector supports Apache Sentry usage for authorization with the details documented in Apache Sentry Based Authorization.

Transactional and ORC ACID Tables#

When connecting to Hive metastore version 3, the Hive connector supports reading from the following types of transactional tables:

  • insert-only and ACID,
  • partitioned and not partitioned,
  • bucketed and not bucketed.

Materialized Views#

The Hive connector supports reading from Hive materialized views. In Presto, these views are presented as regular, read-only tables.

Amazon Glue Support#

Statistics collection is supported for Hive Metastore and Amazon Glue.

Configuring and using Presto with AWS Glue is described in the AWS Glue Support documentation section.

HDFS Erasure Coding#

Hive connector supports Hadoop 3’s HDFS Erasure Coding.

Limitations#

The following limitation apply in addition to the limitations of the Hive Connector.

  • Writing to and creation of transactional tables is not supported.
  • Reading ORC ACID tables created with Hive Streaming ingest is not supported.