Trino

Trino is a distributed SQL query engine for big data analytics.

About

Trino is designed for fast analytic queries against data of any size. It lets you query data where it lives, including Hive, Cassandra, relational databases, and proprietary data stores.

Available Tools

Requirements

Trino Cluster

You need access to a running Trino cluster with appropriate user permissions for the catalogs and schemas you want to query.

Example

kind: sources
name: my-trino-source
type: trino
host: trino.example.com
port: "8080"
user: ${TRINO_USER}  # Optional for anonymous access
password: ${TRINO_PASSWORD}  # Optional
catalog: hive
schema: default

Tip

Use environment variable replacement with the format ${ENV_NAME} instead of hardcoding your secrets into the configuration file.

Reference

| field | type | required | description |
|---|---|---|---|
| type | string | true | Must be “trino”. |
| host | string | true | Trino coordinator hostname (e.g. “trino.example.com”) |
| port | string | true | Trino coordinator port (e.g. “8080”, “8443”) |
| user | string | false | Username for authentication (e.g. “analyst”). Optional for anonymous access. |
| password | string | false | Password for basic authentication |
| catalog | string | true | Default catalog to use for queries (e.g. “hive”) |
| schema | string | true | Default schema to use for queries (e.g. “default”) |
| queryTimeout | string | false | Query timeout duration (e.g. “30m”, “1h”) |
| accessToken | string | false | JWT access token for authentication |
| kerberosEnabled | boolean | false | Enable Kerberos authentication (default: false) |
| sslEnabled | boolean | false | Enable SSL/TLS (default: false) |
| disableSslVerification | boolean | false | Skip SSL/TLS certificate verification (default: false) |
| sslCertPath | string | false | Path to a custom SSL/TLS certificate file |
| sslCert | string | false | Custom SSL/TLS certificate content |
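
The optional fields above can be combined with the required ones. The following is a minimal sketch of a TLS-enabled source that authenticates with a JWT and sets a query timeout. The source name, the `TRINO_ACCESS_TOKEN` variable name, and the certificate path are hypothetical placeholders, and it is an assumption that your coordinator accepts JWT authentication on port 8443; adjust them to match your cluster.

```yaml
kind: sources
name: my-secure-trino-source         # hypothetical name
type: trino
host: trino.example.com
port: "8443"                         # assumed HTTPS port on the coordinator
catalog: hive
schema: default
queryTimeout: "30m"                  # duration string, e.g. "30m" or "1h"
accessToken: ${TRINO_ACCESS_TOKEN}   # hypothetical env var holding the JWT
sslEnabled: true
sslCertPath: /path/to/ca.pem         # hypothetical path to a custom certificate
```

Leaving `disableSslVerification` at its default of false keeps certificate verification on; supplying `sslCertPath` (or `sslCert` with the certificate content) is an alternative to disabling verification when the coordinator uses a private certificate authority.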