Trino

Trino is a distributed SQL query engine for big data analytics.

About

Trino is a distributed SQL query engine designed for fast analytic queries against data of any size. It allows you to query data where it lives, including Hive, Cassandra, relational databases or even proprietary data stores.

Available Tools

Requirements

Trino Cluster

You need access to a running Trino cluster with appropriate user permissions for the catalogs and schemas you want to query.

Example

sources:
    my-trino-source:
    kind: trino
    host: trino.example.com
    port: "8080"
    user: ${TRINO_USER}  # Optional for anonymous access
    password: ${TRINO_PASSWORD}  # Optional
    catalog: hive
    schema: default

Tip

Use environment variable replacement with the format ${ENV_NAME} instead of hardcoding your secrets into the configuration file.

Reference

fieldtyperequireddescription
kindstringtrueMust be “trino”.
hoststringtrueTrino coordinator hostname (e.g. “trino.example.com”)
portstringtrueTrino coordinator port (e.g. “8080”, “8443”)
userstringfalseUsername for authentication (e.g. “analyst”). Optional for anonymous access.
passwordstringfalsePassword for basic authentication
catalogstringtrueDefault catalog to use for queries (e.g. “hive”)
schemastringtrueDefault schema to use for queries (e.g. “default”)
queryTimeoutstringfalseQuery timeout duration (e.g. “30m”, “1h”)
accessTokenstringfalseJWT access token for authentication
kerberosEnabledbooleanfalseEnable Kerberos authentication (default: false)
sslEnabledbooleanfalseEnable SSL/TLS (default: false)