Skip to main content

Connect to Databricks

Securely connect your Hex workspace to your Databricks database.

info
  • Users need the Admin workspace role to create a shared workspace data connection.
  • Users need Can Edit or higher project permissions to create project data connections.
  • Hex currently only supports the legacy JDBC driver where the JDBC URL starts with jdbc:spark.

Databricks is a data analytics platform built on top of Apache Spark.

Databricks allows you to combine the robustness of a data warehouse with the flexibility of a data lake via their lakehouse architecture. It provides a unified platform that offers a wide variety of services, allowing you to store and process large amounts of data quickly and efficiently, including real-time analytics.

Databricks can be especially useful for machine learning/artificial intelligence, data engineering, and data science applications.

How to get set up

  1. Set up a Databricks account and create a workspace if you don't already have one.
  2. Create and configure a Databricks warehouse.
  3. Locate your JDBC URL.
  4. Generate an access token in Databricks for a user with access to the data you want to query.
  5. In Hex, go to SettingsData sources.
  6. Click + Connection, select Databricks, and fill out the fields below.

Basic settings

  1. Name and Description - Set a display name and optional description to help identify your data connection.
Databricks data connection name and description
  1. JDBC URL (instructions to locate)
  2. Access token (instructions to generate)
Databricks data connection database and authentication credentials

Additional settings

The data connection form includes several optional sections:

tip

If you use a firewall to restrict database access, you'll need to add Hex's IP addresses to your allowlist.