Data Export is available as an add-on feature to select plans. To learn more, read about our pricing. To add Data Export to your plan, contact Sales.
This topic explains how to create and test a Databricks destination for Data Export. Databricks is a cloud-based data processing and analysis platform that lets you work with large sets of data. By exporting your LaunchDarkly experiment data to the same Databricks warehouse as your other data, you can build custom reports in Databricks to answer product behavior questions.
You can also run experiments using warehouse native metrics. To learn more, read Creating experiments using warehouse native metrics.
To configure the Databricks Data Export integration, you must have the following prerequisites:
35.192.85.117We recommended that you use a Databricks Service principal to provide LaunchDarkly service access to your warehouse.
To create a new service principal, follow the Databricks instructions to Add service principals to your account.
Ensure that the principal has Workspace access and Databricks SQL access entitlements. To learn how, read the Databricks documentation about Access entitlements.
Next, generate and provide an access token for the service principal. You can create an access token using the Databricks REST API. To learn how, read the Databricks REST API documentation.
You must provide specific values to the REST API. In the following example, set these values:
databricks-instance with your Databricks workspace URL. For example, dbc-abcd1234-5678.cloud.databricks.com.your-existing-access-token with an existing valid PAT (string) that has permissions to create new tokens.Provide the values for these parameters:
Omitting lifetime_seconds sets the lifetime to the maximum allowed by the workspace configuration.
You will provide the access token later when setting up the integration in LaunchDarkly.
Now, follow the Databricks instructions to Create a new SQL warehouse.
Under permissions, assign the Can use permission to the service principal you created above.
Under Connection details, find and save the Server hostname and HTTP path of the warehouse. You will need these when setting up the integration in LaunchDarkly.
Then, give the service principal you created the appropriate permissions for your destination catalog and destination schema.
For your destination catalog, give the service principal the USE CATALOG permission.
For your destination schema, give the service principal the following grants:
USE SCHEMAAPPLY TAGMODIFYREAD VOLUMESELECTWRITE VOLUMECREATE MATERIALIZED VIEWCREATE TABLECREATE VOLUMEDatabricks selects “Unity Catalog” for metastore type by default. If your workspace uses the legacy Hive metastore, select it instead and provide the required S3 bucket details and access keys.
Finally, in LaunchDarkly, configure the Databricks Data Export integration.
To do this:
Your Databricks Data Export integration is now complete.
To view the different event kinds for Databricks Data Export destinations, read Warehouse Data Export schema reference.