Home

Part 2: Connecting Unravel to a Databricks cluster

Using the Azure Databricks UI or clusters API, connect Unravel to the Databricks jobs/cluster you want Unravel to monitor:

Add a cluster node initialization script:
- On the cluster configuration page, click Advanced Options.
- At the bottom of the page, click the Init Scripts tab.
- In the Destination drop-down, select a destination type.
- Specify the path to the initialization script as dbfs:/databricks/unravel/unravel-db-sensor-archive/dbin/install-unravel.sh
- Click Add.
Configure Spark:
- On the cluster configuration page, click Advanced Options.
- At the bottom of the page, click the Spark tab.
- Paste the snippet generated by databricks_setup.sh into the text box.
- Click Add.
Note
If you submit Spark jobs through spark-submit, you can't configure Spark this way.
Instead, you have to use spark-submit parameters with the snippet as provided by databricks_setup.sh.
Connect to cluster logs:
To configure the log delivery location:
- On the cluster configuration page, click Advanced Options.
- At the bottom of the page, click the Logging tab.
- Select a destination type DBFS.
- Enter the cluster log path.

Next steps

Enable additional instrumentation and configure optional settings.

In this section:

Was this helpful?

Would you like to provide feedback? Just click here to suggest edits.