Prerequisites (Databricks)
Platform

Each version of Unravel has specific platform requirements. See Unravel's Azure Databricks compatibility matrix to confirm that your Azure Databricks platform meets the requirements for the version of Unravel that you are installing.

Hardware
  • Suggested minimum VM type: a medium memory-optimized size, such as Standard_E8s_v3.

  • EC2 instance type:

    • Minimum: r4.2xlarge (61 GiB RAM)

    • Maximum: r4.8xlarge (244 GiB RAM)

    • Recommended: r4.4xlarge (122 GiB RAM)

    • Virtualization type: HVM

Permissions
  • You must already have an Azure account.

  • You must already have a resource group assigned to a region to group your policies, VMs, and storage blobs/lakes/drives.

    A resource group is a container that holds related resources for an Azure solution. In Azure, you logically group related resources such as storage accounts, virtual networks, and virtual machines (VMs) to deploy, manage, and maintain them as a single entity.

  • You must have root privilege to run commands on the VM.

Network
  • Your virtual network and subnet(s) must be big enough to be shared by the Unravel VM and the target Databricks cluster(s).

  • You can use an existing virtual network or create a new one, but the virtual network must be in the same region and same subscription as the Azure Databricks workspace that you plan to create.

  • The virtual network requires a CIDR range between /16 and /24.

  • Assign a public IP address to the Unravel Azure VM and open port 4043 for non-SSL and 4443 for unsecured SSL.

  • Allow inbound SSH connections to the Unravel VM.

  • You must allow outbound Internet access and all traffic within the subnet (VNet).
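
The CIDR requirement above can be checked before you create or reuse a virtual network. The following is a minimal Python sketch, not part of Unravel or Azure tooling. The function name `vnet_cidr_is_valid` is hypothetical, and the sketch assumes that the /16 to /24 range is inclusive and that the virtual network uses an IPv4 address space.

```python
import ipaddress

# Assumption: the documented range is inclusive at both ends.
MIN_PREFIX = 16  # largest allowed network (/16)
MAX_PREFIX = 24  # smallest allowed network (/24)


def vnet_cidr_is_valid(cidr: str) -> bool:
    """Return True if cidr is a well-formed IPv4 network with a /16 to /24 prefix.

    Hypothetical helper for illustration only.
    """
    try:
        # strict=True rejects addresses with host bits set, e.g. 10.0.0.1/16.
        network = ipaddress.ip_network(cidr, strict=True)
    except ValueError:
        return False
    return network.version == 4 and MIN_PREFIX <= network.prefixlen <= MAX_PREFIX


if __name__ == "__main__":
    for candidate in ("10.0.0.0/16", "10.1.2.0/24", "10.0.0.0/8", "10.1.2.0/28"):
        status = "ok" if vnet_cidr_is_valid(candidate) else "outside /16 to /24"
        print(f"{candidate}: {status}")
```

This checks only the prefix length. It does not confirm that the address space is large enough for both the Unravel VM and the Databricks clusters, which depends on your cluster sizes.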