Prerequisites (Databricks)
Platform
Each version of Unravel has specific platform requirements. See Unravel's Azure Databricks compatibility matrix to confirm that your Azure Databricks platform meets the requirements for the version of Unravel that you are installing.
Hardware
Suggested minimum Azure VM type: a medium memory-optimized size such as Standard_E8s_v3.
EC2 instance type:
Minimum: r4.2xlarge (61 GiB RAM)
Maximum: r4.8xlarge (244 GiB RAM)
Recommended: r4.4xlarge (122 GiB RAM)
Virtualization type: HVM
Permissions
You must already have an Azure account.
You must already have a resource group assigned to a region to group your policies, VMs, and storage blobs/lakes/drives.
A resource group is a container that holds related resources for an Azure solution. In Azure, you logically group related resources such as storage accounts, virtual networks, and virtual machines (VMs) to deploy, manage, and maintain them as a single entity.
You must have root privilege to run commands on the VM.
Network
Your virtual network and subnet(s) must be large enough to be shared by the Unravel VM and the target Databricks cluster(s).
You can use an existing virtual network or create a new one, but the virtual network must be in the same region and same subscription as the Azure Databricks workspace that you plan to create.
The virtual network requires a CIDR range with a prefix length between /16 and /24.
Assign a public IP address to the Unravel Azure VM and open port 4043 for non-SSL and 4443 for unsecured SSL.
Allow inbound SSH connections to the Unravel VM.
You must allow outbound Internet access and all traffic within the subnet (VNet).
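The CIDR requirement for the virtual network can be checked before deployment. The following is a minimal sketch using Python's standard ipaddress module. The function name vnet_prefix_ok and the sample address ranges are hypothetical and chosen only for illustration; they are not part of Unravel or Azure tooling.

```python
import ipaddress


def vnet_prefix_ok(cidr: str, min_prefix: int = 16, max_prefix: int = 24) -> bool:
    """Return True if the CIDR block's prefix length is between /16 and /24.

    Hypothetical helper: the bounds mirror the requirement stated above.
    """
    # strict=True raises ValueError if the address has host bits set,
    # for example "10.0.0.5/16" instead of "10.0.0.0/16".
    network = ipaddress.ip_network(cidr, strict=True)
    return min_prefix <= network.prefixlen <= max_prefix


if __name__ == "__main__":
    # Sample ranges are assumptions, not values from your environment.
    for candidate in ("10.0.0.0/16", "10.1.2.0/24", "10.0.0.0/8", "10.0.0.0/28"):
        print(candidate, vnet_prefix_ok(candidate))
```

A /8 block is rejected because it is larger than the allowed range, and a /28 block is rejected because it is smaller. Replace the sample ranges with the address space you plan to assign to the virtual network.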