Installing Unravel on cloud platforms using Ansible
You can install Unravel on Amazon EMR, Databricks (Azure, AWS), and Google cloud platform (Dataproc, BigQuery) using Ansible.
Navigate to the
ansible-installationdirectory and create a copy of thevars_template.ymlfile.cp vars_template.yml vars.yml
Edit
vars.ymlby using any editor. For example, the vi editor.vi vars.yml
Update the value for the
cloud_platformparameter.# Update any required fields # Must set cloud_platform=emr/dataproc/databricks
The
vars.ymlfile contains deployment parameters. You can customize these parameters for installing and deploying Unravel on cloud platforms.From the Unravel server, run the Ansible playbook.
ansible-playbook -i inventories/ install_unravel.yml -e @vars.yml -vvv
Applies all values specified in the
vars.ymlto the Ansible playbook. See Verify Unravel installation using Ansible.(Optional) Update the
cloud_nodeinventory if you do not run the Ansible playbook from the Unravel server.Command
vi inventories/cloud_node/hosts
Output
# Example hosts file # [cloud_node] # abc.unraveldata.com ansible_user=unravel ansible_ssh_private_key_file=/root/.ssh/id_rsa
Define the hive metastore properties in a
.txtfile for Databricks. See Configuring Hive Metastore (Cloud).Define the hive metastore properties in a
.txtfile for Dataproc. See Configuring Hive Metastore (Cloud).Define the hive metastore properties in a
.txtfile for EMR. See Configuring Hive Metastore (Cloud) and Configuring external hive metastore (EMR).Specify the BigQuery configuration in Unravel. See Add BigQuery details in Unravel.
Configure the EMR cluster in Unravel. See Connect a new or existing EMR cluster to Unravel.
Configure the Dataproc cluster in Unravel. See Connecting Unravel Server to a new Dataproc cluster.
Specify the path of a
.txtfile in theprop_file_pathsvariable. Ensure that the.txtfile is readable by the Unravel user.ansible-playbook -i inventories/ install_unravel.yml -e @vars.yml -vvv --tag hive_metastore