Installing Unravel on cloud platforms using Ansible
You can install Unravel on Amazon EMR, Databricks (Azure, AWS), and Google cloud platform (Dataproc, BigQuery) using Ansible.
Navigate to the
ansible-installation
directory and create a copy of thevars_template.yml
file.cp vars_template.yml vars.yml
Edit
vars.yml
by using any editor. For example, the vi editor.vi vars.yml
Update the value for the
cloud_platform
parameter.# Update any required fields # Must set cloud_platform=emr/dataproc/databricks
The
vars.yml
file contains deployment parameters. You can customize these parameters for installing and deploying Unravel on cloud platforms.From the Unravel server, run the Ansible playbook.
ansible-playbook -i inventories/ install_unravel.yml -e @vars.yml -vvv
Applies all values specified in the
vars.yml
to the Ansible playbook. See Verify Unravel installation using Ansible.(Optional) Update the
cloud_node
inventory if you do not run the Ansible playbook from the Unravel server.Command
vi inventories/cloud_node/hosts
Output
# Example hosts file # [cloud_node] # abc.unraveldata.com ansible_user=unravel ansible_ssh_private_key_file=/root/.ssh/id_rsa
Define the hive metastore properties in a
.txt
file for Databricks. See Configuring Hive Metastore (Cloud).Define the hive metastore properties in a
.txt
file for Dataproc. See Configuring Hive Metastore (Cloud).Define the hive metastore properties in a
.txt
file for EMR. See Configuring Hive Metastore (Cloud) and Configuring external hive metastore (EMR).Specify the BigQuery configuration in Unravel. See Add BigQuery details in Unravel.
Configure the EMR cluster in Unravel. See Connect a new or existing EMR cluster to Unravel.
Configure the Dataproc cluster in Unravel. See Connecting Unravel Server to a new Dataproc cluster.
Specify the path of a
.txt
file in theprop_file_paths
variable. Ensure that the.txt
file is readable by the Unravel user.ansible-playbook -i inventories/ install_unravel.yml -e @vars.yml -vvv --tag hive_metastore