Home

Reference Guide

This reference covers both on-premises and cloud deployments. Some daemons and properties only apply to on-premises deployments.

Daemons

The Unravel service is composed of many daemons which are summarized in the next table. The suffix _N means 1, 2, 3, or 4 separate daemons.

Daemon Logical Name

Description

auto_action

AutoAction worker

datastore

Datastore REST API HTTP server

elasticsearch_N

Elasticsearch

event_worker_N

Event worker

healthcheck

Health check

hitdoc_loader

Hitdoc Loader

hive_worker

Hive worker

hive_worker

Hive Hook Worker EMR

host_monitor_N

Host Monitor

kafka

Bundled Kafka (on a custom port)

kafka_monitor

Kafka Monitor

log_receiver

Log Receiver

log_rotate

metric_analyzer_N

Metrics Analyzer

mysql

MySQL

ngui

aNGular Web UI

ondemand_flask

ondemand_tasks

oozie_sensor

Oozie v3 sensor

spark_worker_N

Spark Worker

table_worker

Table Worker

tidydir

Tidy Dir cleans up and archives hdfs directories, db retention cleaner.

unravel_ca

Cluster Access Service - provides access to cluster services and files (Unravel Edge node).

unravel_db

Databricks sensor (Unravel remote node)

unravel_emdb

Embedded metrics DB

unravel_es

EMR sensor (Unravel remote node)

unravel_ja

Job Analyzer summarizes jobs

unravel_jcse2

Job Collector sensor YARN for EMR

unravel_sensor_N

Universal sensor/Impala

yarn_jc_sensor

Job Collector sensor YARN

yarn_jc_worker_N

Job Collector sensor worker YARN

zookeeper_N

Bundled Zookeeper (on a custom port)

Directories and files

The following is a cross-reference of notable directories and files used by Unravel Server:

Adjustable properties

The file unravel.properties contains settings that can be preserved during an RPM upgrade. These properties are described in the following table:

Property Type/Property Name/Description

Default Value

General Unravel

com.unraveldata.tmpdir

The base location for Unravel process control files where Unravel's temp files reside.

/srv/unravel/tmp

HDFS

com.unraveldata.hdfs.batch.monitoring.interval.sec

Number of seconds between checks for presence of hive queries and MR logs to load into Unravel for batch visibility; should be between 300 and 1800 (inclusive).

300

com.unraveldata.hdfs.interactive.monitoring.interval.sec

Number of seconds between checks for presence of hive queries and MR logs to load into Unravel for interactive visibility; should be between 5 and 60 (inclusive).

30

Zookeeper

com.unraveldata.zk.quorum

Embedded Zookeeper ensemble in form host1:port1,host2:port2.

127.0.0.1:4181