v4.5.2 Release notes

Focused on Amazon EMR, Azure HDInsight, and Azure Databricks

Software version

Release Date: 07/15/2019

See v4.5.2.0 for download information.

Software upgrade support

Upgrade from 4.5.0.5 only. See here for specific upgrade information.

Certified platforms

Please review your platform's compatibility matrix below before you upgrade/install.

Azure Databricks

	Databricks runtime version	5.4	5.3
Unravel VM	CentOS/ RHEL	7.4	7.4

Applications	Spark	2.4.1, 2.2	2.4.1, 2.2

Python		3.0, 2.7	3.0, 2.7

Browsers	Chrome	75.x	75.x
Browsers	Firefox	66	66

EMR

	Platform version	5.24.x	5.23.x	5.22.x	5.21.x	5.20.x
Base OS	CentOS	7.4	7.4	7.4	7.4	7.4

Applications	Hadoop	2.8.5	2.8.5	2.8.5	2.8.5	2.8.5
	HBase	1.4.9	1.4.9	1.4.9	1.4.8	1.4.8
	Hive	2.3.4	2.3.4	2.3.4	2.3.4	2.3.4
	MR	MR v2	MR v2	MR v2	MR v2	MR v2
	Oozie	5.1.0	5.1.0	5.1.0	5.0.0	5.0.0
	Spark	2.4.2	2.4.0	2.4.0	2.4.0	2.4.0
	Tez	0.9.1	0.9.1	0.9.1	0.9.1	0.9.1

Storage type	HDFS	✓	✓	✓	✓	✓
Storage type	S3 (Only Spark event logs)	✓	✓	✓	✓	✓

Unravel DB	MySQL	5.7, 8	5.7, 8	5.7, 8	5.7, 8	5.7, 8
Unravel DB	MySQL Client JAR	5.1.47, 8.0.16	5.1.47, 8.0.16	5.1.47, 8.0.16	5.1.47, 8.0.16	5.1.47, 8.0.16

JAVA version		1.8.0_191	1.8.0_191	1.8.0_191	1.8.0_191	1.8.0_191

Browsers	Chrome	70.x	70.x	70.x	70.x	70.x
Browsers	Firefox	66	66	66	66	66

Unravel sensor		1.3.11.3	1.3.11.3	1.3.11.3	1.3.11.3	1.3.11.3

HDI

	Platform version	4.0	3.6
Base OS	CentOS/ RHEL	7.4	7.4

Applications	Hadoop	3.1.0	2.7.3
	HBase	2	1.1.2
	Hive	3.1.0	2.1.0
	Kafka	1.1	1.0.0
	MR	MR v2	MR v2
	Oozie	4.3.1	4.3.1
	Spark	2.3.1	2.1, 2.2
	Tez	0.9.1	0.8.0

Unravel DB	MySQL	5.7, 8	5.5 - 5.7
Unravel DB	MySQL Client JAR	5.1.47, 8.0.16	5.1.47

Storage type	WASB	✓	✓
	ABFS	✓	✓
	ADLS Gen 1	✓	✓

JAVA version		1.8.0_191	1.8.0_191

Browsers	Chrome	70.x	70.x
Browsers	Firefox	66	66

Unravel sensor		1.3.11.3	1.3.11.3

Unravel sensor upgrade

A sensor upgrade is required if you are upgrading from Unravel v4.5.0.5.

Updates to Unravel's configuration properties

See v4.5.x - Updates to Unravel Properties.

Unsupported

All On-Demand Features

Sessions
Reports
- Capacity Forecasting
- Migration Planning
- Cluster Optimization
- File Reports
- Queue Analysis
- Small Files
- Top X

AutoActions
True “workflows”, i.e., Databricks job orchestration through services such as ADF.
Cluster Pools: we have not examined the functionality they provide or how it can affect Unravel functionality.
Interactive clusters are not supported. Only job clusters are supported, including when new job clusters are created.
The following have not been tested:
- Connecting to a Hive Metastore, which populates parts of the Data Insights tab.
- Behavior when Maximum Concurrent Runs is set to a value greater than 1.
- Spark streaming applications.
Unravel's APIs

ATSv2.0
Head node HA.
YARN aggregate logs are not captured for MR jobs. As a result:
- The Logs tab for MR jobs is empty.
- Information in the Errors tab that is derived from the logs isn't available.
AutoActions
- Kill and move actions. (AA-103)
- Rules that span multiple clusters, i.e., you cannot specify a rule that aggregates metrics from multiple clusters and monitors its violation. (AA-131)
AMI/CFN/ARM Template based installation support.
No special support for cloud-native constructs, e.g., IAM roles for RBAC.
Multi-Metastore
- A cluster that has more than one metastore, or that uses an external metastore (as opposed to the default one that comes with the cluster), is not supported.
- Also, the Unravel node can retrieve data from only one metastore at a given time (it must be configured as described in the Hive Metastore configuration documentation).
Workflows
- Airflow
Marketplace (both Azure and AWS) has older versions of Unravel, not this current 4.5.2.0 cloud release.

New Features and improvements

Bug fixes

HBase
- Table pagination is correct. (HBASE-95)
- The Tables RegionCount and AverageRegionSize columns are populated for all tables. (UIX-1673)
Kafka KPIs are correctly populated. (Customer-984)
MR
- The app Cluster ID is always listed. (PLATFORM-1430)
Platform
- Applications no longer slow down or throw Container "not starting" errors when you add new Nodes to an existing EMR cluster without an Unravel bootstrap script. (EMR-4, EMR-3)
Tez
- On HDInsight, the Data I/O is no longer missing. (TEZLLAP-265)
- DAG counters and graphs are no longer missing. (TEZLLAP-277)
- A NullPointerException is no longer raised when ATS cannot be reached. (TEZLLAP-278)
UI
- You can now filter on app names containing spaces. (CUSTOMER-652)
- The chargeback report now changes when different clusters are selected. (PLATFORM-1499)
- You can select a specific cluster for the Cluster Workload page. (UIX-1886)
- The last update for a SQL query plan now reaches the Spark worker. (USPARK-162)

Known issues

Operations
- The Apps Running and Apps Pending count is not correct on the Operations page. (DT-119)
Applications
- Applications details view
  - The Timeline view of tasks only shows the job type Spark-Submit. This is because the Event Logs are not available for other job types.
  - In the first horizontal section, the right panel tabs (Program, Task Attempts, Resource) are shown even if there is no data. Sometimes you may see a notice about missing data.
  - Intermittently the Navigation tab may have jobs with incomplete stage information. (DT-201)
  - Cluster Name and Cluster ID tooltips are incorrectly exchanged. (DT-204)
- App Name value appears as “Databricks Shell” (class name instead of the last run_name) when jobs are submitted as Spark jar tasks, Notebook tasks and Python tasks. (DT-179, DT-176)
- In some cases the Spark SQL query text and query plan mismatch. (DT-186)
- When there are driver-side exceptions in a spark-submit application, the status is still captured as SUCCESS. Driver-side exceptions are difficult to capture because Databricks does not log them in the driver log. (DT-192)
- When a Python script is executed as a spark-submit task with eventLog enabled, it hangs indefinitely. This is a Databricks bug, which we have raised in the Databricks forum.
  Workaround: do not enable eventLog for the spark-submit task that runs the Python script.
- Filter by Application Name has issues when the name contains special characters such as hyphens. By default, spark.app.name contains hyphens (e.g., app-20190712214639-0000), so this affects the majority of Spark applications in Databricks. (DT-200)
- When a running job is forcefully cancelled, the application status is unpredictable because Unravel doesn't get enough time to send messages from Brace. (DT-202)
Jobs > Details View (DT-195)
- Jobs are labeled as Workflow in several places instead of Job.
- The number of jobs is labeled as “# of YARN Apps” instead of “# OF Apps”. (DT-195)
Tagging
- The Unravel Tagging feature, as described here, is not supported. This affects RBAC as well.
Data Insights (Reports > Data Insights > Overview)
- Accessed Partitions Section is empty. (DT-175)
Miscellaneous
- For spark-submit jobs, Databricks's default Spark extraJavaOptions are overwritten by the extraJavaOptions that Unravel requires for monitoring. (DT-113)
- Application count is incorrect on the Operations page. (DT-119)
- The Graphs tab is intermittently missing. (DT-209)

AutoActions
- In multi-cluster configurations, AutoActions does not differentiate between entities of each cluster, so setting up a policy targets all monitored clusters. For instance, setting up a rule to target the root queue causes it to be monitored on all clusters. (AA-174)
  Workaround:
  - If the cluster ID is known, isolate the policy for the cluster using policy options.
- AutoActions uses the internal Hadoop cluster ID instead of the Unravel cluster ID/name. You must obtain the internal cluster ID in order to specify a Hadoop cluster in the policy options section. It can be obtained from the HDFS namenode, where it’s stored in {dfs.namenode.name.dir}/current/VERSION. (AA-150)
- In exceptionally rare cases, a transport message protocol synchronization error can cause an AutoAction to be triggered up to 180 seconds after the violation occurs. No data loss is expected.
- “RECENT EVENTS & ALERTS” shows the events across all clusters regardless of the currently selected cluster. (AA-127, AA-151)
- Application Master level metrics, such as job metrics and job counters, are not collected by the EMR sensor by default and therefore cannot be used in AA policies. Collection of AM metrics can be enabled manually using the “am-polling” option in the EMR sensor. (AA-184)
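For the internal Hadoop cluster ID mentioned in AA-150, the following is a minimal sketch of reading the clusterID entry from the namenode's VERSION file. It assumes the file uses the usual key=value layout and that you pass a plain local directory path for dfs.namenode.name.dir (not a file:// URI or a comma-separated list). The function names and the sample IDs are hypothetical and are not part of Unravel or Hadoop.

```python
from pathlib import Path


def parse_version_file(text: str) -> dict:
    """Parse the key=value lines of a namenode VERSION file into a dict."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and '#' comment/timestamp lines.
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def read_cluster_id(name_dir: str) -> str:
    """Return the clusterID stored in <name_dir>/current/VERSION.

    name_dir is assumed to be the local directory configured as
    dfs.namenode.name.dir on the namenode host.
    """
    version_path = Path(name_dir) / "current" / "VERSION"
    return parse_version_file(version_path.read_text())["clusterID"]


# Illustrative file contents; every ID below is made up.
SAMPLE = """#Mon Jul 15 10:00:00 UTC 2019
namespaceID=123456789
clusterID=CID-00000000-0000-0000-0000-000000000000
cTime=0
storageType=NAME_NODE
layoutVersion=-63
"""

print(parse_version_file(SAMPLE)["clusterID"])
```

On a real namenode you would call read_cluster_id with your own directory and use the returned value in the policy options section.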
Data insights page
- Table size will be shown as 0 if data is in object storage. (DATAPAGE-104)
- Created and Accessed Partition details are missing. (DATAPAGE-109)
- Retention Table details shows the number of partitions as zero (0). (DATAPAGE-98)
HBase
- Spaces are missing from HBase table names. (UIX-1736)
MySQL
- Exceptions occur in unravel_jcse2, unravel_td, and unravel_ja.log with an external MySQL 8.0 database. (INSTALL-233)
  You can resolve this by adding either of the following settings to unravel.properties.
  - unravel.jdbc.url.params=logger=com.mysql.cj.log.Slf4JLogger&disableMariaDbDriver
  - unravel.jdbc.url.params=disableMariaDbDriver
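If you script this change, the sketch below shows one way to set unravel.jdbc.url.params in the text of a properties file without leaving duplicate entries. The helper set_property and the sample file contents are hypothetical. The sketch works on a string so that it makes no assumption about where unravel.properties lives on your system.

```python
def set_property(text: str, key: str, value: str) -> str:
    """Return properties text with key set to value.

    An existing entry for key is replaced in place, later duplicates
    are dropped, and the entry is appended if it was not present.
    """
    new_line = f"{key}={value}"
    out = []
    replaced = False
    for line in text.splitlines():
        stripped = line.strip()
        is_entry = (
            not stripped.startswith("#")
            and stripped.partition("=")[0].strip() == key
        )
        if is_entry:
            if not replaced:
                out.append(new_line)
                replaced = True
            continue  # skip the old value and any duplicate entries
        out.append(line)
    if not replaced:
        out.append(new_line)
    return "\n".join(out) + "\n"


KEY = "unravel.jdbc.url.params"

# Made-up existing contents, for illustration only.
before = "# sample contents\nexample.other.setting=1\n"
after = set_property(before, KEY, "disableMariaDbDriver")
print(after)
```

Running the helper a second time with the same value leaves the text unchanged, so it is safe to apply repeatedly.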
Platform
- appstatus.AppInfoAccessor: Unable to update ES document. (CUSTOMER-658)
Tez
- Hive queries that do not submit any DAG have empty cluster ID and queue name. (TEZLLAP-274)
- For Hive-Tez, the event "TEZ DAG VERTEX USED TOO MANY REDUCER TASKS" incorrectly reports the current value for 'hive.exec.reducers.bytes.per.reducer' as 67108864 when the configured value is less than 67108864. (TEZLLAP-282)
UI
- Search behavior is not consistent across all search options/boxes. (CUSTOMER-341)
- The Resources charts display incorrectly when a single Unravel deployment monitors multiple EMR clusters. (Platform-1438)
- Selecting specific clusters for Cluster Summary and Cluster Compare in Reports > Operational Insights does not work. (PLATFORM-1610)
- The UI refreshes inconsistently across features. (UIX-1741, UIX-1721)
- For a large number of tables, the UI Details page freezes if you try to filter the tables by labels. (UIX-1874)
- The Resource Usage graph does not display the entire container Data graph. (UIX-1732)

For support issues, visit Unravel Support.
