v4.5.2 Release notes
Focused on Amazon EMR, Azure HDInsight, and Azure Databricks
Software version
Release Date: 07/15/2019
See v4.5.2.0 for download information.
Software upgrade support
Upgrade from 4.5.0.5 only. See here for specific upgrade information.
Certified platforms
Please review your platform's compatibility matrix below before you upgrade/install.
Databricks runtime version | | 5.4 | 5.3 |
---|---|---|---|
Unravel VM | CentOS/RHEL | 7.4 | 7.4 |
Applications | Spark | 2.4.1, 2.2 | 2.4.1, 2.2 |
Applications | Python | 3.0, 2.7 | 3.0, 2.7 |
Browsers | Chrome | 75.x | 75.x |
Browsers | Firefox | 66 | 66 |

Platform version | | 5.24.x | 5.23.x | 5.22.x | 5.21.x | 5.20.x |
---|---|---|---|---|---|---|
Base OS | CentOS | 7.4 | 7.4 | 7.4 | 7.4 | 7.4 |
Applications | Hadoop | 2.8.5 | 2.8.5 | 2.8.5 | 2.8.5 | 2.8.5 |
Applications | HBase | 1.4.9 | 1.4.9 | 1.4.9 | 1.4.8 | 1.4.8 |
Applications | Hive | 2.3.4 | 2.3.4 | 2.3.4 | 2.3.4 | 2.3.4 |
Applications | MR | MR v2 | MR v2 | MR v2 | MR v2 | MR v2 |
Applications | Oozie | 5.1.0 | 5.1.0 | 5.1.0 | 5.0.0 | 5.0.0 |
Applications | Spark | 2.4.2 | 2.4.0 | 2.4.0 | 2.4.0 | 2.4.0 |
Applications | Tez | 0.9.1 | 0.9.1 | 0.9.1 | 0.9.1 | 0.9.1 |
Storage type | HDFS | ✓ | ✓ | ✓ | ✓ | ✓ |
Storage type | S3 (Only Spark event logs) | ✓ | ✓ | ✓ | ✓ | ✓ |
Unravel DB | MySQL | 5.7, 8 | 5.7, 8 | 5.7, 8 | 5.7, 8 | 5.7, 8 |
Unravel DB | MySQL Client JAR | 5.1.47, 8.0.16 | 5.1.47, 8.0.16 | 5.1.47, 8.0.16 | 5.1.47, 8.0.16 | 5.1.47, 8.0.16 |
JAVA version | | 1.8.0_191 | 1.8.0_191 | 1.8.0_191 | 1.8.0_191 | 1.8.0_191 |
Browsers | Chrome | 70.x | 70.x | 70.x | 70.x | 70.x |
Browsers | Firefox | 66 | 66 | 66 | 66 | 66 |
Unravel sensor | | 1.3.11.3 | 1.3.11.3 | 1.3.11.3 | 1.3.11.3 | 1.3.11.3 |

Platform version | | 4.0 | 3.6 |
---|---|---|---|
Base OS | CentOS/RHEL | 7.4 | 7.4 |
Applications | Hadoop | 3.1.0 | 2.7.3 |
Applications | HBase | 2 | 1.1.2 |
Applications | Hive | 3.1.0 | 2.1.0 |
Applications | Kafka | 1.1 | 1.0.0 |
Applications | MR | MR v2 | MR v2 |
Applications | Oozie | 4.3.1 | 4.3.1 |
Applications | Spark | 2.3.1 | 2.1, 2.2 |
Applications | Tez | 0.9.1 | 0.8.0 |
Unravel DB | MySQL | 5.7, 8 | 5.5 - 5.7 |
Unravel DB | MySQL Client JAR | 5.1.47, 8.0.16 | 5.1.47 |
Storage type | WASB | ✓ | ✓ |
Storage type | ABFS | ✓ | ✓ |
Storage type | ADLS Gen 1 | ✓ | ✓ |
JAVA version | | 1.8.0_191 | 1.8.0_191 |
Browsers | Chrome | 70.x | 70.x |
Browsers | Firefox | 66 | 66 |
Unravel sensor | | 1.3.11.3 | 1.3.11.3 |

Unravel sensor upgrade
A sensor upgrade is required if you are upgrading from Unravel v4.5.0.5.
Updates to Unravel's configuration properties
Unsupported
All On-Demand Features
Sessions
Reports
Capacity Forecasting
Migration Planning
Cluster Optimization
File Reports
Queue Analysis
Small Files
Top X
AutoActions
Incorporating true “workflows”, i.e., Databricks jobs orchestration via services like ADF.
We have not examined the functionality provided by Cluster Pools and how it can affect Unravel functionality.
No support for "interactive clusters"; only Job clusters are supported, including when New Job Clusters are created.
The following have not been tested:
Connecting to a Hive Metastore which populates parts of the Data Insights tab.
Behavior on setting Maximum Concurrent Runs value to > 1.
Spark streaming applications.
Unravel's APIs
ATSv2.0
Head node HA.
YARN aggregate logs are not captured for MR jobs. Therefore, the Logs tab for MR jobs is empty.
Information in the Errors tab which is derived from the logs isn't available.
AutoActions
Kill and move actions. (AA-103)
Rules that span multiple clusters, i.e., you cannot specify a rule that aggregates metrics from multiple clusters and monitors them for violations. (AA-131)
AMI/CFN/ARM Template based installation support.
No special support for cloud-native constructs, e.g., IAM roles for RBAC etc.
Multi-Metastore
Unsupported when a cluster has more than one metastore or uses an external metastore (as opposed to the default one that comes with the cluster).
Also, the Unravel node can retrieve data from only one metastore at a given time (it must be configured as described in the Hive Metastore configuration documentation).
Workflows
Airflow
Marketplace (both Azure and AWS) has older versions of Unravel, not this current 4.5.2.0 cloud release.
Automatic configuration when Kafka cluster scales up/down.
Linking Spark Streaming app topic to Kafka page.
New Features and improvements
Currently
Only “Job clusters” are supported so there is no support for “interactive clusters”.
Unravel is focusing on Data Engineering Use Cases only and not ad hoc usage.
Job types supported (with “New Cluster” option only): Notebook, Jar, spark-submit, and Python Task.
See Known Issues for current limitations.
AutoActions are supported and enforced across multiple clusters, including:
Collection and aggregation of metrics across multiple clusters connected to an Unravel server, as well as transient clusters. See here for currently unsupported aspects.
Routing of collected metrics to a new AutoAction daemon, which can be stopped and started at any time.
SparkSQL support in the Data Insights tab.
S3A file system metrics are now supported for EMR/HDI.
Support for auto-scaling clusters (or clusters scaling up without the need for manual intervention).
ADLS Gen 2 Storage Type is supported.
Ability to specify multiple ADLS Gen 1 Storage Accounts.
Bug fixes
HBase
Table pagination is correct. (HBASE-95)
The Tables RegionCount and AverageRegionSize columns are populated for all tables. (UIX-1673)
Kafka KPIs are correctly populated. (Customer-984)
MR
The app Cluster ID is always listed. (PLATFORM-1430)
Platform
Applications no longer slow down or throw Container "not starting" errors when you add new Nodes to an existing EMR cluster without an Unravel bootstrap script. (EMR-4, EMR-3)
Tez
On HDInsight, Data I/O is no longer missing. (TEZLLAP-265)
DAG counters and graphs are no longer missing. (TEZLLAP-277)
NullPointerException is no longer raised when ATS cannot be reached. (TEZLLAP-278)
UI
You can now filter on app names containing spaces. (CUSTOMER-652)
Report for chargeback changes when different clusters are selected. (PLATFORM-1499)
You can select a specific cluster for the Cluster Workload page. (UIX-1886)
The last update for the SQL query plan reaches the Spark worker. (USPARK-162)
Known issues
Operations
The Apps Running and Apps Pending count is not correct on the Operations page. (DT-119)
Applications
Applications details view
The Timeline view of tasks only shows the job type Spark-Submit. This is because the Event Logs are not available for other job types.
In the very first horizontal section
The right panel tabs (Program, Task Attempts, Resource) are shown even if there is no data. Sometimes you may see a notice about missing data.
Intermittently the Navigation tab may have jobs with incomplete stage information. (DT-201)
Cluster Name and Cluster ID tooltips are incorrectly exchanged. (DT-204)
App Name value appears as “Databricks Shell” (class name instead of the last run_name) when jobs are submitted as Spark jar tasks, Notebook tasks and Python tasks. (DT-179, DT-176)
In some cases the Spark SQL query text and query plan mismatch. (DT-186)
When there are driver-side exceptions in a spark-submit application, the status is still captured as SUCCESS. It is difficult to capture driver-side exceptions because Databricks does not log them in the driver log. (DT-192)
When a Python script is executed as a spark-submit task with eventLog enabled, it becomes stuck forever. This is a bug on the Databricks side, which we have raised in the Databricks forum. Workaround: do not enable eventLog for a spark-submit task that runs a Python script.
Filter by Application Name has issues when the name contains special characters such as hyphens. By default, spark.app.name contains hyphens (e.g., app-20190712214639-0000), so this affects the majority of Spark applications in Databricks. (DT-200)
When a running job is forcefully cancelled, the application status is unpredictable because Unravel doesn't get enough time to send messages from Brace. (DT-202)
Jobs > Details View (DT-195)
Jobs are labeled as Workflow in several places instead of Job.
The number of jobs is labeled as “# of YARN Apps” instead of “# OF Apps”. (DT-195)
Tagging
The Unravel Tagging feature, as described here, is not supported. This affects RBAC as well.
Data Insights (Reports > Data Insights > Overview)
Accessed Partitions Section is empty. (DT-175)
Miscellaneous
For spark-submit jobs, Databricks’s default Spark extraJavaOptions are overwritten by Unravel's extraJavaOptions, which are required for monitoring. (DT-113)
Application count is incorrect on the Operations page. (DT-119)
The Graphs tab is intermittently missing. (DT-209)
AutoActions
In multi-cluster configurations AutoActions does not differentiate between entities of each cluster and setting up a policy targets all monitored clusters. For instance, setting up a rule to target root queue causes it to be monitored on all clusters. (AA-174)
Workaround:
If the cluster ID is known, isolate the policy for the cluster using policy options.
AutoActions uses the internal Hadoop cluster ID instead of the Unravel cluster ID/name. You must obtain the internal cluster ID in order to specify a Hadoop cluster in the policy options section. It can be obtained from the HDFS namenode, where it’s stored in {dfs.namenode.name.dir}/current/VERSION. (AA-150)
“RECENT EVENTS & ALERTS” shows the events across all clusters regardless of the currently selected cluster. (AA-127, AA-151)
Application Master level metrics, such as job metrics and job counters, are not collected by the EMR sensor by default and therefore cannot be used in AA policies. Collection of AM metrics can be enabled manually using the “am-polling” option in the EMR sensor. (AA-184)
In exceptionally rare cases AutoActions can be triggered up to 180 seconds later in case of transport message protocol synchronization error but no data loss is expected.
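For the internal Hadoop cluster ID lookup mentioned above (AA-150), the following is a minimal Python sketch rather than an Unravel tool. It assumes the VERSION file is a plain `key=value` properties file containing a `clusterID` entry, as is typical for HDFS NameNode metadata directories. The function names and the sample ID are hypothetical, and the actual path depends on your `dfs.namenode.name.dir` setting.

```python
from pathlib import Path


def parse_cluster_id(version_text):
    """Return the clusterID value from VERSION file contents, or None if absent."""
    for line in version_text.splitlines():
        line = line.strip()
        # Skip blank lines and the timestamp comment usually found at the top.
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep and key.strip() == "clusterID":
            return value.strip()
    return None


def read_cluster_id(version_file):
    """Read a VERSION file from disk and return its clusterID, or None."""
    return parse_cluster_id(Path(version_file).read_text())


# Hypothetical file contents, for illustration only.
sample = """#Mon Jul 15 00:00:00 UTC 2019
namespaceID=123456789
clusterID=CID-00000000-example
cTime=0
storageType=NAME_NODE
"""
print(parse_cluster_id(sample))  # prints CID-00000000-example
```

On a namenode host you would call `read_cluster_id` with the full path to `current/VERSION` under the configured name directory, then use the returned value in the policy options section.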
Data insights page
Table size will be shown as 0 if data is in object storage. (DATAPAGE-104)
Created and Accessed Partition details are missing. (DATAPAGE-109)
Retention Table details show the number of partitions as zero (0). (DATAPAGE-98)
HBase
Spaces are missing from HBase table names. (UIX-1736)
MySQL
Exceptions appear in unravel_jcse2, unravel_td, and unravel_ja.log with an external MySQL 8.0 database. (INSTALL-233)
You can resolve this by adding either of the settings below to unravel.properties:
unravel.jdbc.url.params=logger=com.mysql.cj.log.Slf4JLogger&disableMariaDbDriver
unravel.jdbc.url.params=disableMariaDbDriver
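If you prefer to script the INSTALL-233 change, the sketch below shows one way to set a property in the text of a properties file without creating a duplicate entry. It is an illustration under assumptions, not an Unravel utility: `set_property` is a hypothetical helper, it handles only simple single-line `key=value` entries, and you still need to locate unravel.properties in your own installation and write the result back yourself.

```python
def set_property(properties_text, key, value):
    """Return properties_text with key set to value.

    An existing uncommented entry for key is replaced in place;
    otherwise a new entry is appended at the end.
    """
    new_line = f"{key}={value}"
    lines = properties_text.splitlines()
    replaced = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        if stripped.startswith("#"):
            continue  # leave commented-out entries untouched
        name, sep, _ = stripped.partition("=")
        if sep and name.strip() == key:
            lines[i] = new_line
            replaced = True
    if not replaced:
        lines.append(new_line)
    return "\n".join(lines) + "\n"


# Hypothetical existing contents, for illustration only.
existing = "some.other.property=1\n"
updated = set_property(existing, "unravel.jdbc.url.params", "disableMariaDbDriver")
print(updated, end="")
```

Back up the original file before overwriting it; the first setting listed above can be applied the same way by passing its value instead.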
Platform
appstatus.AppInfoAccessor: Unable to update ES document. (CUSTOMER-658)
Tez
Hive queries that do not submit any DAG have empty cluster ID and queue name. (TEZLLAP-274)
For Hive-Tez, the event "TEZ DAG VERTEX USED TOO MANY REDUCER TASKS" incorrectly reports the current value for 'hive.exec.reducers.bytes.per.reducer' as 67108864 when the configured value is less than 67108864. (TEZLLAP-282)
UI
Search behavior is not consistent across all search options/boxes. (CUSTOMER-341)
The Resources charts display incorrectly when a single Unravel deployment monitors multiple EMR clusters. (Platform-1438)
Selecting specific clusters for Cluster Summary and Cluster Compare in Reports > Operational Insights does not work. (PLATFORM-1610)
The UI refreshes inconsistently across features. (UIX-1741, UIX-1721)
For a large number of tables, the UI Details page freezes if you try to filter the tables by labels. (UIX-1874)
The Resource Usage graph does not display the entire container Data graph. (UIX-1732)
YARN log aggregation IndexedFormat is not working.
For support issues, visit Unravel Support.