Skip to main content

Home

v 4.7.9.6 Release notes

Release information

Release date

August 2024

4796 Software

Downloads

Configuration properties

Properties

New features

  • Insights

    This release brings backend improvements that enhance performance, efficiency, and the accuracy of insights across the platform. It brings more cost-saving recommendations focusing on workload rightsizing and optimization. The following insights are included in this release:

    • Rightsizing for Streaming Workloads:

      • For fixed-sized clusters, recommendations are provided to optimize cost and performance.

      • For autoscaling clusters, to ensure efficient resource allocation during scaling events.

    • Rightsizing Recommendations for Autoscaling Scenarios:

      • For long-running tasks within a job run, optimizing resource usage throughout task execution.

      • For parallel executions of tasks within a job, enhancing efficiency for simultaneous processes.

      • For contention on driver scenarios, reducing resource competition on the driver node to improve performance.

    • Conservative Savings Model: A new savings model provides cost-saving insights across all categories, even when direct cost bindings are unavailable.

    Instance type whitelisting has been implemented to provide enhanced control over the types of instances used for receiving insights, ensuring that only approved instance types are utilized.

  • Home page

    The Top-X page now includes a "Most Wasted Cost" widget, highlighting the costs associated with failed and killed jobs, allowing users to quickly identify and address inefficiencies.

  • Customer ValueGen app

    This release introduces a new Customer ValueGen app with comprehensive workload analysis, ROI computation, and productivity insights for Databricks environments. The app enables users to analyze historical data, benchmark costs, and identify optimization opportunities to drive enhanced cost efficiency and operational improvements. It includes features for evaluating savings potential, monitoring the cost and savings funnel, and running various diagnostics on the status of insights. Additionally, users can conduct productivity boost analysis and mark applications as interesting to focus on key areas for optimization.

  • Databricks Partner Connect

    Unravel Data is now integrated into the Databricks Partner Connect page, allowing end users to directly create trial accounts for greater flexibility.

  • Data Pipelines and Improved Cost Insights

    Enhancements to secondary data pipelines have been made to ensure better performance, improved monitoring, and greater consistency across the platform. Additionally, backend changes on the Cost Explorer page now deliver more accurate insights, providing you with more precise real-time costs in trends, drill-downs, and budget tabs.

  • UI Updates

    This release includes several UI improvements to enhance clarity and readability across the platform. These updates include refined color schemes and label adjustments to ensure better visual contrast and a more intuitive user experience. In addition, sorting and search issues as well as consistency and accuracy problems are addressed, ensuring a more seamless and accurate experience across the platform.

  • Security, Infrastructure and Compliance

    Critical security vulnerabilities related to Public/Internal Services have been addressed, and the handling of encrypted sensitive values has been improved.

    Activities related to CentOS 7’s end-of-life in June have been completed.

    Configuration management capabilities for non-Kubernetes environments have been enhanced, and the installation and build processes have been streamlined for faster deployments.

The following table contains key issues addressed in the 4.7.9.6 release.

ID

Description

App Store

APP-614

App Store tasks fail to start with SSL enabled on the MySQL database.

Auto Action

DT-3033

The date-time format in the policy-violation downloaded CSV file is incorrect.

Billing

DT-3186

The Billed Cost Over Time graph includes the first day of the next month on the X-axis, despite displaying only the data for the selected period. This issue does not affect functionality or data accuracy.

DT-3171

The downloaded CSV file name should be suffixed with the month of the year for which billing data is downloaded.

Cost

DT-2981

There is a significant cost deviation between dbx_cluster and Spark hitdocs.

DT-2125

The cost displayed for the "Executor Idle Time Detected" insight is incorrect

UIX-6305

The Others category is displayed twice in legends when the number of clusters exceeds 1000 in the Chargeback page.

DT-3130

In the Trends tab, the time displayed in the last x days selection has a time value greater than the current time on the trends page.

Compute

DT-2083

The Total Allocated Key Performance Indicators (KPIs) for Vcore and memory are not visible in the Compute > Trends page.

All jobs in the running status are displayed in the Finished tab under Job Runs instead of showing only the finished jobs.

Home

DT-3095

There is an inconsistency in displaying the 'Wasted Cost' and 'Untapped Savings' values compared to the 'Total Cost' value under each section in the Home -> TopX tab.

Spark

UIX-6567

The filters are not retained when switching between the Inefficient and All tabs in the Databricks UI.

DT-4040

The percentage shown in "Potential Savings per Run" for the Node Downsizing Insight is incorrect.

DT-3099

The Task Time and App Time fields are empty for certain jobs.

DT-2981

Some cost deviations are observed between Spark hitdocs and Databricks clusters.

UI

UIX-6321

The Workflow section currently displays jobs running within the specified duration instead of only jobs completed within the selected time frame.

Unravel Assistant

AI-134, AI-138

The Unravel Assistant provides incorrect responses to certain questions related to productivity and data skew jobs. The team is actively working on resolving this issue.

Workflows

CPLANE-3614

Audit logs are not getting written in the database.

The upcoming releases will include the following key fixes to enhance user experience. It is important to note that while these issues exist, there is no immediate critical impact on using the product, and users can continue to utilize its functionality with confidence.

ID

Description

Cost

DT-2968, DT-2967

Incorrect filters applied while redirecting from budgets to chargeback.

Data

DT-3123

In the Data page section, the default enabled checkbox is incorrectly disabled under tables.

Home

DT-3096, DT-3095, DT-2837

The Annualized savings under the sections highlighting the Most Savings in the Topx tab, is incorrectly displaying as zero dollars in certain scenarios.

DT-2920, DT-3060, DT-2821

Instead of displaying No data present for selected date range when data is unavailable for a selected date range, empty widgets are shown.

DT-2836

Wastage cost is incorrectly showing more than the Cluster cost in some cases.

Insights

DT-2006

Recommendations are provided for a failed pipeline when users utilize multiple tasks with shared job clusters, and one of the tasks fails.

DT-2125

The UI shows a cost discrepancy for the Executor Idle time detected insight in the Databricks version 14.2 with Photon enabled.

DT-3037

Node right sizing filters are not being applied on the Insights Preview page.

DT-3012

The insight preview page displays incorrect costs for job signatures.

Reports

DT-3122

Incorrect count of events shown on TopX Report.

Spark

DT-3104

Multiple instances of the same events are being generated for a single Spark application cluster.

DT-3103, DT-3026

There is a mismatch in the executor count when garbage collection (GC) events occur.

SaaS (Free)

DT-2037

In the Databricks Standard (free) environment, there is an issue where the User Flow badge obstructs pagination.

UI

UIX-6281

The cost comparison for all the instances is not displayed on the Pipeline detail page.

Workflows

DT-2104

Sorting is incorrect when the list contains both strings starting with capital and small case letters,

  • Azure bill integration

    • Actual bill data is not live; once configured, it will be available from Azure once every day. The cost for a few clusters may be updated after a couple of days in the bill.

    • One record per cluster per day is maintained, even if the cluster is restarted multiple times within a day or cluster sessions span across multiple days.

    • If there are issues with Azure billing, the cost data will not be updated on Unravel.

    • Tags on cost pages come from actual cost data, while tags on the compute page come from Spark configuration. There is a possibility that the tags on these pages don’t match. This issue will be fixed in the upcoming release.

  • Billing

    • Some discrepancies may occur in cost calculations due to differences between the user time zone displayed on the Compute page and the UTC-based aggregation on the Billing page. (DT-2350).

    • In certain scenarios, the budget status may inaccurately display as Ok even when the budget has been exceeded. This discrepancy occurs when Azure billing is enabled and is a known limitation. Notifications for budget can be delayed by two or three days due to the minimum 24-hour delay in receiving bills. (DT-3091)

  • Compute

    • Jobs by status graphs in the Trends tab display spark application details and not the job details. Our development is actively looking into this design limitation and efforts are underway to address this in future updates to enhance the product's capabilities. (DT-2008)

    • Data is inconsistent between Compute page and Cost page in some cases when Azure bill is enabled. This is a known limitation because of Azure bill data not being live. Our team is actively looking into this limitation and efforts are underway to address this issue in the future updates. (DT-3078)

  • Data

    If tables are created with the same name, accessed, deleted, and re-created, and if those tables are re-accessed, then their query and app count do not match. (DATAPAGE-502)

  • Home

    Home page does not display alerts on the UI when there is missing ROI data for a single day. (DT-2509)

    Hovering on Total Cost Trend on the Summary tab of the Home page may display inaccurate date information. (DT-2408)

  • Insights Preview

    In some cases, there is a discrepancy in the runs displayed in the Job Runs page for the selected job. When you click on View runs, only runs with dbx_cluster kind having a Spark app id are listed. The discrepancy is due to data inconsistency between entries of dbx_cluster kind and db kind and this is an expected behavior. (DT-3042)

  • Workflows

    The current implementation has a limitation where the wrong run count is displayed for the job ID when sorting by run count in the Workflows > Jobs section. This discrepancy is currently under investigation by our development team, and active efforts are being taken to resolve this issue. (UIX-6526)

Our development team is actively investigating the following Known issues and are working towards resolving them. It is important to note that while these issues exist, there is no immediate critical impact on using the product, and users can continue to utilize its functionality with confidence.

Bug ID

Description

Workaround

Compute

PIPELINE-1636

Inconsistent data is displayed for the cluster Duration and Start Time on the Compute page.

NA

CUSTOMER-3017

The job duration displayed in the TopX section of the "Longest Running Jobs" on the Job Compute graph is incorrect.

NA

Cost

UIX-5624

Data is not displayed when you click the Optimize button corresponding to OTHERS for the Cost > Chargeback results shown in the table.

NA

DT-1094

The No data available message is displayed on the Compute page after navigating from the  Trends and  Chargeback  pages with  Tag  filters.

NA

Datapage

DATAPAGE-473

For Hive metastore 3.1.0 or earlier versions, the creation time of partitions is not captured if a partition is created dynamically.  Therefore, the Last Day KPI for the partition section is not shown in Unravel.

NA

Insights

DT-1987

There is a mismatch in the computation of costs for fleet and spot instances in Databricks clusters. This issue arises due to the unavailability of the exact node type in the cluster info response.

NA

Performance

ASI-933

In the Lag setup, the Duration is not updated for running applications. The Duration should be updated every 15 minutes.

NA

ASI-936

In the Lag setup, the App Time data is missing in the Timing tab of many applications.

NA

Spark

PIPELINE-1616

If the Spark job is not running for Databricks, the values for the Duration and End time fields are not updated on the Databricks Run Details page.

NA

DT-2012

Incorrect details are displayed on the AppSummary > Job Run page when a user repairs a previously failed job. The displayed information may not accurately reflect the repaired job's details.

NA

DT-1742

The timezone for the NodeRightSizing insight event is inconsistent in the Spark details page.

NA

DT-2029

Applications in a success state may inaccurately display an associated job in a running state instead of transitioning to a failed state.

NA

DT-3122

The TopX Report displays an incorrect count of events.

UI

PIPELINE-1935

In the Pipeline details page, when you select the data for a specific date, all instances are displayed instead of displaying only the instances within a selected date.

NA

UIX-6263

The WhiteCross.png cross button on the Pipeline details page does not close the detail page when you click the bars inside the Gantt chart.

NA

Workflows

DT-1461, PIPELINE-1939, PIPELINE-1940, DT-1093, PIPELINE-1924

The UI and data exhibit inconsistencies, including problems with job run details, issues related to multiple workflow runs and UTC timestamps , empty content in workflow job pages and issues with filter values and duration display.

NA