The Dataflow Migration report analyzes the catalog of apps and tables and derives a sprint plan for migrating them to another environment. You can create this report from either the UI or the command line.
You can use this report to analyze Spark, Impala, Hive, and MR data.
Click Generate Reports > New.
In the New Report dialog box, enter the following details.
Click OK. The generated reports are listed under Reports on the App UI.
From the terminal, change to the unity-one/src directory.
cd /opt/unravel/data/apps/unity-one/src
Use a screen editor, such as vi, to edit the job.json file, and paste the following text into the file.
{
  "enabled": false,
  "retention_days": 10,
  "report_type": "dataflow_migration",
  "params": {
    "kind": [ "spark", "impala", "hive", "mr" ],
    "days": 300,
    "num_waves": 10,
    "num_hours_engg": 75
  },
  "notifications": {}
}
Parameter: Description
start_date: Can be null or a valid start date.
end_date: Can be null or a valid end date.
days: The number of days to look back while downloading data.
topk: The number of applications to be shown in the report.
all_filters: Filter options.
Save and exit the screen editor.
Run the following command to generate the report:
./unity-cli.sh report generate job.json dataflowMigReport true
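As an alternative to hand-editing, the file-creation step above can be scripted. The following is a minimal Python sketch, not part of the product: the helper names write_job_file and build_command are hypothetical, the configuration values are copied from the job.json example in this section, and the command is only assembled as an argument list, not executed. It assumes you would point the target directory at unity-one/src on a real installation.

```python
import json
import tempfile
from pathlib import Path

# Values copied from the job.json example in this section.
JOB_CONFIG = {
    "enabled": False,
    "retention_days": 10,
    "report_type": "dataflow_migration",
    "params": {
        "kind": ["spark", "impala", "hive", "mr"],
        "days": 300,
        "num_waves": 10,
        "num_hours_engg": 75,
    },
    "notifications": {},
}


def write_job_file(directory, config=JOB_CONFIG):
    """Write the config as job.json in `directory` and return the file path.

    Hypothetical helper, not part of the unity-cli tooling.
    """
    path = Path(directory) / "job.json"
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    # Sanity check: re-read the file to confirm it parses as JSON.
    json.loads(path.read_text(encoding="utf-8"))
    return path


def build_command(job_file):
    """Return the report command from this section as an argument list.

    Hypothetical helper; it only builds the arguments and runs nothing.
    """
    return [
        "./unity-cli.sh", "report", "generate",
        Path(job_file).name, "dataflowMigReport", "true",
    ]


if __name__ == "__main__":
    # A temporary directory stands in for unity-one/src in this sketch.
    with tempfile.TemporaryDirectory() as tmp:
        job = write_job_file(tmp)
        print(" ".join(build_command(job)))
```

To run the command for real, you could pass the list returned by build_command to subprocess.run with cwd set to the unity-one/src directory.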
The Dataflow Migration report generates one file for each application type selected and one file to summarize all the data.
Dataflow Migration Report
It contains the number of apps, total hours, and number of engineers for each wave.
Dataflow Migration Report for hive Data
It contains an overall summary and rank list, and estimated effort with links to sample apps and signature.
Open the report and click any app link from the Signature column.
The Metrics tab displays the trend charts for the Duration, IO, MemorySeconds, and CPU metrics. The dots plotted on the trend charts are the app runs. The app info is provided in the right panel.
You can select any of the following features to view the metric trends:
status
cluster id
cluster UID
Click any dot on any of the Metric trend charts. This becomes the app target run. The details of the app target run appear in the App Run info panel on the right.
In the App Run info panel, click the ID link of the app target run. The Metrics tab is shown with the chosen app target run highlighted in a different color. The details of the app target run are displayed in the App Target Run panel on the right. Click the View in Unravel link to view the app's details in Unravel.
Select any dot (other than the dot for the app target run) on any of the Metric trend charts. This becomes the app baseline run, which is highlighted in a different color. The details of the app baseline run are displayed in the App Baseline Run panel on the right. Click the View in Unravel link to view the app's details in Unravel.
In the App Baseline Run panel, click Compare.
Select an app type from the App Type options:
Hive on Tez
Spark
Impala
From Cluster, select a cluster.
Specify the app runs that you want to compare.
Baseline App Run: Specify the ID of the baseline app.
Target App Run: Specify the ID of the target app with which you want to compare the baseline app.
Click Compare. The following app details are given for the baseline and target app run:
App details
Cluster resource usage when these apps were running
App statistics comparison details
App config comparison details
Query string comparison details
Query plan comparison details
In the App Baseline Run panel, click Refresh Insights. The insights are provided in the following sections.
Baseline comparison
The results of the comparison are displayed.
Concurrent heavy hitter apps
This section appears only when the information is available.
Select this option to view the top five applications in a specific category.
Click the Concurrent heavy hitter apps option.
Select any option from the Top-5 Applications option.
The corresponding details are displayed.