Cluster discovery

The Cluster discovery report provides overall information about your cluster. The report includes the following sections:

  • On-Prem Cluster Identity contains the cluster configuration details and host information.

  • Overall cluster usage graphs of:

    • Applications submitted, by app type, user, and queue.

    • CPU

    • Memory

  • A CPU/memory heatmap that aggregates usage by weekday and then by hour of the day.

Configuring Cluster Discovery report

For the Cluster Discovery report, ensure that the configuration for Migration reports is set.

Generating Cluster Discovery report
  1. Go to Migrations > Cluster Discovery.

  2. Click the run.png button to generate a new report.

    migration-clusterdiscovery-run.png
  3. Select a date range from the date picker and click Run to generate the report.

    The progress of the report generation is shown at the top of the page, and you are notified when the report is created successfully.

    All reports, whether successful or failed, are listed in the Reports Archive.

Note

Until the first report is generated, the default history is seven days.

Scheduling Cluster Discovery report
  1. Click Schedule to generate the report regularly and provide the following details:

    • History (Date Range): Select a period from the date picker.

    • Schedule Name: Name of the schedule.

    • Schedule to Run: Select one of the following schedule options from the drop-down, and then set the time using the hours and minutes drop-downs:

      • Daily

      • Weekdays (Sun-Sat)

      • Every two weeks

      • Every month

    • Notification: Provide an email address to receive notifications when reports are generated.

  2. Click Schedule.

Viewing the Cluster Discovery report
migration-cluster-discovery_report.png
On-Prem Cluster Identity

This tile contains information about your cluster, including the hosts. The Host Summary section shows the cluster's capacity across all hosts.

cluster-discovery-on-prem-cluster-identity.png

To see each host's hardware specifications and roles, click the # Hosts link. You can search the table by hostname. The potential roles are:

  • Server: Has at least one server component, such as Zookeeper Server, HDFS.

  • Worker: Has at least one daemon component, such as HDFS DataNode, YARN NodeManager, or HBase RegionServer.

  • Client: Has at least one client component, such as Zookeeper Client, Hadoop Client, or Hive Client.
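
The role rules above can be sketched in a few lines of Python. This is only an illustration of the "at least one component" logic, not Unravel's implementation: the `ROLE_COMPONENTS` mapping and the `host_roles` function are hypothetical names, and the mapping contains only the example components named in the list, so a real cluster would have many more.

```python
# Hypothetical mapping from role to example components (assumption: only the
# components named in the list above are included here).
ROLE_COMPONENTS = {
    "Server": {"Zookeeper Server"},
    "Worker": {"HDFS DataNode", "YARN NodeManager", "HBase RegionServer"},
    "Client": {"Zookeeper Client", "Hadoop Client", "Hive Client"},
}


def host_roles(components):
    """Return the roles a host holds, in the order Server, Worker, Client.

    A host holds a role when it runs at least one component of that role,
    so a single host can hold several roles at once.
    """
    present = set(components)
    return [role for role, names in ROLE_COMPONENTS.items() if present & names]
```

For example, `host_roles(["HDFS DataNode", "Hive Client"])` returns `["Worker", "Client"]`, because the host runs one worker component and one client component.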

Applications usage

The donut graphs show overall application usage in the cluster, grouped by app type, user, and queue. The top 10 in each category are shown. Select or clear the checkboxes for the categories to change the graph view.

cluster-discovery-graphs-resource_usage1.png
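
As a rough illustration of how a "top 10 per category" breakdown like the one in the donut graphs could be derived, the sketch below counts applications by one grouping key and keeps the largest groups. The record layout (dictionaries with `type`, `user`, and `queue` keys) and the `top_n` function are assumptions made for this example, not part of the product.

```python
from collections import Counter


def top_n(apps, key, n=10):
    """Count applications by `key` and return the n largest groups.

    `apps` is assumed to be a list of dicts, each with "type", "user",
    and "queue" entries (a hypothetical record layout).
    Returns a list of (value, count) pairs, largest count first.
    """
    counts = Counter(app[key] for app in apps)
    return counts.most_common(n)


# Hypothetical sample data for illustration only.
apps = [
    {"type": "spark", "user": "alice", "queue": "default"},
    {"type": "spark", "user": "bob", "queue": "etl"},
    {"type": "mr", "user": "alice", "queue": "default"},
]
by_user = top_n(apps, "user")  # [("alice", 2), ("bob", 1)]
```

Calling `top_n` once per key (`"type"`, `"user"`, `"queue"`) gives the three groupings that the report presents side by side.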
Resource availability and usage

The graphs display the cluster's CPU and memory utilization over the time period. The capacity and the actual usage trends are plotted on the graph. The average usage is listed on the right-hand side of the title bar. Hover over the text next to the resource's name to see Unravel's analysis of your cluster's usage for that resource.

cluster-discovery-graphs-resource_usage2.png
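
One plausible way to read the average usage figure is as total usage divided by total capacity over the period. The sketch below shows that calculation under this assumption; the report may compute its average differently, and the `average_utilization` function and its inputs are hypothetical.

```python
def average_utilization(usage, capacity):
    """Return usage as a percentage of capacity over the whole period.

    `usage` and `capacity` are assumed to be equal-length sequences of
    samples taken at the same times (for example, cores used and cores
    available per hour).
    """
    if not usage or len(usage) != len(capacity):
        raise ValueError("usage and capacity must be non-empty and the same length")
    total_capacity = sum(capacity)
    if total_capacity == 0:
        return 0.0  # no capacity reported, so there is nothing to utilize
    return 100.0 * sum(usage) / total_capacity
```

With `usage = [10, 20, 30]` and `capacity = [100, 100, 100]`, the function returns `20.0`, meaning the resource was 20% utilized on average.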

The heatmap shows CPU/memory usage and capacity by weekday and hour, for example, Monday between 5 and 6 a.m. Each time slot is color-coded to show how hot it is relative to the rest of the map, so you can quickly see the load distribution across your cluster. You can filter the heatmap by CPU or memory; it opens displaying CPU. In this example, the CPU graph above indicates that the CPU is under-utilized, and the heatmap graphically supports that analysis. Click expand.png to expand and view the heatmap.

cluster-discovery-cpu-heatmap.png
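
The weekday-and-hour aggregation behind a heatmap like this can be sketched as follows. This is a minimal illustration, assuming that each slot holds the mean of the usage samples that fall into it and that a slot's "heat" is its value relative to the hottest slot; the actual aggregation and color scale used by the report are not documented here. The function names are hypothetical.

```python
from collections import defaultdict
from datetime import datetime

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def weekday_hour_means(samples):
    """Aggregate (timestamp, usage) samples into mean usage per slot.

    Returns a dict keyed by (weekday name, hour of day), so samples from
    different weeks that share a weekday and hour land in the same slot.
    """
    buckets = defaultdict(list)
    for ts, usage in samples:
        buckets[(WEEKDAYS[ts.weekday()], ts.hour)].append(usage)
    return {slot: sum(values) / len(values) for slot, values in buckets.items()}


def relative_heat(slot_means):
    """Scale each slot to the range 0..1 relative to the hottest slot."""
    peak = max(slot_means.values(), default=0)
    if peak == 0:
        return {slot: 0.0 for slot in slot_means}
    return {slot: value / peak for slot, value in slot_means.items()}


# Hypothetical samples: two Mondays and one Tuesday, all between 5 and 6 a.m.
samples = [
    (datetime(2024, 1, 1, 5, 10), 20.0),
    (datetime(2024, 1, 8, 5, 40), 40.0),
    (datetime(2024, 1, 2, 5, 0), 10.0),
]
means = weekday_hour_means(samples)
heat = relative_heat(means)
```

Here the two Monday samples average to 30.0 for the slot `("Monday", 5)`, which becomes the hottest slot with a relative heat of 1.0, while `("Tuesday", 5)` is about one third as hot.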