Skip to main content

Home

Top-K users

A report of users that are ranked among the top based on metrics. This report is useful for analyzing top consumers of resources in the clusters.

  1. Click Generate Reports > New button.

  2. In the New Report dialog box, enter the following details.

    TopK

    Items

    Description

    General

    Name

    Name of the report.

    Environment

    Select your platform. The reports corresponding to the selected platform are only listed in the Report type. If you select the All option, all the reports are listed.

    Report type

    Type of report. Select Top-K Users.

    Schedule

    Select the checkbox to schedule the report to run daily, hourly, weekly, or monthly. You can also set the schedule using a cron expression. You can expand the Example drop-down and select the corresponding options. The next four sample run times are displayed for reference.

    Retention

    The number of days to retain the report files. All the reports are stored in the unity-one/src/assets/reports/jobs directory. After completion of the retention period, the report files are automatically purged.

    Parameters

    Look Back

    Period range in days when you can select applications for report generation.

    Time Range

    Check this option and select the from and to dates to schedule the report for the exact time range specified. The only anomaly with this option is that the same report will be scheduled to be generated repeatedly.

    Trend interval

    Select the interval to aggregate the metrics for the trend graphs. The options are daily, hourly, weekly, or monthly. If not selected the interval will be computed internally based on the Look Back days.

    Users

    Select users that you want to be included in the report.

    If none are selected, then all the users are included in the report. Otherwise, only the specified users will be considered in the Top-k analysis.

    Queues

    Select the queues that you want to include in the report. You can select multiple queues. If none are selected, all queues detected will be included in the report.

    Kinds

    Select the applications that must be included in the analysis report. The options are Impala, Hive, Spark, and MR.

    Clusters

    Select the clusters that you want to include in the report. You can select multiple clusters. If none are selected, all clusters detected will be included in the report.

    Tags

    The option to filter based on tags, if not selected then all tags will be considered. You can specify the tags as follows:

    • Key: Specify the tag name. A single key can have more than one value.

    • Values: Specify the values based on the selected tag name. If the value is empty, all the values for the selected key are considered.

    Click the Add button to add multiple key-value pairs. Click the Remove button to remove a key-value pair.

    Tag Values

    Select the tag values corresponding to the tag names you have selected that you want to include in the report. Options for tag values will only show when you select a tag. You can select multiple tag values. If none are selected, all tag values detected for a tag will be included in the report.

    TopK

    Select the number of top consumers that must be shown in the report.

    Features Filter

    Filters the key-value features fetched from feature indices. The custom features/fields can be added in features indices by modifying the customizable_feature.py. Those features will be available here.

    Notifications

    Email to

    Email ID to send the notification when the report is generated. You can select the Attach Files to Email checkbox to receive the reports as an attachment.

    Advanced Options

    Profile memory

    Select this option if you want to generate logs that help to troubleshoot scenarios where the report takes excessive time to generate or fails to generate.

    Caution

    The option will increase the report run time excessively.

  3. Click OK. The generated reports will be listed under Reports on the App UI.

  4. Select the generated report and then click Run. After the report is successfully run, the details of the report runs are listed in the Run box on the right.

  5. Click the following:

    • HTML files link to view the report details.

    • Input parameters link to view the parameters you chose to run the report.

    • Log file link to view the logs of the report.

The Top-K users report shows the trends of the top users based on the metrics (IO, compute, memory second, and duration). On the top, the details related to the selected parameters are specified.

The Top-K users report comprises the following:

  • Trend charts: These trend line graphs plot the top users on a time series chart based on each metric (IO, compute, memory second, and duration). In the following time series chart, the hradmin is the top resource consumer among other users.

    topk-users1.png
  • Pie chart: The usage pattern of the top users for a specific metric is shown in a pie chart. A tab is allotted for each metric. Click the metrics (IO, compute, memory second, and duration) tab to view the usage pattern of the top users for the corresponding metric. The others in the pie chart represent the rest of the users.

    topk-users2.png
  • Table: The usage is represented in a tabular format with details about each metric usage by the top-k users and the IO trend.

    topk-users3.png
  • Trendline chart: The trendline chart, based on tags associated with the apps (key:value pairs), for a selected metric.

    trendchart-line-metrics.png
  • Sunburst chart: The sunburst chart shows the distribution of tag keys and values.

    sunburst2.png

    You can click a section on this chart to view the details of a specific tag. Click again on the chart to view all the sections.

    sunburst1.png

Tip

If you want to analyze the usage trend of a specific top-k user, refer to Deep user analysis report.