Running AutoAction demos
The Demos program gives you a way to understand and experiment with AutoActions and how they are triggered. Example AutoActions are included for MapReduce, Hive, Spark, and Workflow jobs. Scripts are provided that trigger a violation to demonstrate the AutoActions in action.
Contact Unravel Support to download the AutoActions demos.
Text indicates where you must substitute your particular values for the text.
demo-path is the complete path of the auto-actions-demos directory.
Unpack and install the AutoAction demos
Put the auto-actions-demos.tgz file in the directory on the Unravel Server host machine where you want to unpack it.
Navigate to the directory and unpack the demos.
# tar -xvzf auto-actions-demos.tgz
Tar creates the auto-actions-demos directory and unpacks the files into it. The directory should contain the following subdirectories.
# ls auto-actions-demos
demos/ setup/
Go to the demo-path/setup directory.
Open the ./settings file and enter the email address to receive violation notifications.
Execute the ./setup-all script.
# ./setup-all
Note
AutoAction rules that include a time specification are automatically adjusted to the current time period, for example, from CURRENT_HOUR:00 to CURRENT_HOUR+2:00.
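To make the CURRENT_HOUR:00 to CURRENT_HOUR+2:00 example concrete, the following sketch prints such a window for a given hour. The function name demo_time_window is hypothetical, and wrapping past midnight (modulo 24) is an assumption; the demos' own adjustment logic may differ.

```shell
# Illustrative only: mirrors the CURRENT_HOUR:00 to CURRENT_HOUR+2:00 example.
# Wrapping past midnight (modulo 24) is an assumption, not documented behavior.
demo_time_window() {
    hour=${1#0}        # drop one leading zero so "08" is not parsed as octal
    hour=${hour:-0}    # a bare "0" is stripped to empty, so restore hour 0
    end=$(( (hour + 2) % 24 ))
    printf '%02d:00 to %02d:00\n' "$hour" "$end"
}

# Print the window for the current hour on this machine.
demo_time_window "$(date +%H)"
```

Running this shortly before you execute ./setup-all gives a rough idea of the period during which the time-bound demo rules can be expected to fire.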
After running the script, go to the Unravel Server UI and select the Manage | AutoActions tab.
You should see all the AutoAction demos listed under Active AutoActions. Each AutoAction is named AA-tag, for example, AA-Spark-1c or AA-MR-1b.
Executing the demos
Go to the demo-path/demos directory. For each AutoAction rule listed in Manage | AutoActions there is a corresponding script in the demos directory. Each script triggers the corresponding AutoAction demo. For example, the UI lists the AutoAction named AA-Spark-1c; the corresponding triggering script file is demo-Spark-1c. You should see the following files in the demos directory.
# cd demo-path/demos
# ls
demo-Hive-1a  demo-MR-1a  demo-MR-2b     demo-Spark-1b
demo-Hive-2a  demo-MR-1b  demo-MR-3a     demo-Spark-1c
demo-Hive-2b  demo-MR-1c  demo-MR-3b     run-all-demos
demo-Hive-3a  demo-MR-2a  demo-Spark-1a  scripts/
Execute the ./demo-tag script to trigger the corresponding AA-tag rule. Each script is designed to simulate violation conditions for the corresponding AutoAction on the target Hadoop cluster. For example, to trigger AA-Spark-1c you run the demo-Spark-1c script.
Some of the AutoAction demo scripts trigger multiple AutoActions. The same side effect can occur with AutoActions you define yourself when their definitions overlap.
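The AA-tag to demo-tag naming rule can be sketched as a small shell helper. The function names demo_script_for and run_demo_for and the DEMO_PATH variable are hypothetical and not part of the demos package; DEMO_PATH stands in for your demo-path.

```shell
# Map an AutoAction name (AA-tag) to its triggering script name (demo-tag).
demo_script_for() {
    case "$1" in
        AA-*) printf 'demo-%s\n' "${1#AA-}" ;;
        *)    echo "not an AA-tag name: $1" >&2; return 1 ;;
    esac
}

# Run the triggering script for one AutoAction from the demos directory.
# DEMO_PATH is a placeholder for your demo-path and must be set by you.
run_demo_for() {
    script=$(demo_script_for "$1") || return 1
    ( cd "${DEMO_PATH:?set DEMO_PATH to your demo-path}/demos" && "./$script" )
}
```

With DEMO_PATH set, calling run_demo_for AA-Spark-1c would execute ./demo-Spark-1c from the demos directory. Because some scripts trigger more than one AutoAction, expect additional alerts beyond the one you asked for.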
Cleaning up demos
Run the ./clean-all script to remove all the demo AutoActions from the Unravel Server.
# cd demo-path/setup
# ./clean-all
If you want to run the demos again, follow the procedure starting with extracting the files (step 3 above).
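If you clean up and reinstall the demos often, the two scripts can be chained. This is a minimal sketch, assuming that ./clean-all and ./setup-all both live in demo-path/setup and that ./settings already contains your notification email address; reset_demos is a hypothetical name, not a script shipped with the demos.

```shell
# Hypothetical wrapper: remove the demo AutoActions, then install them again.
# Assumes ./clean-all and ./setup-all are both in demo-path/setup and that
# ./settings already holds the notification email address.
reset_demos() {
    setup_dir="${1:?usage: reset_demos demo-path}/setup"
    (
        cd "$setup_dir" || exit 1
        # Only reinstall if the cleanup succeeded.
        ./clean-all && ./setup-all
    )
}
```

For example, reset_demos /path/to/auto-actions-demos runs the cleanup first and reinstalls only if it succeeds. The path shown is a placeholder.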
AutoActions demos list
| Application and Alert Type | Use case | AutoAction and Triggering Script | Notes |
| MapReduce: Alert if a MapReduce job is grabbing the majority of cluster resources and may affect other users' jobs at any time. | Alert if any MapReduce job allocated memory > 20GB. | AA-MR-1a demo-MR-1a | Submits to “root.sla” queue. |
| | Alert if any MapReduce job allocated vCores > 10. | AA-MR-1b demo-MR-1b | Submits to “root.sla” queue. |
| | Alert if any MapReduce job is running for longer than 10 minutes. | AA-MR-1c demo-MR-1c | Submits to “root.sla” queue. May trigger MR-1b. |
| MapReduce: Alert if a MapReduce job may affect any production SLA jobs running on a cluster. | Alert if any app is not in the queue ‘sla_queue’ and running between X and Y and allocated memory > 20GB. | AA-MR-2a demo-MR-2a | Also triggers MR-1a. |
| | Alert if any app is not in the queue ‘sla_queue’ and running between X and Y and allocated vCores > 10. | AA-MR-2b demo-MR-2b | Also triggers MR-2a. |
| MapReduce: Alert if an ad hoc MapReduce job is grabbing the majority of cluster resources and may affect cluster performance. | Alert if any MapReduce job allocated vCores > 10 between X and Y in queue ‘root.adhoc’. | AA-MR-3a demo-MR-3a | Submits to “root.adhoc” queue. Also triggers MR-1a and MR-2a. |
| | Alert if any MapReduce job allocated memory > 20GB between X and Y in queue ‘root.adhoc’. | AA-MR-3b demo-MR-3b | Submits to “root.adhoc” queue. Also triggers MR-1b and MR-2b. |
| Spark: Alert if a Spark app is grabbing the majority of cluster resources and may affect other users' jobs at any time. | Alert if any Spark app has allocated more than 20GB of memory. | AA-Spark-1a demo-Spark-1a | |
| | Alert if any Spark app allocated vCores > 8. | AA-Spark-1b demo-Spark-1b | |
| | Alert if any Spark app is running longer than 10 minutes. | AA-Spark-1c demo-Spark-1c | |
| Spark: Alert if a Spark SQL query has unbalanced input vs output, which may point to inefficient or “rogue” queries. | Alert if any Spark app is generating lots of rows in comparison with input, i.e. ‘outputToInputRowRatio’ > 1000. | TBD | |
| Hive: Alert if a Hive query is running longer than expected. | Alert if a Hive query duration > 5 minutes. | AA-Hive-1a demo-Hive-1a | You can Ctrl-C the query once it triggers the AA. |
| Hive: Alert if an SLA-bound query is taking longer than expected. | Alert if a Hive query started between A:00 and B:00 in queue ‘root.prod’ and duration > 10 minutes. | AA-Hive-2a demo-Hive-2a | You can Ctrl-C the query once it triggers the AA. |
| | Alert if any Hive query is started between A:00 and B:00 in any queue except ‘root.prod’. | AA-Hive-2b demo-Hive-2b | Very short query. |
| Hive: Alert if a Hive query is writing lots of data. | Alert if a Hive query writes out more than 200MB in total. | AA-Hive-3a demo-Hive-3a | |
| | Alert if a Hive query reads in more than 10GB in total. | AA-Hive-3b demo-Hive-3b | |
| Hive: Detect inefficient and “stuck” Hive queries. | Alert if any Hive query has read less than 10MB in total and its duration is longer than 10 minutes. | AA-Hive-4a demo-Hive-4a | |
| | Alert if any Hive query in the queue ‘root.adhoc’ is running for longer than 2 minutes. | AA-Hive-4b demo-Hive-4b | |
| Workflow: Alert if a workflow is taking longer than expected. | Alert if any workflow is running for longer than 10 minutes and might be stuck. | AA-WF-1a demo-WF-1a | You can Ctrl-C the query once it triggers the AA. |
| | Alert if an SLA-bound workflow named ‘market_report’ is running for longer than 5 minutes. | AA-WF-1b demo-WF-1b | You can Ctrl-C the query once it triggers the AA. |
| Workflow: Alert if a workflow is reading more data than expected. |