VMware Validated Design 20 Monitoring Guide
EN-002204-00
VMware Validated Design Monitoring and Alerting Guide
You can find the most up-to-date technical documentation on the VMware Web site at:
http://www.vmware.com/support/
The VMware Web site also provides the latest product updates.
If you have comments about this documentation, submit your feedback to:
[email protected]
© 2016 VMware, Inc. All rights reserved. This product is protected by U.S. and international copyright
and intellectual property laws. This product is covered by one or more patents listed at
http://www.vmware.com/download/patents.html.
VMware is a registered trademark or trademark of VMware, Inc. in the United States and/or other
jurisdictions. All other marks and names mentioned herein may be trademarks of their respective
companies.
VMware, Inc.
3401 Hillview Avenue
Palo Alto, CA 94304
www.vmware.com
VMware Validated Design Monitoring Guide
Contents
List of Tables
Table 1. Delivery Properties of vRealize Operations Manager Notifications
Table 2. VM and Host Notifications in SDDC
Table 3. Networking Notifications in SDDC
Table 4. Storage Notifications in SDDC
Table 5. Notifications of vRealize Operations Manager Issues
Note: The VMware Validated Design Monitoring and Alerting Guide is compliant and validated with
certain product versions. See Introducing VMware Validated Design for more information
about supported product versions.
VMware Validated Design Monitoring and Alerting Guide is intended for cloud architects,
infrastructure administrators, cloud administrators, and cloud operators who are familiar with
VMware software and want to use it to quickly deploy and manage an SDDC that meets the
requirements for capacity, scalability, backup and restore, and extensibility for disaster recovery support.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
a. In the vRealize Log Insight UI, click the configuration drop-down menu icon and select
Content Packs.
b. Under Installed Content Packs, select the pack.
c. Click Alerts or Queries to view the full list of alerts for the product.
Alert name: *** CRITICAL *** Hardware: Physical event detected
Description: The purpose of this widget is to notify when the following physical hardware
events have been detected, which indicates a hardware problem. Under most normal conditions,
this widget should return no results. The following types of hardware events are returned:
• Advanced Programmable Interrupt Controller (APIC)
• Machine Check Exception (MCE)
• Non-Maskable Interrupt (NMI)
Criticality: Critical

Alert name: *** CRITICAL *** ESX/ESXi: Core dump detected
Description: A core dump has been detected, which indicates the failure of a component in
ESX/ESXi. This issue may lead to VM crashes and/or host PSODs.
Criticality: Critical

Alert name: *** CRITICAL *** ESX/ESXi: Stopped logging
Description: The purpose of this alert is to notify when an ESXi host has stopped sending
syslog to a remote server.
Criticality: Critical

Alert name: *** CRITICAL *** ESX/ESXi: RAM disk / inode table is full
Description: A root file system has reached its resource pool limit. Various administrative
actions depend on the ability to write files to various parts of the root file system and
might fail if the RAM disk and/or inode table is full.
Criticality: Critical
Procedure
1. Log in to the vRealize Log Insight user interface.
a. Open a Web browser and go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
3. Click the icon and select Manage Alerts. You see all available alerts.
4. Select the alerts that are related to vSphere.
a. In the search box of the Alerts dialog box, enter Hardware, ESX/ESXi or vCenter
Server as a search phrase, and select the alerts from the results.
b. Repeat the step until you select all the alerts related to vSphere.
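The selection in this step amounts to filtering the alert list once per search phrase and combining the results. The following Python sketch illustrates that idea under an assumption: the search is treated as a case-insensitive substring match, which this guide does not confirm. The alert names come from this guide; `select_alerts` is a hypothetical helper, not a vRealize Log Insight API.

```python
# Hypothetical illustration only; not part of any vRealize Log Insight API.
ALERTS = [
    "*** CRITICAL *** Hardware: Physical event detected",
    "*** CRITICAL *** ESX/ESXi: Core dump detected",
    "*** CRITICAL *** ESX/ESXi: Stopped logging",
    "*** CRITICAL *** ESX/ESXi: RAM disk / inode table is full",
    "Network: ESXi physical NIC down",
]
SEARCH_PHRASES = ["Hardware", "ESX/ESXi", "vCenter Server"]


def select_alerts(alerts, phrases):
    """Return alerts whose name contains any phrase (assumed case-insensitive)."""
    selected = []
    for phrase in phrases:  # one pass per search phrase, as in "repeat the step"
        for name in alerts:
            if phrase.lower() in name.lower() and name not in selected:
                selected.append(name)
    return selected


vsphere_alerts = select_alerts(ALERTS, SEARCH_PHRASES)
for name in vsphere_alerts:
    print(name)
```

In this sample list, the network alert is not selected because its name contains "ESXi" but not the full phrase "ESX/ESXi".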
Alert Name
Setting Value
Criticality critical
6. In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled vSphere alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
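As a rough mental model of the On any match option, the sketch below assumes that an alert is raised as soon as at least one incoming event satisfies the alert query. The function name, the sample log lines, and the matching rule are all made up for illustration and do not reflect how vRealize Log Insight evaluates queries internally.

```python
def should_raise_alert(events, matches_query):
    """Return True as soon as one event satisfies the alert query."""
    return any(matches_query(event) for event in events)


# Made-up sample log lines, for illustration only.
events = [
    "vmkernel: boot complete",
    "esx01 coredump written to disk",
]


def is_core_dump(line):
    """Hypothetical stand-in for an alert query."""
    return "coredump" in line.lower()


print(should_raise_alert(events, is_core_dump))
```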
Alert name: *** CRITICAL *** Storage: VSAN device offline
Description: A Virtual SAN storage device that backs up the datastores might fail. This occurs
because of faulty device firmware, physical media, or storage controller, or when certain
storage devices are not readable or writeable. Typically, such failures are irreversible. In
some instances, permanent data loss might also occur, especially when data is not replicated
on other nodes before the failure. Virtual SAN automatically recovers data when new devices
are added to the storage cluster, unless the data loss is permanent.
Criticality: Critical

Alert name: Storage: NFS lock file issue
Description: The purpose of this alert is to notify when an NFS lock file issue has been
detected. Stale NFS lock files can prevent VMs from powering on.
Criticality: Critical
Procedure
Open the vRealize Log Insight user interface.
a. Open a Web browser and go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled Storage Resources alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: VSAN - SSD health change to unhealthy state
Description: This alert fires when the state of any SSD changes to unhealthy. Possible reasons
include permanent disk failure, disk decommissioning, and node shutdown.
Criticality: Critical
Procedure
1. Open the vRealize Log Insight user interface.
a. In a Web browser, go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
6. In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled Storage Resources alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: Network: ESXi physical NIC down
Description: ESXi has reported that a physical NIC has become unavailable. Assuming other NICs
are still online this indicates a lack of redundancy and a potential performance impact. If
all physical NICs for a vSwitch/dvSwitch are unavailable then communication problems to VMs
and/or the ESXi host may be possible.
Criticality: Critical
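The two cases in this alert description (some NICs down versus all NICs down) can be expressed as a small classification. The sketch below is an illustration only: `uplink_status`, the status labels, and the `vmnic` names are assumptions introduced here, not values reported by ESXi or vRealize Log Insight.

```python
def uplink_status(nic_up):
    """Classify a virtual switch by the link state of its physical NICs.

    nic_up maps a NIC name to True (link up) or False (link down).
    """
    if not nic_up:
        raise ValueError("at least one physical NIC is required")
    down = [name for name, up in nic_up.items() if not up]
    if not down:
        return "ok"
    if len(down) < len(nic_up):
        # Other NICs are still online: redundancy is reduced and
        # performance may be affected.
        return "degraded"
    # No uplinks left: VMs and/or the host may be unreachable.
    return "outage"


print(uplink_status({"vmnic0": True, "vmnic1": False}))
```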
Procedure
1. Open the vRealize Log Insight user interface.
a. In a Web browser, go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
6. In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled Network alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: VMW_NSX_Manager - Host Communication Errors
Description: This event is generated when NSX Manager fails to receive a heartbeat from the
UserWorld Agent on the host within the threshold period. The output is grouped by host-id.
The host-id can be found in vCenter.
Criticality: Critical
Procedure
Open the vRealize Log Insight user interface.
a. In a Web browser, go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled vSphere alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: vRops: VC stats query timed out occurred
Description: The purpose of this alert is to notify when a vROps instance is not able to get
data back from a vCenter instance within the 5-minute interval, so that the metrics back up
and are dropped with the error: Communication Error:
com.integrien.adapter.vmware.VcCollector.collectMetrics - Vc stats query timed out (ms): 300377.
This is usually due to intermittent connection issues with the vCenter and hosts, or to the
network not being able to handle the request and timing out.
Criticality: Critical

Alert name: vRops: Out of Memory errors occurred
Description: This alert is generated when OutOfMemoryError: Java heap space occurs. This could
indicate memory issues and could lead to degradation in performance.
Criticality: Critical
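The sample error above reports an elapsed time of 300377 ms, which is just over the 5-minute interval (300000 ms). The following Python sketch shows one way to extract that number from the message and compute the overrun. The regular expression and the `timed_out_ms` helper are assumptions made for illustration; they are not part of vRealize Operations Manager or vRealize Log Insight.

```python
import re

COLLECTION_INTERVAL_MS = 5 * 60 * 1000  # the 5-minute interval, in milliseconds

# Error text as quoted in the alert description.
MESSAGE = (
    "com.integrien.adapter.vmware.VcCollector.collectMetrics - "
    "Vc stats query timed out (ms): 300377"
)


def timed_out_ms(message):
    """Extract the elapsed milliseconds from a 'Vc stats query timed out' message."""
    match = re.search(r"Vc stats query timed out \(ms\): (\d+)", message)
    return int(match.group(1)) if match else None


elapsed = timed_out_ms(MESSAGE)
overrun = elapsed - COLLECTION_INTERVAL_MS
print(f"elapsed={elapsed} ms, overrun={overrun} ms")
```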
Procedure
Open the vRealize Log Insight user interface.
a. Open a Web browser and go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled Network alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: *** CRITICAL *** vRA CAFE service unavailable!
Description: A vRA CAFE service has become unavailable. This may happen because:
• A service has failed. If the service does not automatically restart, this may impact vRA's
ability to function.
• A service is blocked and cannot respond at the moment. This may indicate increased load
within the environment.
• vRA is starting and certain dependencies of the component are not available yet. This issue
should clear automatically as all services come online.
Criticality: Critical

Alert name: *** CRITICAL *** vRA IaaS Services Stopped
Description: A vRA service has become unavailable. This may happen because:
• A service has failed. If the service does not automatically restart, this may impact vRA's
ability to function.
• A service is blocked and cannot respond at the moment. This may indicate increased load
within the environment.
• vRA is starting and certain dependencies of the component are not available yet. This issue
should clear automatically as all services come online.
Criticality: Critical

Alert name: *** CRITICAL *** vRA disk is full
Description: A vRA IaaS service has become unavailable. This may happen because:
• A service has failed. If the service does not automatically restart, this may impact vRA's
ability to function.
• A service is blocked and cannot respond at the moment. This may indicate increased load
within the environment.
• Management Agent: in an HA deployment, only ONE Management Agent instance should be running.
If more than one is running, this will cause issues with normal functioning of the system.
Criticality: Critical
Procedure
1. Open the vRealize Log Insight user interface.
a. Open a Web browser and go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
Alert Name
Setting Value
Criticality critical
6. In the Alerts dialog box, set the Raise an alert option for each enabled alert.
a. Click the Edit button on the first enabled Network alert.
b. In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Alert name: vRO: Orchestrator STANDBY Alert
Description: The Orchestrator server state switched to STANDBY mode. In general there could be
two reasons:
• There are enough RUNNING nodes in the cluster, and the current node stays on standby as a
backup node, waiting to switch to the RUNNING state if needed.
• Problems with critical components, such as the database or the authentication provider, have
been detected and prevent the normal functioning of the server node. The current node is
considered unhealthy. The server monitors the critical components and tries to recover as soon
as the problems are solved. The current node does not accept any new requests, and all its
workflows are resumed on other healthy nodes.
Criticality: Critical

Alert name: vRO: Invalid Login Alert
Description: A failed login attempt has been detected. The reason could be wrong credentials
used to log in to the server, or a malicious attempt to access the server.
Criticality: Critical

Alert name: vRO: Orchestrator Reboot Alert
Description: The Orchestrator server has been started or rebooted. The cause could be a planned
reboot or the result of an unwanted action.
Criticality: Critical

Alert name: vRO: Workflow Modification Alert
Description: The content of some workflow has been modified. This could be a planned workflow
content update or the result of unwanted malicious actions.
Criticality: Critical

Alert name: vRO: Orchestrator Workflow Failure Alert
Description: Orchestrator workflow run failures have been detected. This could be due to
infrastructure problems with external systems.
Criticality: Critical
Procedure
1. Open the vRealize Log Insight user interface.
a. Open a Web browser and go to the following URL.
Region A https://vrli-cluster-01.sfo01.rainpole.local
Setting Value
Password vrli_admin_password
a. In the search box of the Alerts dialog box, enter vro as a search phrase, and select the
alerts from the results.
b. Repeat the step until you select all the alerts related to vRealize Orchestrator.
Alert Name
Setting Value
Criticality critical
In the Alerts dialog box, set the Raise an alert option for each enabled alert.
In the Edit Alert dialog box, under Raise an alert, select On any match, and click Save.
Setting Value
Password vrops_admin_password
On the Home page, from the Actions menu select Create Dashboard.
In the Dashboard Configuration section of the New Dashboard dialog box, configure the
following settings.
Is default No
b. Configure the following settings at the top of the Edit Scoreboard dialog box.
Refresh Content On
Self Provider On
Show Sparkline On
c. Click the Object Types tab, and add the metrics that this widget displays.
d. In the metrics pane on the right, enter Capacity Usage in the search box, and press Enter.
e. In the search result, expand the CPU metric category and double-click the Capacity Usage
(%) metric to add it to the list of metrics that the widget displays.
f. In the metrics list, double-click each of the following collection options, customize it and click
Update.
Yellow Bound 60
Orange Bound 75
Red Bound 90
Configure the second Scoreboard widget to show metrics for memory usage.
a. In the upper-right corner of the second Scoreboard widget, click the Edit icon.
b. Configure the following settings at the top of the Edit Scoreboard dialog box.
Refresh Content On
Self Provider On
Show Sparkline On
c. Click the Object Types tab, and add the metrics that this widget displays.
d. In the metrics pane on the right, enter Usage / Usable in the search box, and press
Enter.
e. In the search result, expand the Memory metric category and double-click the Usage /
Usable (%) metric to add it to the list of metrics that the widget displays.
f. In the metrics list, double-click each of the following collection options, customize it and click
Update.
Yellow Bound 70
Orange Bound 80
Red Bound 90
Configure the third Scoreboard widget to show metrics for storage usage.
a. In the upper-right corner of the third Scoreboard widget, click the Edit icon.
b. Configure the following settings at the top of the Edit Scoreboard dialog box.
Refresh Content On
Self Provider On
Show Sparkline On
c. Click the Object Types tab, and add the metrics that this widget displays.
d. In the metrics pane on the right, enter Used Space in the search box, and press Enter.
e. In the search result, expand the Capacity metric category and double-click the Used Space
(%) metric to add it to the list of metrics that the widget displays.
f. Enter Number of VMs in the search box and press Enter.
g. In the search result, expand the Summary metric category and double-click the Total
Number of VMs metric to add the number of VMs in the datastore to the list of metrics in the
widget.
h. In the metrics list at the bottom, double-click each metric row, customize the following
collection attribute for the metric and click Update.
Yellow Bound 60
Orange Bound 70
Red Bound 80
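The Yellow, Orange, and Red bounds configured for the CPU, memory, and storage Scoreboard widgets can be read as color thresholds. The Python sketch below illustrates that reading under two assumptions that this guide does not state: a value at or above a bound takes that bound's color, and a value below the Yellow bound is shown as green. The `scoreboard_color` helper and the widget keys are hypothetical names.

```python
# Bounds as configured in this procedure for each Scoreboard widget.
BOUNDS = {
    "cpu": {"yellow": 60, "orange": 75, "red": 90},      # Capacity Usage (%)
    "memory": {"yellow": 70, "orange": 80, "red": 90},   # Usage / Usable (%)
    "storage": {"yellow": 60, "orange": 70, "red": 80},  # Used Space (%)
}


def scoreboard_color(widget, value):
    """Return the assumed color for a metric value on the given widget."""
    bounds = BOUNDS[widget]
    if value >= bounds["red"]:
        return "red"
    if value >= bounds["orange"]:
        return "orange"
    if value >= bounds["yellow"]:
        return "yellow"
    return "green"  # assumption: below the Yellow bound means healthy


for widget in BOUNDS:
    print(widget, scoreboard_color(widget, 72))
```

The same value can map to different colors on different widgets: 72 percent falls below the Orange bound for CPU and memory but above it for storage.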
vRealize Operations Manager builds an application to determine how your environment is affected
when one or more components in an application experience problems. You can also monitor the
overall health and performance of the application.
vRealize Operations Manager collects data from the components in the application and displays the
results in a summary dashboard for each application with a real-time analysis for any or all of the
components.
Because the Management Pack for vRealize Log Insight does not collect monitoring data about the
virtual machines of the vRealize Log Insight deployment, you create an application to watch their
state.
Procedure
In a Web browser, open the main page of vRealize Operations Manager.
If you use the public interface to the SDDC, go to https://vrops-cluster-01.rainpole.local
Use the admin user name and the vrops_admin_password password to log in.
In the left pane of vRealize Operations Manager, click the Environment menu and click
Applications.
On the Applications tab page, click the Add icon to add an application.
In the Add Application dialog box, select Custom and click OK.
The Application Management dialog box appears where you select the objects for the
application.
In the Application Management dialog box, in the Name text box enter vRealize Log
Insight.
In the Tiers pane, click Add Tier, enter Log Insight VMs as Tier Name and click Update.
In the objects list underneath, enter vrli in the search box, and press Enter.
Select the virtual machine objects of vRealize Log Insight and drag them to the Tier Objects
pane.
VMs
vrli-mstr-01.sfo01
vrli-wrkr-01.sfo01
vrli-wrkr-02.sfo01
Click Save.
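The result of this procedure is an application that contains one tier, which in turn contains the Log Insight virtual machines found by searching for vrli. The Python sketch below models that structure as plain data. The `Application` and `Tier` classes, the `search` helper, and the substring matching rule are assumptions for illustration and do not correspond to a vRealize Operations Manager API. The VM names are the ones listed in this guide.

```python
from dataclasses import dataclass, field


@dataclass
class Tier:
    name: str
    objects: list = field(default_factory=list)


@dataclass
class Application:
    name: str
    tiers: list = field(default_factory=list)


# VM names taken from this guide (Log Insight and Orchestrator nodes).
INVENTORY = [
    "vrli-mstr-01.sfo01",
    "vrli-wrkr-01.sfo01",
    "vrli-wrkr-02.sfo01",
    "vra01vro01a",
    "vra01vro01b",
]


def search(inventory, text):
    """Return object names containing the search text (assumed substring match)."""
    return [name for name in inventory if text in name]


app = Application(
    name="vRealize Log Insight",
    tiers=[Tier(name="Log Insight VMs", objects=search(INVENTORY, "vrli"))],
)
print(app)
```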
5. In the Application Management dialog box, in the Application text box enter vRealize
Orchestrator.
6. In the Tiers pane, click Add Tier, enter Orchestrator VMs as Tier Name and click Update.
7. In the objects list underneath, enter vravro in the search box and press Enter.
8. Select the virtual machine objects of vRealize Orchestrator and drag them to the Tier
Objects pane.
VMs
vra01vro01a
vra01vro01b
9. Click Save.
Is default Yes
Refresh Content: On | On | On
Group by: vCenter Adapter > Datacenter | vCenter Adapter > Datacenter | vCenter Adapter > Datacenter
Then by: - | - | -
Object type: vCenter Adapter > Host System | vCenter Adapter > Host System | vCenter Adapter > Datastore
Attribute type: CPU > Capacity Remaining (%) | Network I/O > Usage Rate > Capacity Remaining (%) | Datastore I/O > Read Latency (ms)
Min Value: 0 | 0 | 0
Max Value: 25 | 20 | 30
Refresh Content: On | On | On
Group by: vCenter Adapter > Datacenter | vCenter Adapter > Datacenter | vCenter Adapter > Datacenter
Then by: - | - | -
Object type: vCenter Adapter > Host System | vCenter Adapter > Host System | vCenter Adapter > Datastore
Attribute type: Memory > Capacity Remaining (%) | Network I/O > Packets Dropped | Datastore I/O > Write Latency (ms)
Min Value: 0 | 0 | 0
Max Value: 25 | 1 | 30
Refresh Content: On | On | On
Group by: vCenter Adapter > Datacenter | vCenter Adapter > Datacenter | vCenter Adapter > Datacenter
Then by: - | - | -
Object type: vCenter Adapter > Virtual Machine | vCenter Adapter > Virtual Machine | vCenter Adapter > Virtual Machine
Attribute type: CPU > Usage (%) | Memory > Usage (%) | Virtual Disk > Total Latency
Min Value: 80 | 50 | 0
Refresh Content: On | On | On
Group by: vCenter Adapter > Datacenter | vCenter Adapter > Datacenter | vCenter Adapter > Datacenter
Then by: - | - | -
Object type: vCenter Adapter > Virtual Machine | vCenter Adapter > Virtual Machine | vCenter Adapter > Virtual Machine
Attribute type: CPU > CPU Contention (%) | Memory > Swapped (KB) | Disk Space > Capacity Remaining (%)
Min Value: 0 | 0 | 5
Max Value: 2 | 1 | 20
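The Min Value and Max Value settings bound the color scale of each widget in the tables above. As a rough illustration, the Python sketch below maps a metric value to a position between 0.0 and 1.0 on that scale. It assumes linear interpolation with clamping at both ends, which this guide does not specify, and `color_position` is a hypothetical helper rather than a vRealize Operations Manager function.

```python
def color_position(value, min_value, max_value):
    """Map a metric value to a position in [0.0, 1.0] on the color scale.

    Assumes a linear scale; values outside the range are clamped.
    """
    if max_value <= min_value:
        raise ValueError("max_value must be greater than min_value")
    fraction = (value - min_value) / (max_value - min_value)
    return max(0.0, min(1.0, fraction))


# CPU > Capacity Remaining (%), configured with Min Value 0 and Max Value 25.
print(color_position(12.5, 0, 25))
# Disk Space > Capacity Remaining (%), configured with Min Value 5 and Max Value 20.
print(color_position(5, 5, 20))
```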
Refresh Content On
Self Provider On
Mode Children
Pagination number 15
Metric Health
c. From the objects list at the bottom, expand Function and select the SDDC Management
custom group.
In the New Dashboard dialog box, click Save.
The SDDC Overview dashboard becomes available on the Home page of the vRealize Operations
Manager user interface.
Recipients [email protected]
Filtering Criteria
Repeat the steps to create the notifications that are defined in List of Notifications for vRealize
Operations Manager.
You define notifications from the Content > Notifications page in the vRealize Operations Manager user interface.
See Create Notifications in vRealize Operations Manager.
Recipients [email protected]
Notification: Virtual machine has disk I/O latency problem caused by snapshots
Object Type: vCenter Adapter > Virtual Machine
Alert Definition: Virtual machine has disk I/O latency problem caused by snapshots

Notification: Virtual Machine is running out of disk space
Object Type: vCenter Adapter > Virtual Machine
Alert Definition: Virtual Machine is running out of disk space

Notification: Virtual machine has large disk snapshots
Object Type: vCenter Adapter > Virtual Machine
Alert Definition: Virtual machine has large disk snapshots

Notification: Not enough resources for vSphere HA to start the virtual machine
Object Type: vCenter Adapter > Virtual Machine
Alert Definition: Not enough resources for vSphere HA to start the virtual machine

Notification: Host has CPU contention caused by overpopulation of virtual machines
Object Type: vCenter Adapter > Host System
Alert Definition: Host has CPU contention caused by overpopulation of virtual machines

Notification: Host has memory contention caused by overpopulation of virtual machines
Object Type: vCenter Adapter > Host System
Alert Definition: Host has memory contention caused by overpopulation of virtual machines

Notification: vSphere DRS enabled cluster has CPU contention caused by overpopulation of virtual machines
Object Type: vCenter Adapter > Cluster Compute Resource
Alert Definition: DRS-enabled cluster has CPU contention caused by overpopulation of virtual machines

Notification: vSphere DRS enabled cluster has unexpected high CPU workload
Object Type: vCenter Adapter > Cluster Compute Resource
Alert Definition: DRS-enabled cluster has unexpected high CPU workload

Notification: vSphere DRS enabled cluster has memory contention caused by overpopulation of virtual machines
Object Type: vCenter Adapter > Cluster Compute Resource
Alert Definition: DRS-enabled cluster has memory contention caused by overpopulation of virtual machines

Notification: vSphere DRS enabled cluster has unexpected high memory workload and contention
Object Type: vCenter Adapter > Cluster Compute Resource
Alert Definition: DRS-enabled cluster has unexpected high memory workload and contention
Notification: Distributed switch configuration is out of sync
Object Type: vCenter Adapter > vSphere Distributed Switch
Alert Definition: Distributed Switch configuration is out of sync

Notification: NSX Manager resource usage is high
Object Type: NSX-vSphere Adapter > NSX-vSphere Manager
Alert Definition: Manager resource usage is high

Notification: NSX Manager API calls are failing
Object Type: NSX-vSphere Adapter > NSX-vSphere Manager
Alert Definition: Manager API calls are failing

Notification: VXLAN segment range has been exhausted
Object Type: NSX-vSphere Adapter > NSX-vSphere Manager
Alert Definition: VXLAN segment range has been exhausted

Notification: Less than three NSX Controllers are active
Object Type: NSX-vSphere Adapter > NSX-vSphere Controller Cluster
Alert Definition: Less than three controllers are active

Notification: Edge resource usage is high
Object Type: NSX-vSphere Adapter > NSX-vSphere Edge
Alert Definition: Edge resource usage is high

Notification: The Edge is not highly available
Object Type: NSX-vSphere Adapter > NSX-vSphere Edge
Alert Definition: The Edge is not highly available
Notification: Datastore is running out of disk space
Object Type: vCenter Adapter > Datastore
Alert Definition: Datastore is running out of disk space
Notification: One or more vRealize Operations services are down
Object Type: vRealize Operations Adapter > vRealize Operations Node
Alert Definition: One or more vRealize Operations services are down

Notification: Disk space on a vRealize Operations Manager node is low
Object Type: vRealize Operations Adapter > vRealize Operations Node
Alert Definition: Disk space on node is low

Notification: Node processing queue is backing up
Object Type: vRealize Operations Adapter > vRealize Operations Node
Alert Definition: Node processing queue is backing up

Notification: FSDB corrupted files
Object Type: vRealize Operations Adapter > vRealize Operations Fsdb
Alert Definition: Fsdb corrupted files

Notification: FSDB failed to repair corrupted files
Object Type: vRealize Operations Adapter > vRealize Operations Fsdb
Alert Definition: Fsdb failed to repair corrupted files

Notification: FSDB overload
Object Type: vRealize Operations Adapter > vRealize Operations Fsdb
Alert Definition: Fsdb high load

Notification: Remote Collector one or more vRealize Operations services are down
Object Type: vRealize Operations Adapter > vRealize Operations Remote Collector
Alert Definition: One or more vRealize Operations services are down

Notification: Remote Collector not reporting correct number of services
Object Type: vRealize Operations Adapter > vRealize Operations Remote Collector
Alert Definition: Remote Collector not reporting correct number of services

Notification: vRealize Operations Cluster processes might be out of memory
Object Type: vRealize Operations Adapter > vRealize Operations Cluster
Alert Definition: vRealize Operations Cluster processes may not have enough memory
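Each notification in these lists pairs an object type with an alert definition and sends matching alerts to a recipient. The Python sketch below models two of the listed notifications as data and looks up which ones apply to a given alert. The `NotificationRule` class, the `matching_rules` helper, and the exact-match comparison are assumptions for illustration. The recipient address is a placeholder because the address is not legible in this copy of the guide.

```python
from dataclasses import dataclass

# Placeholder recipient; the real address is not shown in this document.
RECIPIENT = "ops@example.com"


@dataclass(frozen=True)
class NotificationRule:
    name: str
    object_type: str
    alert_definition: str
    recipient: str = RECIPIENT


RULES = [
    NotificationRule(
        name="Datastore is running out of disk space",
        object_type="vCenter Adapter > Datastore",
        alert_definition="Datastore is running out of disk space",
    ),
    NotificationRule(
        name="FSDB overload",
        object_type="vRealize Operations Adapter > vRealize Operations Fsdb",
        alert_definition="Fsdb high load",
    ),
]


def matching_rules(rules, object_type, alert_definition):
    """Return rules whose object type and alert definition both match exactly."""
    return [
        rule
        for rule in rules
        if rule.object_type == object_type
        and rule.alert_definition == alert_definition
    ]


hits = matching_rules(
    RULES,
    "vRealize Operations Adapter > vRealize Operations Fsdb",
    "Fsdb high load",
)
print([rule.name for rule in hits])
```

Keeping the notification name separate from the alert definition matters here because several notifications, such as FSDB overload, use a name that differs from the alert definition they filter on.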
Region A https://mgmt01vc01.sfo01.rainpole.local/vsphere-client
Region A mgmt01vdp01
On the Configuration tab, click the Email button and click Edit.
Configure the following settings for email notification and click Save.
To address(es) [email protected]
Click the Send test email hyperlink and verify that you receive the test email.
On the Home page of the vRealize Automation management console, click the Administration
tab and click Notifications.
Configure the scenarios that you want to receive notifications about. By default, all scenarios are active.
a. On the Notifications page, select Scenarios in the navigator.
b. If you do not want to be alerted on a scenario, select it and click the Suspend button.
c. Verify that each of the scenarios you want to receive notifications about is Active.
b. Under Notifications, select English (United States) from the Language drop-down menu.
c. Select Enabled next to the Email protocol, click Apply and click Close.