DP4400 Monitoring - Troubleshooting - v1.0
DP4400 Monitoring - Troubleshooting - v1.0
Sep 2018
• Along with hardware events, events for vCenter, ESXi and IDPA VMs will be seen in Health tab of ACM UI
• Only “critical” and “fatal” events will be sent to ESRS if ESRS is configured
• vCenter events window could be changed if required, by changing the parameters in below file on ACM
vi /usr/local/dataprotection/customscripts/HealthmonitorConfig.properties
shm.max_no_of_event=30
shm.max_time_in_min=30
shm.days_to_delete_event=30
shm.snmp_validator_period_in_mins=240
shm.snmp_validator_delay_in_mins=1
shm.vcenter_events_period_in_mins=20
shm.vcenter_events_delay_in_mins=0
• 7a. DP4400-Health-Monitoring.mp4
• Possible causes
• IP is already in use
• Wrong gateway or subnet mask provided
• Troubleshooting
• Look for diagnostic report on ACM UI to figure out cause of error
• If diagnostic report is not clear then, look for “Failed” keyword in
“/usr/local/dataprotection/var/configmgr/server_data/logs/server.log” file to trace down the step which failed.
• Click on retry to redo the configuration post correcting the issue
• 7b. DP4400-Initial-network-configuration-failure
• Possible causes
• Wrong IP input provided when IP range not used
• Wrong gateway or subnet mask provided
• License is incorrect
• Troubleshooting
• Look for diagnostic report on ACM UI to figure out cause of error
• Search the logs for ‘Received notifyStatus from the task’ statements. This will lead to the log statements, where the
configuration progress is reported for various steps being executed by multiple tasks.
• You can see status of tasks/step in IN_PROGRESS/FAILED/COMPLETED state. If you find some FAILED task
log, look at the log statements of same thread, just before this statement, to see the exact failure. Thread id in below
logs is pool-2-thread-2.
• If DDVE license failed to apply, check whether locking ID mentioned in the license matches the
appliance serial number.
• If it’s non-critical component (DPA or DPS or CDRA ), then you can retry deployment from ACM
Dashboard
• If its critical component (DDVE, AVE or DPC), retry the configuration post correcting the issue
• 7c. DP4400-DPS-failure.mp4
• Possible causes
• Most of the cases could be related to timing issue
• Troubleshooting
• Look for diagnostic report on ACM UI to figure out cause of error
• Integration tasks have messages as shown in below example of server.log file on ACM
• Start Log: Executing IntegrateCDRATask
• Completion Log: Execution of IntegrateCDRATask completed.
• Error Log: Exception occurred while executing IntegrateCDRATask.
• If you see a message “Health monitor processes are down“ on Health Monitoring UI
• Check services status on ACM
service dataprotection_database status
service rabbitmq-server status
• If the service status is not active, restart the respective service
/etc/init.d/DellPTAgent status
• If its not in running state, start the service
/etc/init.d/DellPTAgent start
• If Database entry does not exist then contact Install Base team
• If database entry exists and ESRS registration fails then check server.log on ACM to get more
information around error
• As a part of DP4400 Deployment and Configuration, a new user “idpauser” gets created on
vCenter as well as ESXi.
• This is limited privilege user with password set to common password provided by customer
• VCSA: idpauser permissions:
"VirtualMachine.Interact.ToolsInstall", "VirtualMachine.Interact.PowerOff", "VirtualMachine.Interact.PowerOn",
“VirtualMachine.Interact.Reset", "VirtualMachine.Interact.DeviceConnection", "VirtualMachine.Interact.Suspend",
"VirtualMachine.Interact.ConsoleInteract","VirtualMachine.Interact.AnswerQuestion", "Global.LogEvent",
"Global.Diagnostics","System.Read“
• ESXi: idpauser permission:
"System.View", "System.Read", "System.Anonymous", "Host.Config.Maintenance", "Host.Config.AutoStart",
"Host.Config.Network", "Host.Config.NetService", "VirtualMachine.Interact.PowerOn", "VirtualMachine.Interact.PowerOff",
"VirtualMachine.Provisioning.ReadCustSpecs", "VirtualMachine.Interact.Reset",
"VirtualMachine.State.RemoveSnapshot", "VirtualMachine.Interact.ConsoleInteract", "VApp.PowerOn", "VApp.PowerOff",
"Alarm.Acknowledge", "Network.Assign", "Network.Config", "Network.Delete", "Network.Move"
• Root user password for both vCenter and ESXi is randomly generated
• ACM saves VCSA and ESXi root user passwords in encrypted format.
• VCSA root password is stored in file
/usr/local/dataprotection/var/configmgr/server_data/config/componentCredentials.xml
<vCenterPassword>CIPHER_TEXT</vCenterPassword>
• Support can use decryption utility and run it on ACM to decrypt the password if required for
troubleshooting