IT Alert Operations: Standard Operating Procedure

This standard operating procedure describes alert operations for the RDSProxy back-end and provides a template for other elements. Each section gives a description, dashboard links, alert definitions, symptoms, and a recovery process. For the RDSProxy back-end, the alert triggers when 3 sequential health checks fail and notifies the MySQL Administrators team. If the alert is not acknowledged within 30 minutes, or if two or more systems alert, it is escalated to the Senior DBAs. Recovery involves manually running the health check and re-enabling the back-end host in HAProxy.

Uploaded by

Abdul Kareem Kalmata


IT Alert Operations

STANDARD OPERATING PROCEDURE

Author
[COMPANY NAME] | [COMPANY ADDRESS]
9/24/2019 1:40:00 AM
TABLE OF CONTENTS
RDSProxy Back-end
  Description
    Dashboard Links
  Alert Definition
    State: Down
  Symptoms
  Recovery Process
[Element]
  Description
    Dashboard Links
  Alert Definition
    State [Warning/Critical/Down/Unreachable]
  Symptoms
  Recovery Process
Version Date Editor

RDSPROXY BACK-END

DESCRIPTION

RDSProxy is an instance running HAProxy in TCP proxy mode: it binds a local listening socket to a remote socket on a “back-end” host and then steps away, allowing the native transmission to occur on the wire.
HAProxy monitors the back-end RDS instances by making a MySQL client connection to each of them as the
haproxy_check user.
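
The health check described above can be reproduced by hand. The following is a minimal sketch, not HAProxy's own implementation. It assumes the haproxy_check user can log in without a password and that a 5-second connect timeout is acceptable. The function name check_backend and the MYSQL_BIN override are hypothetical names introduced for illustration.

```shell
# Sketch: one manual health check against a back-end RDS instance.
# MYSQL_BIN is a hypothetical override so the client binary can be swapped out.
MYSQL_BIN="${MYSQL_BIN:-mysql}"

# Usage: check_backend <host>
# Prints "up" if a MySQL client connection as haproxy_check succeeds, else "down".
check_backend() {
    if "$MYSQL_BIN" --user=haproxy_check --host="$1" \
        --connect-timeout=5 --execute='SELECT 1' >/dev/null 2>&1; then
        echo "up"
    else
        echo "down"
    fi
}
```

For example, `check_backend ha_host` prints a single word that can be compared with what HAProxy currently reports for the same host.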

DASHBOARD LINKS
RDSProxy Dashboard
MySQL Dashboard

ALERT DEFINITION

STATE: DOWN

TRIGGER
If 3 sequential health checks fail for a given back-end RDS instance, HAProxy marks that instance
"down" and sends no further traffic to it. Once a back-end server is marked down, HAProxy will not attempt
to re-enable it; you must do this manually.
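
The trigger rule can be illustrated with a small simulation. This is a sketch of the rule as this SOP states it, not HAProxy code: backend_state and FALL_THRESHOLD are hypothetical names, and the check results are passed in as the words "ok" and "fail".

```shell
# Sketch: replay a sequence of health-check results and print the resulting
# back-end state. FALL_THRESHOLD=3 mirrors the trigger described above.
FALL_THRESHOLD=3

# Usage: backend_state ok fail fail fail ...
backend_state() {
    failures=0
    for result in "$@"; do
        if [ "$result" = "fail" ]; then
            failures=$((failures + 1))
        else
            # Any successful check resets the run of sequential failures.
            failures=0
        fi
        # Per this SOP, a back-end that is marked down stays down until an
        # operator re-enables it, so later results are not evaluated.
        if [ "$failures" -ge "$FALL_THRESHOLD" ]; then
            echo "down"
            return 0
        fi
    done
    echo "up"
}
```

Two failures followed by a success leave the back-end up, because the failures must be sequential. Three failures in a row mark it down even if later checks would pass.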

NOTIFICATION
Team: MySQL Administrators

ESCALATION
If two or more systems alert with this message, escalate immediately to Senior DBAs.
If the alert is not acknowledged or resolved within 30 minutes, escalate the notification to Senior DBAs.
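
The two escalation rules can be written as a single decision. This is a sketch only: escalation_target is a hypothetical helper, and treating exactly 30 minutes as already overdue is an assumption, since the SOP does not say whether the boundary is inclusive.

```shell
# Sketch: decide who should currently hold the alert.
# Usage: escalation_target <alerting_systems> <minutes_unacknowledged>
escalation_target() {
    systems="$1"
    minutes="$2"
    # Rule 1: two or more alerting systems escalate immediately.
    # Rule 2: 30 minutes without acknowledgement escalates (boundary assumed inclusive).
    if [ "$systems" -ge 2 ] || [ "$minutes" -ge 30 ]; then
        echo "Senior DBAs"
    else
        echo "MySQL Administrators"
    fi
}
```

For example, `escalation_target 1 10` keeps the alert with the MySQL Administrators, while `escalation_target 2 0` sends it to the Senior DBAs straight away.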

RESET CONDITION
Health check reports status as “up”.

SYMPTOMS

Remote calls to the affected instance may return results slowly. Failures of multiple RDSProxy back-ends at the
same time will degrade performance.

RECOVERY PROCESS

Manually run the HAProxy health check:

mysql --user=haproxy_check --host=ha_host

Manually re-enable the back-end host:

hactl enable server ha_host/ha_service health up

hactl set server ha_host/ha_service health up

Follow the haproxy.log file to validate that the back-end host has been re-established.

tail -f /var/log/haproxy.log
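
Where the hactl wrapper is not available, the re-enable step can also be sketched against HAProxy's Runtime API, which accepts `enable server <backend>/<server>` and `set server <backend>/<server> health up` on an admin-level stats socket. This is an alternative technique, not the procedure above. The socket path, the use of socat, and the mapping of ha_host/ha_service onto a backend/server pair are all assumptions, and recovery_commands and apply_recovery are hypothetical helper names.

```shell
# Sketch: re-enable a back-end through the HAProxy Runtime API.
# Assumed socket path; it must match the "stats socket" line in haproxy.cfg.
HAPROXY_SOCKET="${HAPROXY_SOCKET:-/var/run/haproxy.sock}"

# Print the two Runtime API commands for one server, one per line.
# Usage: recovery_commands <backend> <server>
recovery_commands() {
    printf 'enable server %s/%s\n' "$1" "$2"
    printf 'set server %s/%s health up\n' "$1" "$2"
}

# Send each command to the stats socket, one connection per command.
# Requires socat and a socket configured with admin level.
# Usage: apply_recovery <backend> <server>
apply_recovery() {
    recovery_commands "$1" "$2" | while IFS= read -r cmd; do
        printf '%s\n' "$cmd" | socat stdio "UNIX-CONNECT:${HAPROXY_SOCKET}"
    done
}
```

Running `recovery_commands` on its own prints the commands without sending them, which makes it easy to review them before calling `apply_recovery`.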

[ELEMENT]

DESCRIPTION

Describe the affected element and what it entails. Include a plain-language description of the element and its
role in the organization.

DASHBOARD LINKS
Links to a dashboard for monitoring the service.

ALERT DEFINITION

STATE [WARNING/CRITICAL/DOWN/UNREACHABLE]

TRIGGER
Trigger conditions for the above state. Repeat the “State” header with additional statuses if this element can
trigger more than one state.

NOTIFICATION
Who will get the notifications and which transport will be used.

ESCALATION
If there are any escalation paths, define them here.

RESET CONDITION
How do we know that this is resolved?

SYMPTOMS

What are the symptoms seen by IT, end-users, external services, etc.?

RECOVERY PROCESS

How do you recover from this alert? If things are done automatically via the NMS, define them here.
