Visit Manager Workcard Viewer - AWS - Op Guide
Visit Manager Workcard Viewer - AWS - Op Guide
Consumer Code DL
ia l
DR Tier: Sev 2
nt
Recovery Priority: RP1.2
i d e
n f
C o
OpGuide v7.1 - 2023
Contents
Operational Guide ........................................................................................................................ 1
History ....................................................................................................................................... 4
System Information................................................................................................................... 6
ia l
System Description ................................................................................................................... 6
nt
Business Impact ..................................................................................................................... 7
d e
Technical Description .............................................................................................................. 7
i
f
User Profile ................................................................................................................................. 8
n
System Availability....................................................................................................................... 8
o
Infrastructure .............................................................................................................................. 9
C
Infrastructure Details for Production Environment .......................................................................... 9
Monitoring ................................................................................................................................ 10
System Contacts........................................................................................................................ 15
l
Log Files ................................................................................................................................ 17
ia
Troubleshooting Checklist ......................................................................................................... 18
t
Non-Production Environment ....................................................................................................... 22
n
Known Issues & Impact .............................................................................................................. 25
i d e
Business Impact ..................................................................................................................... 25
f
Technical Workaround to minimize impact .................................................................................. 25
n
C o
OpGuide v7.1 - 2023
History
OpGuide History
ia
nt
i d e
n f
C o
OpGuide v7.1 - 2023
Purpose:
Audience:
Operational Support Teams (e.g., Availability, Incident and Crisis Management, System
Monitoring, Network Monitoring).
ia l
nt
i d e
n f
C o
OpGuide v7.1 - 2023
System Profile
System Information
System Aliases AMDS-DS, PPt
ia l
t
Facilities Usage Comment
e n
Vendor hosted AWS
f i d
n
System Priority Level
o
Severity Level Sev 2
C Support
Style
Standard TechOps
System Description
Business Description
The AMDS-DS application provides access to work cards and work packages for the aircraft mechanics
worldwide 24 x 7. This application is used to print aircraft visit work packages for Delta aircraft for Line
Maintenance, Base Maintenance, and Outside Service Repairs.
OpGuide v7.1 - 2023
Business Impact
The airline maintenance technicians would not have access to the maintenance instructions needed to
perform the maintenance on the aircraft.
Airline maintenance technicians could view the aircraft maintenance manuals for maintenance
procedures located within the SkyView/OTIS application. But any customization that are on the work
cards would not be displayed in the manuals.
l
Technical Description
ia
Vendor hosted AWS
nt
Failover to DR region is the only workcaround
i d e
n f
C o
OpGuide v7.1 - 2023
User Profile
Type of Users TechOps Aircraft Maintenance Technicians
System Availability
Operational Windows – Eastern Time
ia l
Identify the primary business hours the system is in use. Please use the 24-hour Eastern Time clock when
0000
2359
0000
2359
0000
2359
nt
0000
2359
0000
2359
0000
2359
0000
2359
i d e
Identify the start and end times for any backup and maintenance activities for this system. Place a check
f
mark in the corresponding “yes” or “no” box indicating whether the system is available during those times.
Please use the 24-hour Eastern Time clock when indicating times.
Activity
No Mtc.
System
o
Monday
Yes
nTuesday
Yes
Wednesday
Yes
Thursday
Yes
Friday
Yes
Saturday
Yes
Sunday
Yes
available?
C No No No No No No No
OpGuide v7.1 - 2023
Infrastructure
ia l DR Server Names
5. Serverless/Lambda Functions No
e
6. Application type No N/A – Vendor hosted AWS
8. Operating system
9. Software/Language
f i d No
No
N/A – Vendor hosted AWS
o
technologies used
No
N/A – Vendor hosted AWS
C
approach
No
No
N/A – Vendor hosted AWS
Monitoring
Identify and describe the method used for Monitoring (i.e. Operating Systems, Application Servers,
Network Monitoring, databases and other). Identify and describe the tools used to troubleshoot and
diagnose problems. (I.e. NetIQ, UIM, Dynatrace, Sumo, CloudWatch, X-ray, Travelport and other).
3. Application Log
Monitoring
Example: Sumo Logic
Standards
ia l
Example: Reference Logging Example: PagerDuty, TCC
Availability, ServiceNow
t
N/A – Vendor hosted
4. Application Server
Monitoring
AWS
n
(availability)
N/A – Vendor hosted
5. Database Monitoring
e
AWS
N/A – Vendor hosted
6. Application Response
d
Time Monitoring
AWS
7. File System
Monitoring
8. Application Profiling
AWS
n f i
N/A – Vendor hosted
N/A – Vendor
o
hosted AWS
N/A – Vendor hosted
9. Transaction
C
Monitoring
AWS
Operational Tools
Identify and describe the method used for software delivery (i.e. RET Process Model, OS/390, VM, Radia or
other). Identify and describe the tools used to troubleshoot and diagnose problems. (i.e. Cisco Viewer,
Load Manager, Netcool, Openview). Describe the tool’s connectivity to the system, how alerts are
generated, how the monitoring occurs, and which group performs monitoring.
ia l
nt
i d e
n f
C o
OpGuide v7.1 - 2023
Failover Automation
Continuity Patrol Runbook – on Prem
Steps for executing the Go To Green orchestration: http://dpi948.delta.com/cobprod:/1%20-
%20Continuity%20of%20Business/Systems/Go%20To%20Green%20High%20Impact/ContinuityPatrolRunBook.do
c
Button Description:
ia l
Execution/Testing Results [x] Fully tested [] Limited testing [] No testing
1.
Date
RFC Number
(Manual/Automated)
nt
Total Time - execution until
service is restored
Results
2.
AWS
i d e
n f
High Level Description of Execution steps of Failover Procedure (includes assignment groups and
Organization SME’s for support purposes)
1.
2.
Step#
C o Description
AWS Failover
Steps for executing failover
AWS Failover
Steps for executing failover
2.
ia l
nt
i d e
n f
C o
OpGuide v7.1 - 2023
System Diagram
Provide the below Architecture Diagrams for your system. These diagrams Must Include Server/Device
Name Details. Provide Documentum links to these documents below, do not paste into this document.
Email [email protected] with the location/link to this guide.
Deployment Architecture
http://dpi948.delta.com/cobprod:/1%20-
%20Continuity%20of%20Business/Systems/Visit%20Manager%20Workcard%20Viewer-
AWS/Supplemental%20Documentation/PPt_CA_LA_Arch_Diagram.vsdx
Interface Architecture
http://dpi948.delta.com/cobprod:/1%20-
l
%20Continuity%20of%20Business/Systems/Visit%20Manager%20Workcard%20Viewer-
AWS/Supplemental%20Documentation/PPt_CA_LA_Arch_Diagram.vsdx
ia
Failover System Architecture
http://dpi948.delta.com/cobprod:/1%20-
nt
%20Continuity%20of%20Business/Systems/Visit%20Manager%20Workcard%20Viewer-
AWS/Supplemental%20Documentation/PPt_CA_LA_Arch_Diagram.vsdx
Physical Architecture
i d
http://dpi948.delta.com/cobprod:/1%20- e
f
%20Continuity%20of%20Business/Systems/Visit%20Manager%20Workcard%20Viewer-
AWS/Supplemental%20Documentation/PPt_CA_LA_Arch_Diagram.vsdx
Logical Drawing
n
o
http://dpi948.delta.com/cobprod:/1%20-
%20Continuity%20of%20Business/Systems/Visit%20Manager%20Workcard%20Viewer-
C
AWS/Supplemental%20Documentation/PPt_CA_LA_Arch_Diagram.vsdx
OpGuide v7.1 - 2023
System Contacts
ia l
DR Core Recovery Team
nt
The DR Core Recovery Team list is to be used in the event of a disaster. List the SME’s who are the most
knowledgeable about all components of the System and will begin the recovery efforts. The list must contain
Name
if d
Role Telephone Numbers
612-266-3138 - office
n
763-233-3643 - home
952-221-4749 - cell
o
Wanda Wirt (Secondary) Sr. Developer 612-266-3182 - Office
651-322-2331 - Home
C
651-398-1829 - Cell
OpGuide v7.1 - 2023
Business Contacts
ia l
Business contacts include your primary business contact and business users who provide acceptance or
t
integration testing assistance.
Tuvayas Duckworth
James Griffie
Product Owner
e n
Manager – Tech Procedures
404-245-8971 - Cell
404-714-1326 – Office
d
678-857-9119 - Cell
Nazim Raza
Brian Winters
n f i Product Owner
C o
OpGuide v7.1 - 2023
Log Files
Application Log File Location Content Description &
Name Error. Patterns
ia l
nt
i d e
n f
C o
OpGuide v7.1 - 2023
Troubleshooting Checklist
Use this template to record the metrics you are monitoring and actions you will take when those
metrics degrade. Include expected behavior, observed symptoms, and steps to check / monitor
escalation path for technical assistance. ZAP guidance on building troubleshooting guides will be
coming soon. Examples of metrics to add to this list can be found below.
l
• Error Rate
And actions to take
ia
when things fail
across the split by
t
components – what to
look for to
n
debug/troubleshoot
i d e
Ref: SLO+SLA - Delta
CCOE Documentation
n f
o
To each alert that N/A – Vendor hosted
comes out of system, AWS
what to look for to
C
troubleshoot
SLI-ROSA-K8s - Delta
CCOE Documentation
Troubleshooting Checklist
Use this template to record the metrics you are monitoring and actions you will take when those
metrics degrade. Include expected behavior, observed symptoms, and steps to check / monitor
escalation path for technical assistance. ZAP guidance on building troubleshooting guides will be
coming soon. Examples of metrics to add to this list can be found below.
- Disk space
-
Disk i/o
Traffic
ia l
- Errpt
messages(AIX)/
/var/adm/msg logs
in Linux
nt
Process (zombie/ hung,
d
consuming more cpu)
i e
f
Business Throughput Any sudden increase/ N/A – Vendor
decrease in business hosted AWS
throughout
Saturation
o
(How much of the
system's upper
n
Interoperability N/A – Vendor
hosted AWS
C
threshold are your
workloads utilizing)
Troubleshooting Checklist
Use this template to record the metrics you are monitoring and actions you will take when those
metrics degrade. Include expected behavior, observed symptoms, and steps to check / monitor
escalation path for technical assistance. ZAP guidance on building troubleshooting guides will be
coming soon. Examples of metrics to add to this list can be found below.
l
sessions hosted AWS
Service Response
Packet drops
nt
N/A – Vendor
ia
e
Time hosted AWS
d
Service Failures N/A – Vendor
i
hosted AWS
f
N/A – Vendor
hosted AWS
DB availability
DB performance
o n
DB alerts / traps
Load profile
N/A – Vendor
hosted AWS
N/A – Vendor
hosted AWS
C
(Include DB lock
actions, RW lock
actions, and query or
general response
degradation actions)
Troubleshooting Checklist
Use this template to record the metrics you are monitoring and actions you will take when those
metrics degrade. Include expected behavior, observed symptoms, and steps to check / monitor
escalation path for technical assistance. ZAP guidance on building troubleshooting guides will be
coming soon. Examples of metrics to add to this list can be found below.
(Include specific
actions needed by
l
response code, or
potential DB record
entries for specific
ia
events)
Application fatal
errors
River Tx Response t
N/A – Vendor
hosted AWS
n
N/A – Vendor
e
hosted AWS
d
Sys Logs Any error messages N/A – Vendor
i
hosted AWS
Access Logs
n f
(Application access logs
and any anomalies
needing specific actions
N/A – Vendor
hosted AWS
Storage Issues
C o diagnosed or addressed)
Non-Production Environment
Provide the Infrastructure Non-Production components and processes details including Development
environment, System Integration Environment and Test Environment. Available for this system.
Supported Components
l
SI Configuration
ia
Test Configuration
nt
Detailed Non-Production Configuration – SQL - <Database Name>
Components
Server name(s)
d e
Dev Configuration
i
SI Configuration Test Configuration
Server Location(s)
n f
N/A – Vendor hosted AWS
o
Detailed Non-Production Configuration – SYBASE - <Server Name>
C
N/A – Vendor hosted AWS
ia l
t
Server Location(s) N/A – Vendor hosted AWS
n
Machine Configuration N/A – Vendor hosted AWS
Components
if d
Dev Configuration
e
Detailed Non-Production Configuration – MidTier SUN
Server name(s)
Server Location(s)
o n
N/A – Vendor hosted AWS
C
Machine Configuration
Operating System
N/A – Vendor hosted AWS
ia
SI Configuration Test Configuration
nt
Components
i d e
Detailed Non-Production Configuration – System Name Application
Server name(s)
Server Location(s)
n f
N/A – Vendor hosted AWS
C o
OpGuide v7.1 - 2023
Known Issues
List all known issues in this table – for example Single Sign-on is not supported etc.
Business Impact
What is the impact to Delta’s business in this scenario? Provide 1-2 paragraphs using only business
terms. Avoid acronyms Provide a 1-2 paragraph description of how the business is impacted
ia l
Technical Workaround to minimize impact
nt
e
N/A – Vendor hosted AWS
f i d
o n
C