Operation Analytics and Investigating Metric Spike
Description:
In this project, we analyze several datasets to derive insights for
operational improvements. The analysis covers two case studies: Job
Data Analysis and Investigating Metric Spikes. The goal is to use
advanced SQL to analyze the data and provide actionable insights that
help different departments within the company.
Operational Analytics is a crucial process that involves analyzing a
company's end-to-end operations. One of the key aspects of
Operational Analytics is investigating metric spikes. This involves
understanding and explaining sudden changes in key metrics, such as a
dip in daily user engagement or a drop in sales.
Approach:
For the project, we create a database and its tables. In the first case
study the data is inserted into the tables manually, while for the
second case study the provided CSV files are imported into MySQL
Workbench to create the necessary tables. We then verify that the
tables are correctly structured and populated with data.
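The manual setup for the first case study can be sketched as follows. This is a minimal illustration, not the schema used in the project: the table name job_data, its columns, and the sample rows are all assumptions, and an in-memory SQLite database stands in for MySQL so the snippet runs on its own.

```python
import sqlite3

# In-memory SQLite stands in for the MySQL database used in the project.
conn = sqlite3.connect(":memory:")

# Hypothetical schema: table and column names are illustrative only.
conn.execute("""
    CREATE TABLE job_data (
        ds         TEXT,     -- review date, YYYY-MM-DD
        job_id     INTEGER,
        language   TEXT,
        time_spent INTEGER   -- seconds spent on the review
    )
""")

# Made-up sample rows, inserted manually as in the first case study.
rows = [
    ("2020-11-29", 1, "English", 20),
    ("2020-11-30", 2, "Persian", 15),
    ("2020-11-30", 3, "Persian", 25),
]
conn.executemany("INSERT INTO job_data VALUES (?, ?, ?, ?)", rows)

# Sanity checks: confirm the structure and the number of rows loaded.
columns = [info[1] for info in conn.execute("PRAGMA table_info(job_data)")]
row_count = conn.execute("SELECT COUNT(*) FROM job_data").fetchone()[0]
print(columns, row_count)
```

In MySQL Workbench the same checks would typically be done with DESCRIBE and SELECT COUNT(*) on the table.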
Next, we use SQL queries to answer the questions posed in the case
studies, paying attention to the table structures and the meaning of
each column. Finally, we prepare a comprehensive report summarizing
the findings, approach, tech stack used, insights, and results.
Tech Stack:
• MySQL Workbench: For database creation, table management,
and executing SQL queries.
• MS-Word: To write the report.
• Google Drive: To store and share the final report.
Across November 2020, the number of jobs reviewed per hour per day
comes to 0.0111.
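The report does not show the query behind this figure. One plausible reading, sketched below, divides the total number of jobs reviewed in November by the number of hours in the month (30 × 24 = 720). The table job_data, its columns, and the sample rows are assumptions made for illustration, and SQLite stands in for MySQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table: one row per reviewed job.
conn.execute("CREATE TABLE job_data (ds TEXT, job_id INTEGER)")
conn.executemany("INSERT INTO job_data VALUES (?, ?)", [
    ("2020-11-25", 1), ("2020-11-27", 2), ("2020-11-28", 3),
    ("2020-11-29", 4), ("2020-11-30", 5), ("2020-11-30", 6),
])  # made-up sample data

HOURS_IN_NOVEMBER = 30 * 24  # 720 hours

# Total jobs in the month divided by total hours in the month.
jobs_per_hour = conn.execute("""
    SELECT COUNT(job_id) * 1.0 / ?
    FROM job_data
    WHERE ds BETWEEN '2020-11-01' AND '2020-11-30'
""", (HOURS_IN_NOVEMBER,)).fetchone()[0]

print(round(jobs_per_hour, 4))  # 6 sample jobs / 720 hours
```

A per-day breakdown would instead group by ds and divide each day's count by 24.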
B. Throughput Analysis:
Calculate the 7-day rolling average of throughput (number of
events per second). Write an SQL query to calculate the 7-day
rolling average of throughput. Additionally, explain whether you
prefer using the daily metric or the 7-day rolling average for
throughput, and why.
The 7-day rolling average of throughput peaks on 30th November 2020,
a day on which 2 jobs were reviewed. I prefer the 7-day rolling
average over the daily metric because it smooths out daily
fluctuations and provides a more stable view of trends over time,
which helps in making better-informed decisions.
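A rolling average of this kind can be sketched with a window function. The example assumes a hypothetical job_data table with a time_spent column in seconds, takes daily throughput as jobs divided by total seconds spent, and assumes one row per calendar day with no gaps, since the window counts rows rather than dates. SQLite stands in for MySQL, and the data is made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE job_data (ds TEXT, job_id INTEGER, time_spent INTEGER)")

# Made-up data: one job per day for 8 consecutive days (23 to 30 November).
seconds = [10, 20, 10, 20, 10, 20, 10, 5]
conn.executemany(
    "INSERT INTO job_data VALUES (?, ?, ?)",
    [(f"2020-11-{23 + i}", i + 1, s) for i, s in enumerate(seconds)],
)

result = conn.execute("""
    WITH daily AS (
        -- Daily throughput: jobs per second of review time.
        SELECT ds, COUNT(*) * 1.0 / SUM(time_spent) AS throughput
        FROM job_data
        GROUP BY ds
    )
    SELECT ds,
           throughput,
           -- Average over the current day and the 6 preceding days.
           AVG(throughput) OVER (
               ORDER BY ds
               ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
           ) AS rolling_7d
    FROM daily
    ORDER BY ds
""").fetchall()

for ds, daily_value, rolling in result:
    print(ds, round(daily_value, 4), round(rolling, 4))
```

In this sample the last day's daily throughput jumps to 0.2 while the rolling value moves far less, which is the smoothing effect described above.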
Insights:
• Jobs Reviewed Over Time: Peaks in the number of jobs reviewed
indicate high-activity periods, which can help allocate resources
more effectively. The number of jobs reviewed per hour is 0.0111.
• Throughput Analysis: Tracking the 7-day rolling average of
throughput gives a stable basis for monitoring operational
performance. It is highest on 30-11-2020, at 1.34.
• Language Share Analysis: Identifying dominant languages helps in
localizing content and improving user engagement. Persian has the
highest language share; all other languages have an equal share.
• Duplicate Rows Detection: Detecting and eliminating duplicate
rows ensures data accuracy and integrity.
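The language share and duplicate checks listed above can be sketched as two grouped queries. The table job_data, its columns, and the rows below are assumptions for illustration, not the project's data, and SQLite stands in for MySQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE job_data (job_id INTEGER, language TEXT)")
conn.executemany("INSERT INTO job_data VALUES (?, ?)", [
    (1, "English"),
    (2, "Persian"), (3, "Persian"), (4, "Persian"),
    (5, "Hindi"), (5, "Hindi"),   # deliberate duplicate row
])  # made-up sample data

# Percentage share of each language among all rows.
share = conn.execute("""
    SELECT language,
           COUNT(*) * 100.0 / (SELECT COUNT(*) FROM job_data) AS pct
    FROM job_data
    GROUP BY language
    ORDER BY pct DESC, language
""").fetchall()

# Rows that occur more than once across all columns.
duplicates = conn.execute("""
    SELECT job_id, language, COUNT(*) AS copies
    FROM job_data
    GROUP BY job_id, language
    HAVING COUNT(*) > 1
""").fetchall()

print(share)
print(duplicates)
```

Grouping by every column treats only fully identical rows as duplicates; grouping by job_id alone would flag repeated IDs instead.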
Case Study II: Investigating Metric Spike
We will be working with three tables:
1. users: Contains one row per user, with descriptive information
about that user’s account.
2. events: Contains one row per event, where an event is an action
that a user has taken (e.g., login, messaging, search).
3. email_events: Contains events specific to the sending of emails.
The data for Case Study II is provided as .csv files on Google Drive.
We need to create three tables, i.e. users, email_events and events.
The tables are populated using the import function in MySQL
Workbench.
Tasks:
A. Weekly User Engagement:
Measure the activeness of users on a weekly basis. Write an SQL
query to calculate the weekly user engagement.
Weekly engagement is highest in week 33 and lowest in week 21.
https://drive.google.com/file/d/1JXqXrOYALV0sk1oehIFcKRl6lFPaz
MIF/view?usp=sharing
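Weekly engagement of this kind can be sketched as a count of distinct active users per week. The column names user_id and occurred_at and the sample rows are assumptions, and SQLite's strftime('%W', ...) stands in for MySQL's WEEK() function; the two can number weeks differently depending on the mode used.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns for the events table.
conn.execute("CREATE TABLE events (user_id INTEGER, occurred_at TEXT)")
conn.executemany("INSERT INTO events VALUES (?, ?)", [
    (1, "2021-03-01 09:00:00"),
    (1, "2021-03-02 10:30:00"),   # same user, same week: counted once
    (2, "2021-03-03 14:00:00"),
    (1, "2021-03-08 08:15:00"),   # following week
])  # made-up sample data

weekly = conn.execute("""
    SELECT strftime('%W', occurred_at) AS week_no,   -- MySQL: WEEK(occurred_at)
           COUNT(DISTINCT user_id)     AS active_users
    FROM events
    GROUP BY week_no
    ORDER BY week_no
""").fetchall()

for week_no, active_users in weekly:
    print(week_no, active_users)
```

COUNT(DISTINCT user_id) keeps a user who triggers many events in one week from inflating that week's figure.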
The full output table is provided in the given link.
https://drive.google.com/file/d/1IIzkAxJJP5UvTI-4HSQModwrWo7-
MU/view?usp=sharing
The email open rate is 32.11 and the email click rate is 13.18.
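The report does not state how these rates are defined. A common choice, sketched here as an assumption, is to express opens and clicks as a percentage of emails sent. The action column and its values 'sent', 'open' and 'click' are hypothetical labels, the data is made up, and SQLite stands in for MySQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns and action labels for email_events.
conn.execute("CREATE TABLE email_events (user_id INTEGER, action TEXT)")
sample = (
    [(i, "sent") for i in range(10)]     # 10 emails sent
    + [(i, "open") for i in range(3)]    # 3 opened
    + [(0, "click")]                     # 1 clicked
)
conn.executemany("INSERT INTO email_events VALUES (?, ?)", sample)

open_rate, click_rate = conn.execute("""
    SELECT
        100.0 * SUM(CASE WHEN action = 'open'  THEN 1 ELSE 0 END)
              / SUM(CASE WHEN action = 'sent'  THEN 1 ELSE 0 END) AS open_rate,
        100.0 * SUM(CASE WHEN action = 'click' THEN 1 ELSE 0 END)
              / SUM(CASE WHEN action = 'sent'  THEN 1 ELSE 0 END) AS click_rate
    FROM email_events
""").fetchone()

print(open_rate, click_rate)
```

If the click rate is meant relative to opened emails rather than sent emails, the second denominator would count 'open' rows instead.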
Insights:
• Weekly User Engagement:
By analyzing the weekly user engagement, we can identify trends
in user activity. Weekly engagement is highest in week 33 and
lowest in week 21.
• User Growth Analysis:
Tracking the growth of users over time helps in understanding the
success of marketing efforts and product acceptance in the
market. The number of active users per week is 9381.
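User growth over time can be sketched as weekly sign-ups with a running total. The created_at column and the sample rows are assumptions for illustration, and SQLite stands in for MySQL, where WEEK(created_at) would replace strftime.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical columns for the users table.
conn.execute("CREATE TABLE users (user_id INTEGER, created_at TEXT)")
conn.executemany("INSERT INTO users VALUES (?, ?)", [
    (1, "2021-03-01"), (2, "2021-03-03"),                      # first week
    (3, "2021-03-09"),                                         # second week
    (4, "2021-03-15"), (5, "2021-03-16"), (6, "2021-03-20"),   # third week
])  # made-up sample data

growth = conn.execute("""
    WITH weekly AS (
        SELECT strftime('%W', created_at) AS week_no,
               COUNT(*)                   AS new_users
        FROM users
        GROUP BY week_no
    )
    SELECT week_no,
           new_users,
           -- Running total of users up to and including this week.
           SUM(new_users) OVER (ORDER BY week_no) AS cumulative_users
    FROM weekly
    ORDER BY week_no
""").fetchall()

for week_no, new_users, cumulative_users in growth:
    print(week_no, new_users, cumulative_users)
```

Data spanning more than one calendar year would need the year in the grouping key as well, since week numbers repeat.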
Result:
This project enhanced my understanding of using SQL for operational
analytics, providing valuable insights for improving company operations
and addressing metric spikes. The analysis led to actionable
recommendations for resource allocation, performance monitoring,
content localization, and data accuracy.