Webinar: Learn More About Data Scientist, Data Analyst & Data Engineer
05 Minigame
01 ROLES IN BUSINESS INTELLIGENCE
BENEFITS OF BUSINESS INTELLIGENCE
ROLES IN BI
02 Learn more about DATA ANALYST
01 What is Data Analytics?
04 Career Path
06 Q&A
1 WHAT IS DATA ANALYTICS?
Data analytics is the science of analyzing raw data to draw conclusions about
that information and then help a business optimize its performance.
[Diagram: SCIENCE OF ANALYZING · RAW DATA · OPTIMIZE PERFORMANCE]
[Diagram: analytics levels, axes BUSINESS VALUE and DEGREE OF INTELLIGENCE]
8 OPTIMISATION: What’s the best that can happen?
6 FORECASTING: What if this trend continues?
5 STATISTICAL ANALYSIS: Why is this happening?
4 ALERTS: What actions are needed?
2 AD-HOC REPORTS: How many? How often? Where?
1 STANDARD REPORT: What happened?
BUSINESS INTELLIGENCE
SOME PROBLEMS TO SOLVE AT CỐC CỐC
1. How to organize reports/dashboards so other teams can use them easily?
2. How to define meaningful metrics?
3. Why are users more active in the browser during a specific period?
4. What features do users use most frequently? How to improve them?
5. How does a new feature impact user behavior?
6. Why do users churn?
7. Which groups of users retain best?
8. How to trigger the aha moment to increase retention?
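A question such as "Which groups of users retain best?" can be approached with a day-N retention calculation. The following is only a minimal sketch under stated assumptions: the event tuples, the group names (`new_tab_users`, `video_users`), and the function `retention_by_group` are hypothetical and do not come from Cốc Cốc's data or tooling.

```python
from collections import defaultdict
from datetime import date


def retention_by_group(events, day_n=7):
    """Share of users per group who were active exactly `day_n` days after first being seen.

    `events` is an iterable of (user_id, group, day) tuples (hypothetical schema).
    """
    first_seen = {}
    active_days = defaultdict(set)
    group_of = {}
    for user_id, group, day in events:
        group_of[user_id] = group
        active_days[user_id].add(day)
        if user_id not in first_seen or day < first_seen[user_id]:
            first_seen[user_id] = day

    totals = defaultdict(int)
    retained = defaultdict(int)
    for user_id, start in first_seen.items():
        group = group_of[user_id]
        totals[group] += 1
        # A user counts as retained if any activity falls on day N after the first day.
        if any((d - start).days == day_n for d in active_days[user_id]):
            retained[group] += 1
    return {group: retained[group] / totals[group] for group in totals}


# Invented sample events, for illustration only.
events = [
    ("u1", "new_tab_users", date(2021, 1, 1)),
    ("u1", "new_tab_users", date(2021, 1, 8)),
    ("u2", "new_tab_users", date(2021, 1, 1)),
    ("u3", "video_users", date(2021, 1, 2)),
    ("u3", "video_users", date(2021, 1, 9)),
]
rates = retention_by_group(events)
```

Comparing the per-group rates is one way to see which segment retains best; a real analysis would also need enough users per group for the difference to be meaningful.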
Job                Tasks
Data Analyst       Data Analytics
Business Analyst   Business Analytics
3 SKILL SET OF DATA ANALYST
[Diagram: RAW DATA · DATA ANALYST · BUSINESS]
Technical skills:
● Statistics
● BI Tools: Power BI, Excel, Tableau
● Programming: R, Python
Soft skills:
● Critical thinking and problem solving
● Writing and communication
● Domain knowledge
DATA ANALYST ROADMAP
4 CAREER PATH
... It depends on how well you know yourself.
Data Analytics in Cốc Cốc
5 CHALLENGES & LESSONS LEARNED
03 Learn more about DATA ENGINEER
01 Who are Data Engineers?
06 Q&A
1 WHO ARE THEY? WHAT DO THEY DO?
They build and maintain their
organization’s data ecosystem.
[Diagram labels: Data sources; Other; Tech leader; Process disciplines; *nix environment; Web services]
ROADMAP
Source: simplilearn.com
DATA ENGINEERING IN CỐC CỐC
5 CHALLENGES & LESSONS LEARNED
I am 26 this year. I have just started exploring and learning the basics
needed to become a Data Engineer. Will Cốc Cốc open recruitment programs
for interns or JUNIOR positions in the future?
QUESTION FOR WEBINAR ATTENDEES
The 3 players with the fastest correct answers
will each receive a 50K voucher
What phrase does “ETL” stand for?
Extract - Transform - Load
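To make the three ETL phases concrete, here is a minimal sketch using only the Python standard library. The CSV content, the column names, and the `events` table are invented for illustration and are not taken from any Cốc Cốc pipeline.

```python
import csv
import io
import sqlite3

# Invented raw input; the second row has a missing duration on purpose.
RAW_CSV = """user_id,event,duration_ms
u1,page_view,1200
u2,page_view,
u1,search,300
"""


def extract(text):
    """Extract: read raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows without a duration and convert milliseconds to seconds."""
    cleaned = []
    for row in rows:
        if not row["duration_ms"]:
            continue
        cleaned.append((row["user_id"], row["event"], int(row["duration_ms"]) / 1000))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into a database table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events (user_id TEXT, event TEXT, duration_s REAL)"
    )
    conn.executemany("INSERT INTO events VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)
loaded = conn.execute(
    "SELECT user_id, event, duration_s FROM events ORDER BY duration_s"
).fetchall()
```

Keeping each phase in its own function makes it possible to test and rerun the phases separately, which is one common way such jobs are organized.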
04 Learn more about DATA SCIENTIST
Increase in Demand for Data Scientist
Source: kickstarter.com
Which industries are hiring us?
ROLES OF
DATA SCIENTIST
04 Career Path
07 Q&A
1 WHO ARE THEY?
2 A DAY IN THE LIFE OF A DATA SCIENTIST
WHAT DOES A DATA SCIENTIST DO?
RESEARCH
Data Analysis ?
DATA SCIENCE IN CỐC CỐC
5 CHALLENGES & OPPORTUNITIES
● Build ML/AI prediction models for AdEngine (CTR prediction, Quality Score model)
● Design and develop machine learning and deep learning systems
● Run machine learning tests and experiments
● Implement appropriate ML algorithms and turn them into microservices in our production environments
● Improve the current targeting model (user interests/user catalogs)
● Build recommendation/suggestion models for our current products (Music Recommendation Engine, B2B, B2C products)
● Others: analyze data, do ad-hoc analysis, and present results clearly
6 LESSONS LEARNED FROM NEWS FEED PROJECT
The personalized news content distribution
platform (NewsFeed)
RESEARCH
GraphDB Relationship.
Query samples (Cypher-style sketches; `t.id` and `i.score` are assumed property names):
● Get the top K recent articles in {categories} OR {topics} that we can recommend to the target user and their neighbors.
● neighbors = MATCH (u:User)-[:IS_SIMILAR]->(c:User) WHERE u.userId = $uid RETURN c.userId
○ recommendation = SUM OF
■ (1) articles read by similar users: MATCH (c:User)-[:READ]->(a:Article) WHERE c.userId IN $lst_neighbors RETURN a.articleId AND
■ (2) articles in categories or topics the neighbors are interested in: MATCH (u:User)-[i:INTERESTED_IN]->(t)<-[:BELONGS_TO]-(a:Article) WHERE (t.id IN $lst_categories OR t.id IN $lst_topics) AND i.score > 0.5 AND u.userId IN $lst_neighbors RETURN a.articleId AND
■ (3) articles matching the user's own interests: MATCH (u:User)-[i:INTERESTED_IN]->(t)<-[:BELONGS_TO]-(a:Article) WHERE (t.id IN $lst_categories OR t.id IN $lst_topics) AND i.score > 0.5 AND u.userId = $uid RETURN a.articleId
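The "SUM OF" step in the query samples above can be read as merging three candidate lists into one ranking. The sketch below is one possible interpretation, not the NewsFeed implementation: it assumes each source contributes one point per article and that ties are broken by article id. The function `recommend` and the sample ids are hypothetical.

```python
from collections import Counter


def recommend(read_by_neighbors, neighbor_interest_articles, own_interest_articles, k=3):
    """Rank articles by how many of the three candidate sources suggest them."""
    scores = Counter()
    for source in (read_by_neighbors, neighbor_interest_articles, own_interest_articles):
        # set() ensures a source counts each article at most once.
        for article_id in set(source):
            scores[article_id] += 1
    # Highest score first; ties broken by article id for a stable order.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [article_id for article_id, _ in ranked[:k]]


# Invented candidate lists standing in for the results of queries (1), (2), and (3).
top = recommend(
    read_by_neighbors=["a1", "a2", "a3"],
    neighbor_interest_articles=["a2", "a3", "a4"],
    own_interest_articles=["a3", "a5"],
    k=3,
)
```

A production system would likely weight the sources differently and filter out articles the user has already read; those choices are left out here to keep the merge step visible.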
Step 4:
Run Experiment (Online/Offline A/B Testing)
Step 5:
Monitor and measure the results
The click-through rate increased by up to 20%
compared to the human setting, at a 99%
statistical confidence level
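One common way to check whether a click-through rate lift from an A/B test is statistically significant is a two-proportion z-test. The slides do not say which test was used, so this is a sketch of that standard technique only. The click and view counts are invented to produce a 20% relative lift and are not the project's data.

```python
import math


def two_proportion_z_test(clicks_a, views_a, clicks_b, views_b):
    """Return (z, two-sided p-value) for the difference between two click-through rates."""
    p_a = clicks_a / views_a
    p_b = clicks_b / views_b
    # Pooled rate under the null hypothesis that both variants share one CTR.
    pooled = (clicks_a + clicks_b) / (views_a + views_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / views_a + 1 / views_b))
    z = (p_b - p_a) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Invented counts: variant A is the human setting, variant B is the model.
z, p_value = two_proportion_z_test(1000, 50000, 1200, 50000)
relative_lift = (1200 / 50000) / (1000 / 50000) - 1
significant_at_99 = p_value < 0.01
```

With these invented counts the lift is 20% and the p-value falls below 0.01, which is what significance at the 99% confidence level means in this setting.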
Q&A
Which projects are suitable for newbies to gradually get used to a real
working environment? What are the basic errors and mistakes that newbies
make while studying and when they start working?