0% found this document useful (0 votes)

204 views4 pages

Customer Analytics at Flipkart

1. The document discusses recommendation systems used by ecommerce companies Flipkart and Bigbasket. 2. For Flipkart, it recommends using a collaborative filtering algorithm that analyzes user purchase histories and items to find relationships and make personalized recommendations. 3. For Bigbasket, in addition to collaborative filtering, it suggests using association rule mining on repeat purchases to identify commonly bought item groups based on support and confidence metrics.

Uploaded by

Rachana jala

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

204 views4 pages

Customer Analytics at Flipkart

Uploaded by

Rachana jala

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Customer Analytics at Flipkart:

Ecommerce has been a platform of advantage for many retail webstores. Flipkart being one of the
older players who is trying to set its competitive advantage uses right strategies of gaining customer
interest, pattern of buying behaviour using certain recommendation Engines.

Here, in our case we can implement – Collaborative filtering algorithm in recommendation system of
Flipkart.

Recommendation System – Collaborative filtering in Flipkart.

 Take the historical data of all the user list for a particular period and demographic of that
particular location.
 Input data for collaborative filtering can be target items (which users usually purchase) and
users.

+The data is stored in matrix form or rows and columns where rows are the number of users and
columns as the targeted items of particular section. Say apparels.

 In our case we can see how each user purchased 5 items of that section, where each -
“purchased” that particular item.
 Hence matching criteria of the user with target item is also important.
 Here, we saw the criteria as purchased, it can be “Added to cart”, “ratings given” , “click per
view” etc.
 Based on the above parameters we use “utility matrix” to find the relation between users
and targeted items.
 Utility Matrix: C x S- R, where R = Targeted Criteria; C= set of customers; S= set of targeted
Items.
 Collaborative Filtering Algorithm:
Step 1: Find the set of N users whose purchases are similar to the targeted customer X
Step 2: Estimate X purchase item based on set of N users
Step 3: If the purchase similarity matches accurately, Suggest the recommended items of N
users, to the user X.

Hybrid = Content + Collaborative

Content = Based on articles/blogs customer clicked, liked per view etc factors

So, this Flipkart can use Hybrid also.

Using Jaccard Similarity:

View : Customer – Item Perspective

Sim (a,b) = |rA intersects rB|

Sim(A,B) =1/5;Sim(A,C) =2/4 ; Hence we can conclude similarity of user A, user B is less compared to
similarity of A,C.
Hence items recommended A are suggested for C instead of B user.
Flipkart & Bigbasket–

 Finding Similarity >> Knowledge based similarity / Collaborative ( Use distance

measures like Cosine similarity , Jaccard ) – For Flipkart & Bigbasket

Big basket – Focussed on RECOMMENDATION ENGINE ONLY (not on Churn prediction )

 Repeat purchases are done in bigbasket so for this along with collaborative filtering
we use the evaluation tools like “ support” and “confidence” while applying
association rule mining .
 Association is measured using support and confidence Support(A)=n(A)/N
Support=n(A∩B)/N where n(A∩B) is the no.of times both A and B are purchased and
N is the total no.of purchases Confidence=n(A∩B)/n(A)=Support(A,B)/Support(A)
Lift(A,B)=Confidence(A,B)/Support(B) = Support(A,B)/ Support(A) * Support(B) =
n(A∩B)/n(A) * n(B) value >1 likely to be bought =1 no association <1 unlikely to buy
 Association Rule Mining – Algorithm:
1. Apriori algorithm is used to find the frequency patterns keeping “frequent item
set “ condition in mind.
2. Generating Association rules
Eg : Person who buys bread will 100% purchase milk and egg or When milk and
egg are bought there is 100% chance that bread is bought.
Popularity of associate rule mining is checked through threshold limit .
 One of the product recommendation techniques is “Market basket analysis”.
 “Lift” is the measure of correlation and measure of association. In bigbasket
case , we see if lift =1 there is no association between bread and butter
 Lift <1 then there is a negative association where when bread is bought
chance of buying butter is decreases.
 Lift >1 is a positive association where when bread is bought chance of buying
butter increases.
 In the bigbasket case study , I suggest for Lift >1 .

Flipkart & QWE & HR Analytics – Logistic Regression -Algorithm Steps :

1. Find the categorical features in the data set – Gender, age

Algorithm - See the link

https://towardsdatascience.com/a-complete-logistic-regression-algorithm-from-scratch-in-python-
step-by-step-ce33eae7d703

Confusion Matrix – Evaluation Metrics considered during Churn prediction.

A confusion matrix is an extremely strong method of evaluating the performance of our classifier. A

confusion matrix is a visual representation which tells us the degree of four important classification

metrics:

 True Positives (TP): The number of observations where the model predicted the

customer would churn (1), and they actually do churn (1)

 True Negatives (TN): The number of observations where the model predicted the

customer would not churn (0), and they actually do not churn (0).

 False Positives (FP): The number of observations where the model predicted the

customer will churn (1), but in real life they do not churn (0).

 False Negatives (FN): The number of observations where the model predicted the

customer will not churn (0), but in real life they do churn (1).

One axis of a confusion matrix will represent the ground-truth value, while the other will represent

the predicted values. At this step, it is very important to have business domain knowledge. Certain

metrics are more prevalent to our model than not. For example, if we were modeling whether a

patient has a disease, it would be much worse for a high number of false negatives than a high

number of false positives. If there are many false positives, then that just means some patients

would need to undergo some unnecessary testing and maybe an annoying doctor visit or two. But, a

high false negative means that many patients would actually be sick and diagnosed as healthy,

potentially having dire consequences. For our purposes of churn, it is worse for us to predict a

customer not churning when that customer actually churns in reality, meaning that our False

Negatives are more important to pay attention to.

Over All

Flipkart / QWE/ HR Analytics – Logistic Regression

Flipkart – Collaborative / Hybrid filtering

Bigbasket :Collaborative filtering >> Concept of repeat purchases >> Association rule mining >>
Support and confidence as evaluation tools or metrics.

QWE – Content Filtering >> Why ?? – Based on the articles, blogs which mention the services
provided by the org .

HR – no recommendation engine as it is about prediction for joining the company or not .

Flipkart, QWE, HR >> Uses Confusion matrix – precision , accuracy, specificity etc as evaluation
metrics to determine the churn rate model .

Making Stickk Stick: The Business of Behavioral Economics
No ratings yet
Making Stickk Stick: The Business of Behavioral Economics
6 pages
AQA Physics P7 Radioactivity Past Paper Questions
No ratings yet
AQA Physics P7 Radioactivity Past Paper Questions
23 pages
Linear Programming On Work Scheduling - Operations Management
100% (1)
Linear Programming On Work Scheduling - Operations Management
3 pages
Netflix Uv7080 Xls Eng
100% (1)
Netflix Uv7080 Xls Eng
11 pages
Amazon ACE Challenge - Operations Case Breaker
No ratings yet
Amazon ACE Challenge - Operations Case Breaker
6 pages
Case-2 - Group 5
No ratings yet
Case-2 - Group 5
5 pages
ABDE™ - Introduction
No ratings yet
ABDE™ - Introduction
16 pages
Bella Healthcare India: Type To Enter A Caption
No ratings yet
Bella Healthcare India: Type To Enter A Caption
15 pages
Decision Modelling
No ratings yet
Decision Modelling
280 pages
Group 14 - BeM Casestudy
100% (1)
Group 14 - BeM Casestudy
13 pages
Solution Report For: Home My Test My Profile
No ratings yet
Solution Report For: Home My Test My Profile
11 pages
Bella Healthcare India
No ratings yet
Bella Healthcare India
14 pages
Session6 Withoutsolution
No ratings yet
Session6 Withoutsolution
33 pages
APPLICATION Goal Programming
No ratings yet
APPLICATION Goal Programming
6 pages
Consumer Behaviour Seminar (FINAL)
No ratings yet
Consumer Behaviour Seminar (FINAL)
25 pages
Analysis of Cost Controlling in Construction Industries by Earned Value Method Using Primavera
100% (1)
Analysis of Cost Controlling in Construction Industries by Earned Value Method Using Primavera
9 pages
SmartCityProject PDF
No ratings yet
SmartCityProject PDF
39 pages
Dea PDF
No ratings yet
Dea PDF
149 pages
B2B - 03 - EMC2 Case
No ratings yet
B2B - 03 - EMC2 Case
2 pages
Group1 Final Report
No ratings yet
Group1 Final Report
54 pages
Case Analysis
No ratings yet
Case Analysis
14 pages
B2BMarketing Group6 FormPrintOrtho500
No ratings yet
B2BMarketing Group6 FormPrintOrtho500
3 pages
Linear Regression Analysis
No ratings yet
Linear Regression Analysis
2 pages
(Tata Steel'S Acquisition of Corus) : (Business Policy and Strategic Management)
No ratings yet
(Tata Steel'S Acquisition of Corus) : (Business Policy and Strategic Management)
6 pages
Post Jio Impact - Research Project PDF
No ratings yet
Post Jio Impact - Research Project PDF
12 pages
Stickk
No ratings yet
Stickk
4 pages
BM Case
No ratings yet
BM Case
6 pages
Goal Programming Group4 Amat112b
No ratings yet
Goal Programming Group4 Amat112b
24 pages
Design Thinking Project Saloni Sunny C - PPT
No ratings yet
Design Thinking Project Saloni Sunny C - PPT
12 pages
TrailBlazers Conjoint
No ratings yet
TrailBlazers Conjoint
6 pages
DEA Class Material PDF
100% (1)
DEA Class Material PDF
53 pages
Design Thinking Rahul
No ratings yet
Design Thinking Rahul
2 pages
Solarwinds NPM ASA Monitoring
No ratings yet
Solarwinds NPM ASA Monitoring
13 pages
Global Expansion Strategy of Bumrungrad Hospital by Dr. Zayar Naing
No ratings yet
Global Expansion Strategy of Bumrungrad Hospital by Dr. Zayar Naing
7 pages
Presented To:-Prof - Sandeep Anand: Presentation On "OPERATION RESEARCH"
No ratings yet
Presented To:-Prof - Sandeep Anand: Presentation On "OPERATION RESEARCH"
30 pages
Business To Business Marketing Session 2
No ratings yet
Business To Business Marketing Session 2
26 pages
Case Study # 3 Business Unit Shut Down
No ratings yet
Case Study # 3 Business Unit Shut Down
3 pages
Valuation of Bonds and Stocks
No ratings yet
Valuation of Bonds and Stocks
71 pages
Chris DiFrancesco CORMETECH
No ratings yet
Chris DiFrancesco CORMETECH
22 pages
Predictive Analytics - Share - V5
No ratings yet
Predictive Analytics - Share - V5
32 pages
Scalene Works-HR Analytics
0% (1)
Scalene Works-HR Analytics
10 pages
Icici Case
No ratings yet
Icici Case
2 pages
DCF SBI Template
No ratings yet
DCF SBI Template
7 pages
Supply Chain Management and Logistics Collabortion
No ratings yet
Supply Chain Management and Logistics Collabortion
21 pages
ICICI Global Expansion Strategy
No ratings yet
ICICI Global Expansion Strategy
15 pages
Financial Performance Analysis Through Position Statements of Selected FMCG Companies
No ratings yet
Financial Performance Analysis Through Position Statements of Selected FMCG Companies
8 pages
Consumer Behaviour Towards Branded V/S Non Branded Computers
No ratings yet
Consumer Behaviour Towards Branded V/S Non Branded Computers
55 pages
DCF Analysis Conclusion
No ratings yet
DCF Analysis Conclusion
1 page
EMC2: Delivering Customer Centricity - Case Memo
No ratings yet
EMC2: Delivering Customer Centricity - Case Memo
2 pages
Network Atlas - Orion Platform
No ratings yet
Network Atlas - Orion Platform
48 pages
BMO5501 Business Ethics and Sustainability1.edited
No ratings yet
BMO5501 Business Ethics and Sustainability1.edited
11 pages
Questions Dow
No ratings yet
Questions Dow
5 pages
Market Research Project: Should IIM Lucknow Start A Five Year Course in ?
No ratings yet
Market Research Project: Should IIM Lucknow Start A Five Year Course in ?
18 pages
Valuation of Stressed Assets: The Institute of Cost Accountants of India
No ratings yet
Valuation of Stressed Assets: The Institute of Cost Accountants of India
43 pages
Decision Models and Optimization: Sample-Endterm-with Solutions
No ratings yet
Decision Models and Optimization: Sample-Endterm-with Solutions
6 pages
Tata and Corus: A Case of Acquisition: BY: Yogendra Agarwal Vinod Sharma Anil Netar Ridhi Goyal Rajesh Jangid
100% (1)
Tata and Corus: A Case of Acquisition: BY: Yogendra Agarwal Vinod Sharma Anil Netar Ridhi Goyal Rajesh Jangid
12 pages
Sector Analysis Capstone (Rohan Pandita)
No ratings yet
Sector Analysis Capstone (Rohan Pandita)
17 pages
Relative Valuation: Aswath Damodaran
No ratings yet
Relative Valuation: Aswath Damodaran
130 pages
Did United Technologies Overpay For Rockwell Collins
No ratings yet
Did United Technologies Overpay For Rockwell Collins
2 pages
WACC
No ratings yet
WACC
6 pages
Coverfox
No ratings yet
Coverfox
13 pages
IoT Platform Complete Self-Assessment Guide
From Everand
IoT Platform Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
The Principles of a Combined Cycle Power Plant
No ratings yet
The Principles of a Combined Cycle Power Plant
47 pages
Understanding UPS Overload Capabilities
No ratings yet
Understanding UPS Overload Capabilities
19 pages
MIDTERM SCI MCQ Class 6
No ratings yet
MIDTERM SCI MCQ Class 6
16 pages
Landslide Hazard Evaluation and Zonation Mapping in Mountainous Terrain
No ratings yet
Landslide Hazard Evaluation and Zonation Mapping in Mountainous Terrain
9 pages
Sego Everfocus Eq610 - Manual
No ratings yet
Sego Everfocus Eq610 - Manual
26 pages
Slide 5 - BPMN
No ratings yet
Slide 5 - BPMN
131 pages
Design of Deep Excavation in Soft Clay: Haga Station in Västlänken Railway Tunnel in Gothenburg
No ratings yet
Design of Deep Excavation in Soft Clay: Haga Station in Västlänken Railway Tunnel in Gothenburg
189 pages
Growatt Hybrid sph5000 Manual
No ratings yet
Growatt Hybrid sph5000 Manual
66 pages
CHAPTER - 3 - Logic - Circuits
No ratings yet
CHAPTER - 3 - Logic - Circuits
12 pages
Scheme of Studies: Master of Science in Management Sciences (MSMS)
No ratings yet
Scheme of Studies: Master of Science in Management Sciences (MSMS)
49 pages
Core 321/4541: EN 1.4541, ASTM TYPE 321 / UNS S32100
No ratings yet
Core 321/4541: EN 1.4541, ASTM TYPE 321 / UNS S32100
8 pages
Inkscape Keyboard Shortcuts
No ratings yet
Inkscape Keyboard Shortcuts
39 pages
4.kakku Heavy Duty Cam Operated Limit Switch KTLS-4000
No ratings yet
4.kakku Heavy Duty Cam Operated Limit Switch KTLS-4000
3 pages
Special Theory of Relativity
No ratings yet
Special Theory of Relativity
16 pages
Paper Pattern 12th Class
No ratings yet
Paper Pattern 12th Class
3 pages
Chem.kinetics Akash Lvl1-SolN
No ratings yet
Chem.kinetics Akash Lvl1-SolN
27 pages
Mapua Institute of Technology: Experiment No. 2
No ratings yet
Mapua Institute of Technology: Experiment No. 2
14 pages
Diesel Generator Set QST30 Series Engine: 680 KW - 1000 KW 60 HZ Data Center Continuous
No ratings yet
Diesel Generator Set QST30 Series Engine: 680 KW - 1000 KW 60 HZ Data Center Continuous
5 pages
Excel 2016 Intermediate Cheat Sheet
No ratings yet
Excel 2016 Intermediate Cheat Sheet
3 pages
Pelapisan Hidrokoloid Untuk Menurunkan Penyerapan Minyak Pada French Fries
No ratings yet
Pelapisan Hidrokoloid Untuk Menurunkan Penyerapan Minyak Pada French Fries
8 pages
Industrial Training Institute List of Lesson Semester - 2: SR. No. Weekn O Lesso N No. Description Time Remark S
No ratings yet
Industrial Training Institute List of Lesson Semester - 2: SR. No. Weekn O Lesso N No. Description Time Remark S
4 pages
Are Discretionary Accruals A Good Measure of Audit Quality?
No ratings yet
Are Discretionary Accruals A Good Measure of Audit Quality?
17 pages
Properties of Pipe
No ratings yet
Properties of Pipe
5 pages
X Ray Technician Q Paper of Written Examination Dated 05.02.2023
No ratings yet
X Ray Technician Q Paper of Written Examination Dated 05.02.2023
12 pages
B.Tech 7th Sem PDF
No ratings yet
B.Tech 7th Sem PDF
11 pages
Cable Route Survey Dictionary
No ratings yet
Cable Route Survey Dictionary
18 pages
Electrial Power Distribution Systems
100% (1)
Electrial Power Distribution Systems
69 pages
SOR_Description_CC18276
No ratings yet
SOR_Description_CC18276
21 pages

Customer Analytics at Flipkart

Uploaded by

Customer Analytics at Flipkart

Uploaded by

Customer Analytics at Flipkart:

Recommendation System – Collaborative filtering in Flipkart.

Hybrid = Content + Collaborative

So, this Flipkart can use Hybrid also.

Using Jaccard Similarity:

Sim (a,b) = |rA intersects rB|

 Finding Similarity >> Knowledge based similarity / Collaborative ( Use distance

Big basket – Focussed on RECOMMENDATION ENGINE ONLY (not on Churn prediction )

Flipkart & QWE & HR Analytics – Logistic Regression -Algorithm Steps :

1. Find the categorical features in the data set – Gender, age

Algorithm - See the link

Confusion Matrix – Evaluation Metrics considered during Churn prediction.

customer would churn (1), and they actually do churn (1)

Negatives are more important to pay attention to.

Flipkart / QWE/ HR Analytics – Logistic Regression

HR – no recommendation engine as it is about prediction for joining the company or not .

You might also like