Final Project Report
Final Project Report
on
Recommendation System
Submitted in partial fulfillment for award of
BACHELOR OF TECHNOLOGY
Degree
In
COMPUTER SCIENCE & ENGINEERING
2022-23
Under the Guidance of Submitted By:
Priya Kumari Singh
Miss. Vernika Singh (2100330100174)
Assistant Professor
In today's technical era, every startup or company attempts to establish a better sort of
communication between their products and the users, and for that purpose, they require a type
of mechanism that can promote their product effectively, and here the recommender system
serves this motive. It is basically a filtering system that tries to predict and show the items that
a user would like to purchase. By analyzing the preference of the users, companies can decide
which product to be launched in the market to procure more benefits. These systems are proved
to be very beneficial in a variety of domains involving music, books, movies, research articles,
and products in common. In this paper, we review various mechanisms and techniques that are
required for recommender systems for recommending products or items in the domain of
fashion and books.
One of the main areas where this concept is currently used is e-commerce which interacts
directly with customers by suggesting products of interest with the aim of improving sales.
Motivated by the observation, a novel Domain-sensitive Recommendation (DsRec) algorithm
is proposed, to make the rating prediction by exploring the user-item subgroup analysis
simultaneously, in which a user-item subgroup is deemed as a domain consisting of a subset of
items with similar attributes and a subset of users who have interests in these items.
Collaborative Filtering (CF) is an effective and widely adopted recommendation approach.
Different from content-based recommender systems which rely on the profiles of users and
items for predictions, CF approaches make predictions by only utilizing the user-item
interaction information such as transaction history or item satisfaction expressed in ratings, etc.
ii
TABLE OF CONTENT
iv
LIST OF TABLES
v
CHAPTER 1
INTRODUCTION
1.1Motivation
Nowadays, a recommender system can be found on almost every information-intensive
website. For example, a list of likely preferred products is recommended to a customer when
browsing the target product on Amazon. Moreover, when watching a video clip on Youtube, a
recommender system employed in the system suggests some relevant videos to users by
learning the users’ behaviors that were generated previously. So to speak, recommender
systems have deeply changed the way we obtain information. Recommender systems not only
make it easier and more convenient for people to receive information, but also provide great
potential for economic growth as described in. As more and more people realize the importance
and power of recommender systems, the exploration of designing high-quality recommender
systems have been remaining an active topic in the community over the past decade. Due to
the continuous efforts in the field, thankfully, many recommender systems have been
developed and used in a variety of domains. Based on this, a key question arising is how to
know the performance of recommender systems so that the most suitable ones can be found to
apply in certain contexts or domains. The answer goes to evaluating recommender systems by
conducting rigorous and scientific evaluation experiments.
Evaluating recommender systems has become increasingly important with the growing
popularity of recommender systems in some applications. It is often the case that an application
designer needs to choose between a set of candidate recommendation algorithms. This goal can
be achieved by comparing the performance of these algorithms in evaluation experiments.
Besides, evaluating recommender systems can help researchers to select, tune and design
recommender systems as a whole. This is because when designing a recommender system,
some key factors influencing the system’s quality often come too noticed in the process of
evaluation. For example, in, Her locker et al. highlight that, when considering evaluation
metrics, evaluators should not only take into account the accuracy metrics, but also some extra
quality metrics, or to say beyond accuracy metrics, which attach importance to the fact that
users are often not interested in the items that they already know and surely like but sometimes
in discovering new items and exploring diverse items.
1.2 Objective
The objective of recommender systems is to provide recommendations based on recorded
information on the users' preferences. These systems use information filtering techniques to
process information and provide the user with potentially more relevant items.
1
CHAPTER2
- Storage: 2 GB
- Internet Connection
2
CHAPTER 3
The system architecture consists of the 3 main steps or the technologies which help in
completing the recommendation system:
COLLABORATIVE FILTERING
SENTIMENT ANALYSIS
3
CHAPTER 4
Train Test
Dataset Dataset
Apply Adjusted
Cosine Similarity
4
4.1.2 APPLY ADJUSTED COSINE SIMILARITY
As the different users rate products differently so we need to use a mechanism to find the
similarities between the users’ choices. firstly, the products rated by user 1 and user 2 will be
collected and then we will use the Cosine similarity or the adjusted Cosine similarity to find
the user’s similarity.
User 1 =
User 2 =
PRODUCTS Aussie volume Colorex Blu-ray/dyd
shampoo disinfecting
bathroom cleaner
RATINGS 5 4 5
Table 2: rated products of user 2
Now,
we will use the adjusted Cosine similarity, will find the average rating of the users
User 1 = {4+5+5}/3
= 4.6
User 2 = {5+4+5}/3
= 4.6
The next step will consist of the subtraction of the average rating of each user from each user’s
different rating products.
User 1 =
6
IMPORTING LIBRARY
AND DATA
APPLY OVERSAMPLING
TECHNIQUE
APPLY TF-IDF
VECTORIZATION
CALCULATE
SENSITIVITY AND
SPECIFICITY
OVER SAMPLING refers to copying or duplicating the minority class until it matches the
majority class count. Oversampling is a technique used in data mining and data analytics to
modify unequal data classes to create balanced data sets.
We can easily understand it by an example of the minority class containing 600 values and the
majority class containing 30000 values, we will duplicate the minority class 50 times to make
it equal to the majority class.
TF-IDF = TF*IDF
7
IDF = log (total number of documents)
No. of lines in which that particular word appears
As users give more priority to reviews rather than ratings because reviews clarify emotions and
expressions in a better way than ratings. so, we have assigned w1=1 and w2=2 means reviews
are taken as more important than the ratings. So, those products that will have higher rankings
will have higher ratings and reviews. So, using product ranking will make it much easier to
recommend the products.
8
CHAPTER 5
PROJECT SNAPSHOT
Based on the proposed experimental scenarios, the following gives the results and analysis of
them. These are the recommended products for Joshua after using sentiment analysis:
9
CHAPTER 6
LIMITATIONS
• The cold-start problem: Collaborative filtering systems are based on the action of
available data from similar users. If you are building a brand new recommendation
system, you would have no user data to start with. You can use content-based filtering
first and then move on to the collaborative filtering approach.
• Scalability: As the number of users grows, the algorithms suffer scalability issues. If
you have 10 million customers and 100,000 movies, you would have to create a sparse
matrix with one trillion elements.
• The lack of the right data: Input data may not always available.
• Lack of data analytics capability: Deep learning-based recommendation engines can
demand high computational complexity. If the data that is fed to the model is less
accurate or valuable, the result will be less useful. So, before investing in
recommendation engines, make sure your business is up to the complex data analytics
demands required.
• Inability to capture changes in user behavior : Consumers do not stand still – they
are constantly behaving and evolving both as people and customers. Staying on top of
these changes is a constant battle. A strong recommendation engine will be able to
identify changes (or signs of impending changes in customers’ preferences and
behavior, and constantly auto-train themselves in real-real-time order to serve relevant
recommendations
10
CHAPTER 7
FUTURE WORK
Expand to support more algorithms. The application now only supports three collaborative
filtering algorithms, user-based, item-based, and biased matrix-factorization. As the
application is readily extensible for recommendation algorithms, it is expected to include more
other algorithms, such as hybrid algorithms, content-based algorithms, demographic-based
algorithms, etc.
In the future, the recommendation system can be solved by using the concept of deep learning.
In this, we can use an RNN which means Recurrent Neural Network. It is a sub-class of neural
network that has a memory unit and feedback unit which makes it better to find the patterns in
the data set. The main component in the RNN is the memory element which keeps a record of
all the previous calculations.
11
CONCLUSION
12
REFERENCES
[1] Khanvilkar, G., & Vora, D. (2018). Sentiment Analysis for Product Recommendation
Using Random Forest. International Journal of Engineering & Technology.
https://doi.org/10.14419/ijet.v7i3.3.14492
[2] Yi Ren, Jingke Xu, Jie Huang and Cuirong Chi “Research on Collaborative Filtering
Recommendation Algorithm for Personalized Recommendation System”, 2019 9th
International Conference on Education and Social Science (ICESS 2019).
[3] Prateek Sappadla, Yash Sadhwani and Pranit Arora “Movie Recommender System”, Search
Engine Architecture, spring 2017, NYU Courant.
[6] Lavanya, R., Rithika Lahari, PalakGupta. (2019). “An Optimal Enhancement of the
Dynamic Features of Recommender Systems” International Journal of Recent Technology and
Engineering, 8(2S4), 51-55.
[7] Gharbi Alshammari, Stelios Kapetanakis, Abdullah Alshammari and Nikolaos Polatidi
“Improved Movie Recommendations Based on a Hybrid Feature Combination Method“,
International Journal Of Engineering And Computer Science, July 2019.
[8] Yibo Wang, Mingming Wang, and Wei Xu “A Sentiment-Enhanced Hybrid Recommender
System for Movie Recommendation: A Big Data Analytics Framework”, 1School of
Information, Renmin University of China, Beijing 100872, China, March 2018.
[9] Vishwa, Bhavesh, Aman Gupta, Pranal Soni “Movie Recommendation System”,
International Research Journal of Engg and Technology Volume: 05 Issue: 02, Feb-2018.
[10] Qianzi Shen, Zijian Wang, and Yaoru Sun “Sentiment Analysis of Movie Reviews Based
on CNN-BLSTM”, Springer International Publishing AG 2017.
[11] Yashar Deldjoo, Markus Schedl, Balázs Hidasi, Peter Knees. "Multimedia Recommender
Systems." , RecSys ’18 Proceedings of the 12th ACM Conference on Recommender Systems,
Pages 537-538.
13