Data Mining Final Project Report
Data Mining Final Project Report
Filtering
AD Aakash Nanthan
16BCE1061
Daniel Jebakumar
17BLC1080
VIT CHENNAI
ABSTRACT:
Recommendation systems are used for the purpose of suggesting new items
to a user depending on their past activity. They direct users towards items
which can meet their needs by processing a large database of Information.
Various techniques have been introduced for recommending items such as
content, collaborative and association mining techniques.
KEYWORDS:
INTRODUCTION:
A recommender system is a type of information filtering system that predicts
the rating or preference that a user would give to an item and uses it to
recommend similar items that the user might rate similarly. Recommendation
Systems might be targeted to individuals who do not have enough personal
experience to evaluate the potentially overwhelming alternatives that a web
site might offer. Some companies such as Amazon, Facebook and Google are
already utilizing the power of recommender systems in building their new and
existing systems. As examples, it is common to rely on peer
recommendations when selecting a book to read; employers use
recommendation letters in their recruiting decisions; and when selecting a
movie to watch, individuals read and rely on the movie reviews that a critic
has written. This approach is popularly known as collaborative-ltering and its
aim is to ensure that if the active user agreed in the past with some users,
then the other recommendations coming from these similar users should be
relevant as well and of interest to the active user. In this report, we introduce a
novel architecture for the implementation of a recommender system algorithm
which can be used to improve the recommendation of books to users.
METHODOLOGY:
PROCEDURE ADOPTED:
Find similar users: For this step we select users that have in common that they
rated the same books.
Get predictions for other books: In order to get recommendations for our user
we would take the most similar users and average their ratings for books and In
order to get recommendations for our user we would take the most similar users
and average their ratings for books.
1. Weighting the ratings by similarity - This means that the more similar a
user is to the current user the more weight his/her ratings receive in the
users co-rated. The more books these users co-rated the more reliable is
Fig.1 Fig.2
Fig.3 Fig.4
Fig.5
Fig.6 Fig.7
Fig.8
From Fig.1, it is quite apparent that most books have ratings of 4 and 5,
From Fig.2, we can see that a lot of users have rated only 0 to 10 books,from
Fig.3, we can see that the average rating is around 4, from Fig.4, it is clearly
visible that at least around 20 people rate the book and from Fig.5, it is visible
that the average user rates 4 for any book.
Fig.6, tells us that Fantasy and Romance are the most rated books out of all the
books, fig.7, tells us that a majority of the books are in english and finally Fig.8,
tells us that the longer the title length, the higher the average rating. We also
found that The Hunger Games is the most rated book, followed by Harry Potter.
CONCLUSION:
2. Davidson, J., Livingston, B., Sampath, D., Liebald, B., Liu, J., Nandy, P.,
Vleet, T.V., Gargi, U., Gupta, S., He, Y., Lambert, M.: The YouTube video
recommendation system.
3. Jacobson, K., Murali, V., Newett, E., Whitman, B., Yon, R.: Music
personalization at Spotify. In: Proceedings of the 10th ACM Conference
on Recommender Systems (RecSys ’16). ACM, New York.
4. Sarwar, B., Karypis, G., Konstan, J., Reidl, J.: Item-based collaborative
filtering recommendation algorithms, Proceedings of the tenth
international conference on World Wide Web - WWW 01 (2001).
8. Hu, Y., Koren, Y., Volinsky, C.: Collaborative Filtering for Implicit
Feedback Datasets.
9. Liu, N.N., Xiang, E.W., Zhao, M., Yang, Q.: Unifying explicit and implicit
feedback for collaborative filtering.