Movie Recommendation System
Movie Recommendation System
https://doi.org/10.22214/ijraset.2020.32064
International Journal for Research in Applied Science & Engineering Technology (IJRASET)
ISSN: 2321-9653; IC Value: 45.98; SJ Impact Factor: 7.429
Volume 8 Issue XI Nov 2020- Available at www.ijraset.com
Abstract: Recommendation systems have become an important role in today’s digital world. During the last few decades, with the
rise of YouTube, Amazon, Netflix and many other such web services, recommender systems have taken more and more place in
our lives. From e-commerce (suggest to buyers’ articles that could interest them) to online advertisement (suggest to users the
right contents, matching their preferences), recommender systems are today unavoidable in our daily online journeys. Users get
more data related to them using these systems. Here, we are developing a recommendation system to suggest movies to users.
Keywords: Bigdata, Pyspark, Machine Learning, Recommend
I. DATASET DESCRIPTION
The dataset has 10 million ratings and 100,000 tag applications applied to 10,000 movies by 72,000 users. It was released in 1/2009.
Dataset link: https://grouplens.org/datasets/movielens/
Attributes
A. Movie id
B. Movie name
C. Rating
D. User id
E. No of ratings
A. PYSPARK
Apache spark is an open source clustering computing framework. This has been written in Scala language. Pyspark is a
collaboration of Apache spark. It can perform both stream processing and batch processing. It is widely used for machine learning
and real time streaming analytics. When you use PySpark, you will likely to get high data processing speed of about 10x faster on
the disk and 100x faster in memory. By reducing the number of read-write to disk, this would be possible.
The above output is a basic recommendation for the users. It contains the list of movies with highest average ratings and number of
ratings above 500.
We checked error for 3 ranks of matrix, and chose rank 2 as it has the least error among them.
This is given as input as the ratings given by user for 10 movies. With these, we are going to predict top movies which this user will
like.
So, we obtained the top-rated movies similar to the likes of the user.
IV. CONCLUSION
This model can be used to recommend movies to customers based on their interest. This model has a good accuracy and less error.
V. FUTURE SCOPE
This recommendation system can be used in real time recommendations like in Netflix, Amazon Prime, YouTube. It can also be
used in product recommendations like in flipkart, amazon, etc.,
REFERENCES
[1] https://ieeexplore.ieee.org/document/8663822
[2] https://www.researchgate.net/publication/319487277_Design_and_Implementation_of_Movie_Recommendation_System_Based_on_Knn_Collaborative
_Filtering_Algorithm
[3] https://link.springer.com/chapter/10.1007/978-3-642-21793-7_63
[4] https://www.semanticscholar.org/paper/A-Movie-Recommender-System%3A-MOVREC-Kumar-Yadav/621c3acadd3de1401d2bbf039ba4afb43f6e4dd2