Project On Movie Recommendation by Using Data Mining
Project On Movie Recommendation by Using Data Mining
Content Based
ARCHITECTURE OF
RECOMMENDATION SYSTEM-Content
Based
DIAGRAM FOR INTERACTION
WORKING OF AZURE SYSTEM IN
CONTENT BASED
• This scenario covers the training and evaluating of the machine learning model using the Spark alternating least squares
(ALS) algorithm on a dataset of movie ratings. The steps for this scenario are:
1.The front-end website or app service collects historical data of user-movie interactions, which are represented in a table
of user, item, and numerical rating tuples.
2.The collected historical data is stored in blob storage.
3.A Data Science Virtual Machine (DSVM) is often used for smaller workloads to experiment or develop a product based
on a Spark ALS recommender model. The ALS model is trained using a training dataset, which is produced from the
overall dataset by applying a data splitting strategy. For example, the dataset can be split into sets randomly,
chronologically, or stratified, depending on the business requirement. Similar to other machine learning tasks, a
recommender is validated by using evaluation metrics (for example, precision@k, recall@k, MAP, nDCG@k).
4.Azure Machine Learning coordinates the experimentation, such as hyperparameter sweeping and model management.
5.A trained model is saved to Azure Cosmos DB, which can then be applied for recommending the top k movies for a
given user.
6.The model is then deployed onto a web or app service by using Azure Container Instances or Azure Kubernetes Service.
• For an in-depth guide to building and scaling a recommender service, see the article
Build a real-time recommendation API on Azure.
PRINCIPLE OF RECOMMENDATION
SYSTEM
• Recommendation systems helps users by providing useful suggestions, thus reducing their
search time.
• These recommendations can be generated in various ways like content based, collaborative
filtering, hybrid methods and other approaches.
• In this paper, a movie data-set has been used which contains user ratings for a number movies,
by different users.
• The user ratings are used for classifying data into various categories which can further be helpful
to generate recommendations.
• In this paper, we are going to use data mining techniques to analyse user preferences and
determine user-specific movie ratings through the help of data mining techniques.
• We will use a movie database from IMDB and determine user specific ratings for each of them.
• The analysis of attributes of these movies will help us identify the decisive factors and identify
user preferences accurately
HOW DO IT SEARCH?
PROCESS BEHIND SEARCHING IN NETFLIX
Historically, Netflix has used a template-based
approach to tackle this problem of page
generation i.e. a massive blood bath of rows
competing for precious screen real estate. It is a
task focused on not only accuracy, but also
providing diversity, accessibility and stability at
the same time. Other considerations include
hardware capabilities (what device is being
used) and which rows/columns are visible at
first glance and upon scroll.
With all these requirements, one can see why a
template-based approach can work quite well
for a start because one can have a few fixed set
of criterions to be met at all times. However,
having many of such rules in place naturally
landed Netflix into a local optimum in terms of
providing a good member experience.
NETFLIX RECOMMENDED SYSTEM
• 80% of stream time is achieved through Netflix’s recommender
system, which is a highly impressive number. Moreover, Netflix
believes in creating a user experience that will seek to improve
retention rate, which in turn translates to savings on customer
acquisition (estimated $1B per year as of 2016).
• The solution and approach that Netflix uses is a Machine Learning
one, where they aim to create a scoring function by training a model
using historical information of which homepages they have created for
their members — including what they actually see, how they
interacted with and what they played.
CONTENT BASED RECOMMENDER
SYSTEMender System