ICDAIC 2023 Paper 51
Techniques: A Review
Nikhil Kumarline
Department of CSE, Galgotias University, Greater Noida, UP, India
nikhil.21SCSE1120001@galgotiasuniversity.edu.in
Jatin Suteri
Department of CSE, Galgotias University, Greater Noida, UP, India
[email protected]
Amit Vishkarma
Department of CSE, Galgotias University, Greater Noida, UP, India
amit.21SCSE1420044@galgotiasuniversity.edu.in
Abstract—With the advancement of technology and the World Wide Web, people have been able to express their thoughts to everyone. The sentiments of individuals have gained a new medium through which to spread, and people now routinely express their opinions on a wide range of subjects. This medium holds a huge amount of information: users can not only share their opinions but also read the opinions of others, which can in turn affect their own. The opinion-rich information stored on the Internet is an excellent source for companies and researchers seeking to understand the sentiment of the masses, which led to the development of sentiment analysis. Many researchers have therefore conducted studies in this field to analyze the emotions of the public. This research is useful for the public, whose sentiment gets noticed, and for businesses, which can analyze that sentiment and adopt strategies accordingly. Our goal in this paper is to study the research conducted in this field to identify underlying assumptions, highlight methodological limitations, and propose avenues for future research.
Keywords—Machine learning, Sentiment Analysis, Algorithms, Tools and Techniques
I. INTRODUCTION
In today's world, where social media platforms, online reviews, and customer feedback play a key role in shaping public opinion, understanding sentiment has become necessary. Sentiment analysis, a subfield of natural language processing (NLP), has become a powerful tool for extracting and comprehending human emotions, attitudes, and opinions from textual data. With the growth of information on the internet, sentiment analysis has become important for understanding today's world and the opinions of its people.
Supervised machine learning approaches have gained significant attention and proven to be highly effective in addressing sentiment analysis challenges. These methods learn from labeled examples, enabling them to generalize and make predictions on unseen text with impressive accuracy. This paper aims to explore the potential of supervised machine learning algorithms in sentiment analysis, uncovering their strengths, limitations, and the underlying methodologies that make them invaluable in this domain. A large amount of data is available on the internet, which is a particularly reliable source of datasets for these learning algorithms. Sites that provide important datasets include Twitter, YouTube, IMDB movie reviews, and Amazon. Beyond these, certain sites provide datasets specifically so that models can be trained and used by businesses for their benefit. The main objective of this study is to provide a comprehensive overview of research papers by different authors and to examine, from their perspective, what a sentiment analysis model should be and how it should work. We will delve into the research these authors conducted, the methodology they used, the outcomes they obtained, and the conclusions they derived. By analyzing the existing literature and empirical studies, we will assess the performance of different machine learning algorithms, such as support vector machines (SVM), Naïve Bayes, and random forests, in various sentiment analysis tasks.
Furthermore, this paper will shed light on the importance of high-quality labeled datasets and of the feature engineering used by these authors in sentiment analysis. We will explore different feature representation methods, including bag-of-words, n-grams, word embeddings, and deep learning-based techniques, highlighting their impact on sentiment classification accuracy. Moreover, we will discuss the challenges associated with training supervised models, such as data imbalance, domain adaptation, and the interpretation of model decisions. By investigating the current views of engineers and how they propose their models for sentiment analysis, this research aims to contribute to the ongoing discourse on sentiment analysis using supervised machine learning. The insights gained from this study can aid researchers, practitioners, and industry professionals in selecting appropriate algorithms and methodologies for sentiment analysis applications across diverse domains, ranging from customer feedback analysis and brand management to social media monitoring and public sentiment tracking.
The rest of the paper is organized into sections that discuss the work done by various researchers in different domains; the preprocessing of data, which simplifies the data into parts that the algorithms can more easily process by breaking sentences into keywords; and the approach we followed for sentiment analysis. The remaining sections cover implementation details and results, followed by future work and the paper's application in daily life. We conducted a comprehensive analysis of 40 research papers by different authors to gain a thorough understanding of their studies.
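To make the supervised setting described in this introduction concrete, the following is a minimal sketch of how labeled text can be turned into bag-of-words and n-gram features and passed to a Naïve Bayes classifier. It is an illustration only and is not taken from any of the reviewed papers: the helper names (tokenize, ngram_features, NaiveBayesSentiment) and the four training sentences are hypothetical, and the choice of unigrams plus bigrams with add-one smoothing is an assumption made for brevity.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    cleaned = "".join(ch.lower() if ch.isalpha() else " " for ch in text)
    return cleaned.split()


def ngram_features(text, n_max=2):
    """Return a bag (Counter) of word n-grams for n = 1 .. n_max."""
    tokens = tokenize(text)
    bag = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            bag[" ".join(tokens[i:i + n])] += 1
    return bag


class NaiveBayesSentiment:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.doc_counts = Counter(labels)            # documents per class
        self.feature_counts = defaultdict(Counter)   # n-gram counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            bag = ngram_features(text)
            self.feature_counts[label].update(bag)
            self.vocab.update(bag)
        return self

    def predict(self, text):
        bag = ngram_features(text)
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label in sorted(self.doc_counts):
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus smoothed log likelihood of each known n-gram.
            score = math.log(self.doc_counts[label] / total_docs)
            for feature, freq in bag.items():
                if feature in self.vocab:  # ignore n-grams never seen in training
                    score += freq * math.log((counts[feature] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data, invented for this illustration only.
train_texts = [
    "I loved this movie, great acting",
    "great product, works well",
    "terrible service, very disappointed",
    "I hated the plot, boring and bad",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = NaiveBayesSentiment().fit(train_texts, train_labels)
prediction = model.predict("great acting and a great movie")
```

With this toy data, prediction is "positive" because the unseen sentence shares the n-grams "great", "acting", "movie", and "great acting" with the positive examples. A real study would use one of the public datasets mentioned above and a held-out test set. The same feature bags could equally be passed to an SVM or a random forest.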