0% found this document useful (0 votes)

16 views

Sentiment Analysis of IMDb Movie Reviews A Comparative Study On Performance of Hyperparameter-Tuned Classification Algorithms

Uploaded by

Saad Tayef

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

Sentiment Analysis of IMDb Movie Reviews A Comparative Study On Performance of Hyperparameter-Tuned Classification Algorithms

Uploaded by

Saad Tayef

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

Sentiment Analysis of IMDb Movie Reviews : A

comparative study on Performance of
Hyperparameter-tuned Classification Algorithms
Ayanabha Ghosh
Department of Computer Science and Engineering
Indian Institute of Technology Jodhpur
Jodhpur, India
[email protected],in ghoshayanabha@ grnail.com

Abstract-Sentiment Analysis (SA) is a sub-domain of Nat- semi-structured format depending upon the source. Pre-
ural Language Processing where useful insights on sentiment processing of texts includes several computational steps
and opinion of people can be obtained and analyzed from such as Removing punctuation, stemming and/or lemma-
various textual data in structured, unstructured and semi-
structured format. In this work, I have tried to analyze tization, removal of stopwords and building vocabulary
6 with filtered tokens. It is an important step as it eventually
o the sentiment of viewers from the IMDb Movie Reviews
u.J
u.J
dataset. For this, I have taken three different Supervised filters out various important tag words. One can easily
u.J
learning methods, namely, Linear Support Vector Machine, understand the implication of a review by looking at
Logistic Regression and Multinomial Naive Bayes Classifier,
N
N
o the tokens. Thus, a wrong or bad pre-processing may
N
each with different settings of hyperparameters. Moreover,
g
o to capture the notion of informal jargon, approach of N- sometimes lead to wrong conclusions, which is not at
o
,...; grams has been followed. Furthermore, a comparative study all intended. After this, the analysis can be done by the
'"
</l-
-..... has been performed to find the one best model from each of machines by capturing the tag words that may actually
N
N
-..... the different types of above mentioned Supervised learning describe the meaning of a sentence.
U'l
J, techniques based on their Accuracy Score, FI-Score and AVC The next step of pre-processing involves building the
.-<
Score. In this approach, I have obtained the best accuracy
00
o
score of around 0.910 and mean Fl-score after 10-fold Cross classifier which can use the knowledge base and extract
""
U'l
\!)

If
validation of around 0.894. meaningful insights. There are several algorithms out
.-< Index Terms-Text mining, Information retrieval, Senti- there such as Linear Support Vector Machines, Logistic
00 ment analysis, Binary Classification, Supervised learning, Regression, K-Nearest Neighbours, Random Forest and so
"
en
IMDb dataset, AVC score. on. There are two types of learners namely Eager Learners
Vl
u
u and Lazy Learners. Eager learners build model from the
«
S:! I. INTRODUCTION existing knowledge base i.e. training data and use the
V>
E model to predict the unknown data which is given to it.
OJ
t:>- In the present era of social media and online content On the other hand, Lazy learners are those who store the
U'l
C
distribution, huge amount of data is generated at a daily training data and perform classification at the inference
o basis by the users of such platforms and thus, the amount
:;:;
ro stage when unknown data is available.
.~
c of data is exponentially growing day by day. One of In this work, I have used the IMDb Movie Reviews
::J
E the major part of such data is the reviews given by the dataset to perform the task of sentiment analysis us-
E
U
o users on various things such as their purchases on e- ing three different supervised learning algorithms namely
"'0
C commerce platforms, the shows they watch, the news they Linear Support Vector Machine, Logistic Regression
ro
OJ)
c
read and so on. Such opinions expressed by the people and Multinomial Naive Bayes Classifier. Various com-
:;:;
::J
Q.
help the stakeholders to understand their sentiment. From binations of the hyperparameters have been taken and
E
o a business point of view, the reviews given by a group fine-tuned so as to find the best performing model on this
U
"'0 of consumers on a particular product help to gain insights dataset. Furthermore, the performance of the models have
OJ
u
c regarding customers' needs or requirements, which help a been evaluated using three different evaluation metrics
ro
>
"'0 business to grow and improve. namely, Accuracy Score, FI-Score and AVC Score.
«
c
o As the reviews or opinions corne from various sources In Section II, I have given a brief survey of previ-
OJ
u
c
in unstructured or semi-structured format, it becomes quite ous works done in this area. Section III describes the
e: difficult for a human being to bring those in structured methodology of experiments carried out in this work. In
~
c
o format, analyse them and find valuable insights from those. the second last Section IV, experimental results have been
u
roc Here is the need of machines which have the capability to shown and explained with proper diagrams and plots.
o
:;:; do aforementioned objectives in a short span of time.
ro
c The ability of a machine to learn and process things at a II. PREVIOUS WORKS
~ higher rate gives rise to a sub-domain of text mining called The authors of [1] have applied four different classi-
oS
-;:; Sentiment Analysis or Opinion Mining. Text processing fication algorithms on IMDb movie reviews dataset and
00
N
N
o
and Sentiment analysis is a challenging task as it includes evaluated their performance using Accuracy Score, FI
N
processing of streams of textual data in unstructured or Score and AVC Score. The authors have not mentioned

978-1-6654-0816-5/22/$31.00 ©2022 IEEE

289
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.
2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

any particular settings of the hyperparameters for the 4) Removal of Stopwords: Each review is split into to-
classification algorithms. kens and from the generated token streams, stopwords are
The authors of [6] have used 3 different classification removed. For this step, the English language stopwords'
algorithms for performing sentiment analysis. The draw- set from nltk. corpora has been used.
back is they have not taken different hyperparameters 5) Stemming: Stemming is a heuristic process of chop-
and language models. Moreover, the authors have only ping off the tail of each word in order to achieve a
used Term Frequency for quantifying the reviews. They base form of the token. In this step, stemming has
have obtained accuracies 85.69%, 86.23% and 83.54% for been performed on the remaining tokens using Porter's
Linear SVM, Logistic Regressor and Multinomial Naive Stemming Algorithm. In this work, I have used the
Bayes classifiers respectively. PorterStemmer () method of nl tk.
The authors of [3] have examined the sentiment from Another alternative to stemming is the process called
IMDb Movie Reviews dataset to find the polarity of the Lemmatization, which involves converting the token into
movie reviews on a scale of 0 (highly disliked) to 4 their actual dictionary form. It is a computationally expen-
(highly liked). Then they have performed feature extrac- sive process as compared to Stemming and using this to
tion, followed by training a multilabel classifier to classify pre-process large dataset will take much more time.
the reviews into its correct label. They have obtained an C. Bag-of-Words & TF-IDF Vectorization
accuracy of around 88.95%.
In text dataset, tokens are the features of a particular
The authors have [4] have applied sentiment analysis on
document. Unique token are extracted from the entire
IMDb Movie Reviews Dataset in which, they have applied
dataset or corpus and a dictionary is created. Relevant
various steps of text processing and feature selection,
information such as Count and/or TF_IDF weightage etc.
followed by classifying them into positive or negative
is stored corresponding to each of the features and a Bag-
reviews. Furthermore, they have evaluated the model with
of-Words representation is created. As machines cannot
eight different classifiers using five different evaluation
understand the tokens in text format, these numerical
metrics.
valued features are fed to the learner so as to make it
analyzable by the machine.
III. METHODOLOGY
Here, I have used the TF-IDF scoring of the tokens
The steps which have been followed are described in to map them into numerical values. TF-IDF stands for
this section. Term Frequency - Inverted Document Frequency. Term
Frequency is defined as the total number of times a
A. The Dataset token is appearing in a document. As the number of
words in a document may greatly vary, Document Length
I have chosen the IMDb Movie Reviews Dataset from Normalization has been performed while finding TF. The
Kaggle. This dataset is originally derived from the Large expression for finding TF is as follows,
Movie Review Dataset, which consists of around 50,000
highly polar movie reviews among which, 25,000 are given tf(t, d) = d.l~~~th
as Training data and rest 25,000 are for Test. The dataset where ft,d is the number of times term t occurs in
in CSV format I have used here is made from the original document d; d.length denotes the total number of words
dataset. It has two columns namely review and sentiment. in document d.
The review column consists of all 50,000 reviews from Inverted Document Frequency of a term is defined as the
Training and Test set. The sentiment column consists of logarithmically scaled fraction of the corpus size and the
the corresponding sentiment as either positive or negative. number of documents which contain that term. It captures
The entire dataset has been divided into 3 subsets by how much information a particular term may contain.
randomly picking up each instances, Train, Validation and Moreover, the logarithm rewards the terms appearing less
Test set in a ratio of 70:15:15. number of times as well as penalizes the influence of
terms occurring higher number of times. The expression
B. Preprocessing for finding IDF is as follows,
After loading the dataset, it has to be pre-processed in a idf(t) = log dICt)
proper step-by-step manner. The pre-processing steps that
we have followed are described below, where df(t) = Id ED: t E dl is the Document Frequency
1) Removal of HTML and CSS tags: The reviews of t; D is the corpus; N = IDI is the size of corpus. It
contain certain HTML tags such as <br> ... </br> and may also happen that a term may not be present in the
CSS stylings, which need to be removed during pre- corpus, therefore making the denominator df(t) = O. To
processing. For this purpose I have used the built-in HTML avoid such problem, we add 1, hence smoothing the IDF
parser of BeautifulSoup library. score.
2) Casefolding: Casefolding is done in order to nor- idf(t) = log l+;?;-(t)
malize tokens so that same words written with different
cases or one with first letter capitalized can be treated as Therefore, the TF-IDF score is calculated by multiply-
same tokens. ing the TF and IDF of a term obtained by following the
above steps.
3) Removal of Punctuations: The punctuations have
been removed from each of the reviews. tfidf(t, d, D) = tf(t, d) x idf(t, D)

290
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.
2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

But in this approach also, tf( t, d) of term t gets rewarded At the end, the model is being saved into the secondary
when it appears significantly higher number of times. So, a memory. For each of the candidate supervised techniques,
common modification can be done as taking the logarithm best model is found and saved into the secondary storage.
of the term frequency as shown, 3) Loading and Prediction: At the inference stage, the
saved models are listed and parsing the filenames, the
tfidf(t, d, D) = (1 + log(tf(t, d))) x idf(t, D)
particular type of classifier is loaded into main memory.
All the reviews of the dataset are pre-processed through The model predicts the unseen Test data and gives its
the steps explained above. Text pre-processing is a lengthy prediction. Then, based on the prediction of Test data
and time-taking process. For this reason, to ease the points, the Accuracy Score, F1 Score and AUC Score are
implementation, I have saved a copy of the pre-processed computed for Test dataset.
dataset in a seperate CSV file and used in further steps. The steps have been depicted in Fig. 1
D. Model Selection and Hyperparameter tuning
As the problem, being addressed, is a Binary Classifica-
tion Problem, we have chosen the Linear Support Vector
Machine, which uses Linear hyperplane to separate the
two classes. Other classification methods I have chosen
are Logistic Regression and Multinomial Naive Bayes.
Linear SVM has been widely regarded to be one of the
best classification algorithm for classifying text data [5].
1) Setting up different hyperparameters: At first, dif-
ferent hyperparameter values for each of the algorithms
have been taken. In case of Linear SVM and Logistic
Regression, the following hyperparameter settings have
been taken,
• C: It is a regularization parameter which is inversely
proportional to the regularization strength. Lower
values of C allow the classifier to misclassify more
number of points. The following values of C have
been taken while finding the best model: 0.001, 0.01,
0.1, 1, 10, 100.
• Class Weight : In case of imbalanced training
dataset, we can provide more weightage to the class Best Linear

having less number of corresponding training sam- SVM

ples. This is set to 'balanced' which adjusts weights

inversely proportional to class frequencies in the input
data.
• Maximum iteration: It denotes the maximum iter-
ations to be taken by the solver to converge. In this
case, the value has been kept as 100000.
In case of Multinomial Naive Bayes Classifier, different
Fig. 1. Flowchatt of Methodology
values of smoothing hyperparameter a have been taken. I
have taken the same values as of the hyperparameter C in
case of Linear SVM and Logistic Regression. The values IV. EXPERIMENTAL RESULTS
of hyperparameters ficprior and class_prior have been left
with default values which are True and None respectively, For the evaluation of the models, I have taken 3 metrics
as per Scikit-learn's implementation. namely Accuracy Score, F1 Score and Area Under the
Furthermore, I have also considered different n-gram Curve (AUe) Score.
ranges starting from (1, 1) to (1, 4) i.e. considering a • Accuracy Score : It is defined by the total number
single token as a single feature and considering upto 4 of correct classifications predicted by the classifier.
consecutive tokens together as a single feature. In case It is computed as,
of n-gram value of 2, it means taking single token as
single feature, then considering two consecutive tokens as accuracy(y, f)) = 11 XI:~~1 I(Yi = Yi)
features, therefore increasing the size of feature space.
2) Training: For each different hyperparameters and where N is the total number of samples, y is the
for each different n-gram range, separate classifiers of the ground truth and f) is the predicted values given by
selected supervised learning techniques have been trained the classifier.
using Train set. In each iterations, a best_model flag • Fl Score: The formula for computing F1-score is,
keeps track of the model's accuracy score and F1 score on
Validation set. If a model yields better F1 score than in 2 x Precision x Recall
the previous iteration, the best_model flag gets updated. f Iscore = -=------=----,-,---
Precision + Recall

291
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.
2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

where Precision is defined as, for a particular label,

Linear SVM Performance for different ngram and C
the fraction of total data points correctly classified
into that label (True Positives) by the classifier among 0.91
-
-
I-gram
2-gram
all the data points that the classifier has predicted to 0.90
- 3-gram
be of that label (i.e. fraction of Relevant data among 4-gram
flI 0.89
the Retrieved data). o
u
~ 0.88
u
i!'
G 0.87
.. TruePositives .';!
P reC2s2on = -=---=-------=----,-----:::----- 0.86
TruePositives + FalsePositives
0.85

Recall is the fraction of total data points correctly 0.84

L....,----,------r----r------r---,.------r---'
classified into a label (True Positves) by the classifier 0.0001 0.001 0.01 0.1
C values
10 100

among all the data points with that label present in the
dataset (i.e. fraction of Retrieved among the Relevant Fig. 2. C-Accuracy Score plot of Linear SVM with different n-gram
Data). range

TruePositives Linear SVM Performance for different ngram and C

R£ca II = - - - - - - - - - - - - - -
TruePositives + FalseN egatives 0.89 - I-gram
- 2-gram
0.88 - 3-gram

The FI-score has been calculated by taking the mean 4-gram

0.87
of FI-scores after lO-fold Cross validation using the flI
Validation dataset. .x 0.86
o

• AVe Score: Area Under Curve (AUC) Score is the ~

0.85
measure or the ability of a classifier to distinguish
between the classes. It measures the area under the 0.84

Receiver Operating Characteristic (ROC) curve. ROC 0.83

Curve plots two parameters, namely, True Positive
0.0001 0.001 0.01 0.1 10 100
Rate (TPR) along the vertical axis and False Positive C values
Rate (FPR) along the horizontal axis.
Fig. 3. C-Fl Score plot of Linear SVM with different n-gram range

TPR = TruePositive
TruePositive + FalseN egative
1----:==========:::;;-]
ROC Curve of Best LinearSVM Classifier
1.0

FPR =
0.8
FalsePositive
FalsePositive + TrueN egative
.&
8!ClI 0.6
>
"'o
·in
The higher the AUC score, the better the classifier can c.. 0.4
ClI

distinguish between positive and negative classes, in

:J
Fe
case of a binary classification problem. 0.2

During the training, at each iteration a record of accuracy 0.0 - ROC curve (area = 0.97)
score and FI score along with the C and n-gram range 0.0 0.2 0.4 0.6 0.8 1.0
values have been kept and plotted for each model in a 2D Fa Ise Positive Rate

space. The obtained results are as follows,

Fig. 4. ROC plot of Linear SVM with AUC Score

A. Linear SVM
The C vs Accuracy Score plot is shown in Fig.-2 and B. Logistic Regression
the C vs FI Score for the same is shown in Fig.-3. Clearly, The C vs Accuracy Score plot is shown in Fig.5 and the
it can be said that for I-gram, the accuracy and FI score C vs FI Score for the same is shown in Fig.-6. Clearly,
increase at first but it drastically reduce for higher value of it can be said that for I-gram, the FI score was higher
C. Moreover, it never performed better than other n-gram than other n-gram ranges at first but it reduced for higher
range. Among all, the best performing n-gram range is (1, value of C. On the other hand, the Accuracy can be seen
2) when the values of C are higher. varying somewhat similar to what has been observed in
The Test Accuracy score and F I-score of the best model case of Linear SVM. Among all, the best performing n-
are 0.910 and 0.894 respectively. The AUC score for the gram range is (1, 2) when the value of C is higher.
best performing Linear SVM model is 0.97 [Fig. 4]. The Test Accuracy score and F I-score of the best model

292
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.
2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

are 0.907 and 0.893 respectively. The AVC score for the Multinomoal Naive Bayes is giving very poor performance
as compared to other n-gram language models considered
here. The Accuracy score as well as FI Scores (for differ-
Logistic Regression Performance for different ngram and C ent a values) of Trigram and 4-gram language models are
0.91 - I-gram close to each other and decrease with higher values of a.
- 2-gram
0.90 - 3-gram However, in this experiment, 4-gram model has yielded
0.89
4-gram highest Accuracy score for a=O.I and highest FI Score
flI
.x 0.88
o for a=l.
>-
u
The Test Accuracy score and F I-score of the best model
0.87
~
:J
u
are 0.896 and 0.872 respectively. The AVC score of the
.'i 0.86

0.85
MNB Performance for different ngram and alpha
0.84
0.89
0.0001 0.001 0.01 0.1 10 100 0.88
C values
0.87
flI
Fig. 5. C-Accuracy Score plot of Logistic Regression with different n- .x 0.86
o

gram range i'7

~ 0.85
:J
u
.'i 0.84
- I-gram
0.83 - 2-gram
Logistic Regression Performance for different ngram and C
- 3-gram
0.82
0.89 - I-gram 4-gram
- 2-gram
0.88 - 3-gram 0.0001 0.001 0.01 0.1 10 100
No. of Alpha
4-gram
0.87

flI 0.86 Fig. 8. a value - Accuracy Score plot of Multinomial NB with different
8
ttl
n-gram range
~ 0.85

0.84

0.83

0.82
I
0.0001 0.001 0.01 0.1 10 100
0.86
-
-
MNB Performance for different ngram and alpha
I-gram
2-gram
- 3-gram
C values
4-gram
0.84

Fig. 6. C-Fl Score plot of Logistic Regression with different n-gram flI
range ~ 0.82
~
best performing Logistic Regression model is 0.97 [Fig. 0.80
7].
0.78

ROC Curve of Best Logistic Regressor 0.0001 0.001 0.01 0.1 10 100
1.0 No. of Alpha

0.8 Fig. 9. a value - F1 Score plot of Multinomial NB with different n-gram

.& range
~'" 0.6
>
best performing Multinomial Naive Bayes is 0.96 [Fig.
"'o
·in
10].
~ 0.4
:J
F
Classifier Accuracy F1 Score AUC Score
0.2
Score
Linear SVM 0.910 0.894 0.97
0.0 - ROC curve (area = 0.97) Logistic Regression 0.903 0.893 0.97
L....,------r------r-----,-----r-----,---l
0.0 0.2 0.4 0.6 0.8 1.0 Multinomial Naive Bayes 0.896 0.872 0.96
False Positive Rate
TABLE I
EXPERIMENTAL RESULTS
Fig. 7. ROC plot of Logistic Regression with AUC Score

C. Multinomial Naive Bayes V. CONCLUSION

The a values vs Accuracy Score plot is shown in Fig.- In this work, I have tried to present a comparative
8 and the a values vs FI Score for the same is shown study of different traditional classification algorithms with
in Fig.-9. Clearly, it can be said that for unigram models, different hyperparameter settings on the task of sentiment

293
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.
2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

ROC Curve of Best MNB CLassifier

0.8

0.6

0.4

0.2

0.0 - ROC curve (area = 0.96)

L....,------r------r------r------r------r--'
0.0 0.2 0.4 0.6 0.8 1.0
False Positive Rate

Fig. 10. ROC plot of Multinomial NB with AUC Score

analysis from IMDb Movie Reviews Dataset. The reviews

are properly pre-processed so that it can be fed to these
models. The final results were evaluated using 3 different
evaluation metrics. From that, it can be concluded that in
this problem, the combination of TF-IDF with n-gram
range (1, 2) + Linear SVM with C=10 has given highest
accuracy score and F I-score.
In future, the experiment can be carried out using more
advanced word embeddings such as Word2Vec which
captures the semantic similarity between the words. Exper-
iment can also be carried out to find if one classification
algorithm can correctly classify the misclassified examples
of another classifier. In this way, committee can be created
which will be able to correctly predict all the examples,
yielding higher performance.
REFERENCES
[I] S. Tripathi, R. Mehrotra, V. Bansal and S. Upadhyay, "Analyzing
Sentiment using IMDb Dataset," 2020 12th International Confer-
ence on Computational Intelligence and Communication Networks
(CICN), 2020, pp. 30-33, doi: 1O.1l09/CICN49253.2020.9242570.
[2] Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang,
Andrew Y. Ng, and Christopher Potts. (2011). Learning Word
Vectors for Sentiment Analysis. The 49th Annual Meeting of the
Association for Computational Linguistics (ACL 2011).
[3] T. P. Sahu and S. Ahuja, "Sentiment analysis of movie re-
views: A study on feature selection & classification algorithms,"
2016 International Conference on Microelectronics, Computing and
Communications (MicroCom), 2016, pp. 1-6, doi: lO.l109/Micro-
Com.2016.7522583.
[4] M. Yasen and S. Tedmori, "Movies Reviews Sentiment Analysis
and Classification," 2019 IEEE Jordan International Joint Con-
ference on Electrical Engineering and Information Technology
(JEEIT), 2019, pp. 860-865, doi: 10.1l09/JEEIT.2019.8717422.
[5] D. O. Ratmana, G. Fajar Shidik, A. Z. Fanani, Muljono and
R. A. Pramunendar, "Evaluation of Feature Selections on Movie
Reviews Sentiment," 2020 International Seminar on Application for
Technology of Information and Communication (iSemantic), 2020,
pp. 567-571, doi: 10. I 109/iSemantic50169.2020.9234287.
[6] A. Poornima and K. S. Priya, "A Comparative Sentiment Analysis
Of Sentence Embedding Using Machine Learning Techniques,"
2020 6th International Conference on Advanced Computing and
Communication Systems (lCACCS), 2020, pp. 493-496, doi:
10.1 109/1CACCS48705.2020.90743 12.

294
Authorized licensed use limited to: Rajshahi University Of Engineering and Technology. Downloaded on January 28,2024 at 11:41:06 UTC from IEEE Xplore. Restrictions apply.

English Vocabulary (List of Words, Part II), C2
No ratings yet
English Vocabulary (List of Words, Part II), C2
7 pages
NLP Final Mini Project
No ratings yet
NLP Final Mini Project
17 pages
96. OKE JUGA - Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory
No ratings yet
96. OKE JUGA - Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory
4 pages
Synopsis
No ratings yet
Synopsis
8 pages
Sentiment Analysis of IMDb Movie Reviews Using LSTM
No ratings yet
Sentiment Analysis of IMDb Movie Reviews Using LSTM
4 pages
Iscs 476
No ratings yet
Iscs 476
18 pages
"Sentiment Analysis of Imdb Movie Reviews": A Project Report
0% (1)
"Sentiment Analysis of Imdb Movie Reviews": A Project Report
22 pages
Sentiment Analysis Using Feature Selection and Machine Learning Algorithms
No ratings yet
Sentiment Analysis Using Feature Selection and Machine Learning Algorithms
48 pages
JETIRCJ06015
No ratings yet
JETIRCJ06015
4 pages
Sentiment Analysis On IMDB Movie Reviews Using Machine Learning and Deep Learning Algorithms
No ratings yet
Sentiment Analysis On IMDB Movie Reviews Using Machine Learning and Deep Learning Algorithms
6 pages
Sentiment Analysis of Talaash Movie Reviews Using Text Mining Approach
No ratings yet
Sentiment Analysis of Talaash Movie Reviews Using Text Mining Approach
9 pages
nlp_project(documentation)
No ratings yet
nlp_project(documentation)
8 pages
Sentiment Analysis of A Product Based On User Reviews Using Random Forests Algorithm
No ratings yet
Sentiment Analysis of A Product Based On User Reviews Using Random Forests Algorithm
5 pages
Sentiment Analysis IMDB Review - Presentation
No ratings yet
Sentiment Analysis IMDB Review - Presentation
19 pages
Peerj Cs 08 914
No ratings yet
Peerj Cs 08 914
28 pages
Analyzing Sentiment Using IMDb Dataset
No ratings yet
Analyzing Sentiment Using IMDb Dataset
4 pages
JOU Classification of sentiment reviews using n-gram machine learning
No ratings yet
JOU Classification of sentiment reviews using n-gram machine learning
10 pages
Data Science Project
No ratings yet
Data Science Project
24 pages
Cs221 Report
No ratings yet
Cs221 Report
16 pages
23. Movies Reviews Sentiment Analysis and Classification
No ratings yet
23. Movies Reviews Sentiment Analysis and Classification
6 pages
Twitter Analysis
No ratings yet
Twitter Analysis
8 pages
"Sentiment Analysis of Imdb Movie Reviews": A Project Report
No ratings yet
"Sentiment Analysis of Imdb Movie Reviews": A Project Report
27 pages
A Sentiment Analysis Approach Through Deep Learning For A Movie Review
No ratings yet
A Sentiment Analysis Approach Through Deep Learning For A Movie Review
9 pages
Sentiment Analysis of Movie Reviews
No ratings yet
Sentiment Analysis of Movie Reviews
6 pages
MADHU-IEEE Update
No ratings yet
MADHU-IEEE Update
5 pages
research paper text classification
No ratings yet
research paper text classification
17 pages
Sentiment Analysis From Movie Reviews Us
No ratings yet
Sentiment Analysis From Movie Reviews Us
5 pages
DL Project
No ratings yet
DL Project
21 pages
431_paper
No ratings yet
431_paper
5 pages
F13 Final
No ratings yet
F13 Final
23 pages
Sentimental Analysis Final Year Project
No ratings yet
Sentimental Analysis Final Year Project
21 pages
5700-Article Text-21868-1-10-20230318 (1)
No ratings yet
5700-Article Text-21868-1-10-20230318 (1)
6 pages
Dr S.K-IEEE-updated-29-07-24
No ratings yet
Dr S.K-IEEE-updated-29-07-24
5 pages
MADHU-IEEE-updated-28-07-24
No ratings yet
MADHU-IEEE-updated-28-07-24
5 pages
Twitter_Sentiment_Analysis_using_Deep_Learning
No ratings yet
Twitter_Sentiment_Analysis_using_Deep_Learning
5 pages
Abhay Raj 2019ugcs005r NLP Report
No ratings yet
Abhay Raj 2019ugcs005r NLP Report
21 pages
A Comparative Study of Some Selected Classifiers On An Imbalanced Dataset For Sentiment Analysis
No ratings yet
A Comparative Study of Some Selected Classifiers On An Imbalanced Dataset For Sentiment Analysis
7 pages
Final Presentation
No ratings yet
Final Presentation
18 pages
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
No ratings yet
(IJCST-V9I3P23) :aditi Linge, Bhavya Malviya, Digvijay Raut, Payal Ekre
3 pages
base1
No ratings yet
base1
7 pages
ML Project Report
No ratings yet
ML Project Report
26 pages
Sentiment Analysis Based On Performance of Linear Support Vector Machine and Multinomial Naïve Bayes Using Movie Reviews With Baseline Techniques
No ratings yet
Sentiment Analysis Based On Performance of Linear Support Vector Machine and Multinomial Naïve Bayes Using Movie Reviews With Baseline Techniques
19 pages
MP 1
No ratings yet
MP 1
14 pages
NILES2021 Paper 43
No ratings yet
NILES2021 Paper 43
5 pages
Sentiment Analysis On Movie Reviews Based On Combined Approach
No ratings yet
Sentiment Analysis On Movie Reviews Based On Combined Approach
4 pages
An Enhanced Sentiment Analysis Using Machine Learning Methods in Imbalanced Movie Review Streams
No ratings yet
An Enhanced Sentiment Analysis Using Machine Learning Methods in Imbalanced Movie Review Streams
6 pages
2 +intelligent+2024+paper+1
No ratings yet
2 +intelligent+2024+paper+1
12 pages
RES Presentation
No ratings yet
RES Presentation
21 pages
A Comparative Study On Linear Classifier PDF
No ratings yet
A Comparative Study On Linear Classifier PDF
3 pages
Machine Learning With Advance Model
No ratings yet
Machine Learning With Advance Model
19 pages
To Find Out The Quality and Popularity of A Product by Using User Comments
No ratings yet
To Find Out The Quality and Popularity of A Product by Using User Comments
8 pages
Analyzing The Performance of Sentiment Analysis Using BERT DistilBERT and RoBERTa
No ratings yet
Analyzing The Performance of Sentiment Analysis Using BERT DistilBERT and RoBERTa
6 pages
ISSS609 Project Proposal Group 7
No ratings yet
ISSS609 Project Proposal Group 7
8 pages
Maneesha Nidigonda Major Project
No ratings yet
Maneesha Nidigonda Major Project
11 pages
Opinion Mining Using Machine Learning
No ratings yet
Opinion Mining Using Machine Learning
3 pages
Web Mining Unit 2
No ratings yet
Web Mining Unit 2
12 pages
NLP Final
No ratings yet
NLP Final
22 pages
AI-web_scraping
No ratings yet
AI-web_scraping
18 pages
Sentiment Analysis of Movie Ratings Syst
No ratings yet
Sentiment Analysis of Movie Ratings Syst
5 pages
Preview
No ratings yet
Preview
11 pages
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
From Everand
Pedestrian Detection: Please, suggest a subtitle for a book with title 'Pedestrian Detection' within the realm of 'Computer Vision'. The suggested subtitle should not have ':'.
Fouad Sabry
No ratings yet
Đáp Án Đề Đề Xuất ĐBBB 2024
No ratings yet
Đáp Án Đề Đề Xuất ĐBBB 2024
8 pages
selfstudys_com_file (3)
No ratings yet
selfstudys_com_file (3)
9 pages
Faculty of Art, Humanity and Languages Department of English Year III Semester I
No ratings yet
Faculty of Art, Humanity and Languages Department of English Year III Semester I
25 pages
UNIT 17 JAN 25th
No ratings yet
UNIT 17 JAN 25th
5 pages
Gujarati Chemistry Hg Rdv Merit List
No ratings yet
Gujarati Chemistry Hg Rdv Merit List
5 pages
Elsema Portfolio - de Jesus Ab English 2F
No ratings yet
Elsema Portfolio - de Jesus Ab English 2F
15 pages
2bac Test1
No ratings yet
2bac Test1
3 pages
Voynich (10) - The Text To The Ponds at Page f84v in The Voynich Manuscript
No ratings yet
Voynich (10) - The Text To The Ponds at Page f84v in The Voynich Manuscript
10 pages
Notes On Notice Writing
No ratings yet
Notes On Notice Writing
3 pages
Lesson Plans 2nd Grade 15 Week
No ratings yet
Lesson Plans 2nd Grade 15 Week
2 pages
CAT Timetable Sheet (1)
No ratings yet
CAT Timetable Sheet (1)
5 pages
4 Sesion Ingles
No ratings yet
4 Sesion Ingles
23 pages
Smart Zone - Tongue Gym
No ratings yet
Smart Zone - Tongue Gym
3 pages
Grammar Articles
No ratings yet
Grammar Articles
22 pages
Adverbs of Frequency and Time Expressions
No ratings yet
Adverbs of Frequency and Time Expressions
3 pages
Homework
No ratings yet
Homework
9 pages
Christmas Conditionals
No ratings yet
Christmas Conditionals
3 pages
De Thi Tieng Anh Lop 7 Giua Ki 1 de So 2 2
No ratings yet
De Thi Tieng Anh Lop 7 Giua Ki 1 de So 2 2
3 pages
Media and Teaching Aids 1
No ratings yet
Media and Teaching Aids 1
3 pages
The Use of Mobile Applications in Teaching English Dental Fricatives To Polish Learners in Grades 4
No ratings yet
The Use of Mobile Applications in Teaching English Dental Fricatives To Polish Learners in Grades 4
6 pages
MFL RATIONALE SCOPE & SEQUENCE
No ratings yet
MFL RATIONALE SCOPE & SEQUENCE
7 pages
Pretest
No ratings yet
Pretest
5 pages
The Linguistic Structure of Modern English (Consonant Exercise)
No ratings yet
The Linguistic Structure of Modern English (Consonant Exercise)
3 pages
Direct-and-Indirect-Speech
No ratings yet
Direct-and-Indirect-Speech
139 pages
Analysis of The Road Not Taken
No ratings yet
Analysis of The Road Not Taken
16 pages
Exercises Unit 31 - 33 Starters
No ratings yet
Exercises Unit 31 - 33 Starters
4 pages
OBE 5 English
No ratings yet
OBE 5 English
8 pages
Week 6 Less 6
No ratings yet
Week 6 Less 6
5 pages
Language Functions Lecturer: Yaseen M.Taher English Language Center-Uot
No ratings yet
Language Functions Lecturer: Yaseen M.Taher English Language Center-Uot
7 pages

Sentiment Analysis of IMDb Movie Reviews A Comparative Study On Performance of Hyperparameter-Tuned Classification Algorithms

Uploaded by

Sentiment Analysis of IMDb Movie Reviews A Comparative Study On Performance of Hyperparameter-Tuned Classification Algorithms

Uploaded by

2022 8th International Conference on Advanced Computing and Communication Systems (lCACCS)

Sentiment Analysis of IMDb Movie Reviews : A

978-1-6654-0816-5/22/$31.00 ©2022 IEEE

having less number of corresponding training sam- SVM

ples. This is set to 'balanced' which adjusts weights

where Precision is defined as, for a particular label,

Recall is the fraction of total data points correctly 0.84

TruePositives Linear SVM Performance for different ngram and C

The FI-score has been calculated by taking the mean 4-gram

• AVe Score: Area Under Curve (AUC) Score is the ~

Receiver Operating Characteristic (ROC) curve. ROC 0.83

distinguish between positive and negative classes, in

space. The obtained results are as follows,

gram range i'7

0.8 Fig. 9. a value - F1 Score plot of Multinomial NB with different n-gram

C. Multinomial Naive Bayes V. CONCLUSION

ROC Curve of Best MNB CLassifier

0.0 - ROC curve (area = 0.96)

Fig. 10. ROC plot of Multinomial NB with AUC Score

analysis from IMDb Movie Reviews Dataset. The reviews

You might also like