Aspect-Based Sentiment Analysis Using A Hybridized Approach Based On CNN and GA
Aspect-Based Sentiment Analysis Using A Hybridized Approach Based On CNN and GA
fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000.
Digital Object Identifier 10.1109/ACCESS.2017.DOI
ABSTRACT Sentiment analysis is a computational analysis of unstructured textual data, used to assess
the person’s attitude from a piece of text. Aspect-based sentimental analysis defines the relationship among
opinion targets of a document and the polarity values corresponding to them. Since aspects are often implicit,
it is an extremely challenging task to spot them and calculate their respective polarity. In recent years, several
methods, strategies and improvements have been suggested to address these problems at various levels,
including corpus or lexicon-based approaches, term frequency and reverse document frequency approaches.
These strategies are quite effective when aspects are correlated with predefined groups and may struggle
when low-frequency aspects are involved. In terms of accuracy, heuristic approaches are stronger than
frequency and lexicon based approaches, however, they consume time due to different combinations of
features. This article presents an effective method to analyze the sentiments by integrating three operations:
(a) Mining semantic features (b) Transformation of extracted corpus using Word2vec (c) Implementation
of CNN for the mining of opinion. The hyperparameters of CNN are tuned with Genetic Algorithm
(GA). Experimental results revealed that the proposed technique gave better results than the state-of-the-
art techniques with 95.5% accuracy rate, 94.3% precision rate, 91.1% recall and 96.0% f-measure rate.
INDEX TERMS Aspect-based Sentiment Analysis; Convolutional Neural Network; Genetic Algorithm;
VOLUME 4, 2016 1
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
A. RESEARCH CONTRIBUTION emotions, but these idioms are not considered in the proposed
This paper puts forward an efficient procedure sentimental method of analysis. Considering the document metaphor,
analysis. satirical and ironic challenges the proposed method. In [9], a
method based on deep learning has been proposed to catego-
• A structural technique for extraction of features from
rize user opinions expressed in comments (called RNSA). It
unsaturated data of hotel, automobiles and movie re-
performs well with the provided dataset, however, execution
views has been defined
failed while processing a large dataset. In [10], author used
• Preprocessing of data.
a deep feedforward neural network with a global mean pool
• Semantic features have been extracted.
and a long-term short-term memory model with dense layers
• Convolutional neural network (CNN) is being used to
to identify subjective information from text documents and
extract opinions
solve emotional prediction problems at the sentence level. In
• The tuning of CNN parameters is done with a multi-
[11], an RNN-based method is proposed that uses a word
objective function using Genetic Algorithm (GA).
vector as the input acquired from the GloVe method. The
The order of the paper is as follow Section: 2 include model they suggest took into account the impact of market
literature review, Section: 3 contains Problem statement, Sec- trends, and other factors on sentiment predictions.
tion: 4 and Section: 5 presents the proposed methodology and Rezaeinia et al. [12], an improved word vector model are
proposed techniques, Section: 6 contain experimental design proposed by combining dictionary-based methods, part-of-
while the study of the results in Section: 7 and eventually speech tagging methods, word localization techniques, and
Section: 8 concludes the paper. word2vec / GloVe methods. To learn to embed emotion-
specific words, [13] describes how to integrate emotional
II. LITERATURE REVIEW information in the text. They also propose how to develop
This section reflects a comprehensive literature review of a neural network model to handle fine SA. SA of Hindi
the research work carried out in the SA field. Optimiz- comments, [14], learning model centered on CNN. They
ing data structures, optimizing query technology and opti- tried distinctive settings to modify input size and regulariza-
mizing parallelism can improves system’s efficiency. As a tion technology to determine CNN parameters Output size,
built model, it performs well enough for users competing dropout rate, period, activation function, etc. To improve the
in power, response rate, and expansibility. Al-molsmi et al. accuracy of SA of Arabic data CNN and LSTM is proposed
[4] submitted a commentary on cross-domain SA, as SA in [15]. This model uses soft voting in it, the predicted
has gained much consideration in current years. Therefore, category probabilities of the data are averaged over the two
technology, methods, detailed overview of methods can be CNNs. Then select the LSTM model and the category with
used for cross-domain SA by providing a comprehensive the highest average as the final selection Integrated model
literature introduction. Jianqiang et al. [5] also performed a prediction.
comparative study to analyze the preprocessing methods used Lee et al. [16], assessed the impact of text quality Com-
in the analysis of twitter. The accuracy and F1-measurement ments established on comment length, word count, and
of the classifier for twitter classification can be enhanced with readability. Emotion analysis task is performed on movies
the use of a suitable preprocessing method. dataset. Three models of deep learning family (simple CNN,
Wei et al. [6] has developed an ontological system for the LSTM, and RNN). The authors claim the dataset is short
analysis of the product review. A sentiment ontology tree is and easy to read higher accuracy compared to long and
designed to reflect the information of product functionality short length data sets readability. The CNN-based method
and the emotions associated with it in a hierarchical relation- proposed in [17], user behavior (personal characteristics
ship. Human-labeled data analysis ensures greater accuracy. and social activity) for emotion analysis. In [18], authors
This model can extract features automatically in future. It proposed a new model of neural network with two hidden
may reduce manual effort, while it could compromise a bit layers. The first layer shows sentence vectors that indicate
of accuracy Karagoz, P., Kama, [7], presented a framework sentences in short-term and long-term memory networks
for SA of Turkish informal texts using Frequency Based and the second layer encodes sentence relationships into a
Aspect Extraction with Sentiment Word Support (FBAE- document representation. It also suggests an improved way
SWS), and Web Search Based Aspect Extraction (WSBAE). to first clean the dataset and remove the emotionally less
This article emphasized on improving aspect extraction as polar sentences in the dataset. In [19], SA used a rule-based
an unsupervised method and detect the polarity depending method with the help of SentiWordNet and SVM for feature
on the aspect of the emotional word. It also provides a tool, extraction along with term frequency and inverse document
including a Graphical User Interface (GUI) for implementing frequency.
the proposed algorithm and visualizing the analysis results. In [20], author suggested a way to determine the sentiment
In [8], a new SA integration method based on POS and of reviews based on the hotel dataset. Hotel reviews are
n-gram has been proposed. It also considered semantics, preprocessed into a list of terms. First, the potential Dirichlet
emotional cues, and the order between words called EnSWF. Allocation (LDA) was determined Glossary; Semantic sim-
Some comments or opinions include idioms that help express ilarity then sorts the term list according to the topics gen-
VOLUME 4, 2016 3
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
erated by potential Dirichlet assignments. (LDA) integrated dex utilized is a piece of SemEval 2015 and the outcome
into the five sides of the hotel. Next, when computing the underpins the Twitter SemEval 2015 benchmark and no
resemblance, the Frequency anti-clustering frequency (TF- high-quality highlights have been utilized in this structure.
ICF) method. Finally, classify consumer sentiment (satis- The F1score surveyed was 64.85%, Future scope of this
fied or not) with word embedding and long-term memory structure is that the entire framework would be recreated
(LSTM). Kontopoulos et al. [21] presented the execution through utilizing twitter-based organizations that could make
of a real ontology-based deployment methodology for more improvement in effectiveness of the model under exchange.
effectual twitter’s tweet SA. The suggested model utilizes Ouyang et al. In [34], an approach was proposed that in-
the ontology methodology to analyze the twitter in a more cluded Word2Vec and CNN, and three sets of convolutional
effective way not just by giving a complete tweet score, but layer and max pooling layer were used in this project. This
also by analyzing any hidden dimension of tweets, instead, is the first hypothesis that word2vec and CNN use statements
provide scores for a certain tweet which makes the process using a seven-layer structure model, a linear parameter recti-
of feature engineering more reliable to achieve a higher fier with standardized functionality, and crimping techniques.
degree of accuracy. Ciric et al. [22] suggested a model for Publicly accessible movie review data uses five different tags.
evaluating the tweet’s sentiment. Several machine learning Negative, some negative, neutral, some positive, and positive.
models are used in the proposed work to conduct opinion System test accuracy is 45.4
mining, and their assembly is carried out in order to achieve Jindal et al. [35] built the image system, emotion recog-
better outcomes. The gathered outcomes are equated with nition framework works with deep and complex neural net-
various approaches for thorough analysis. The tabular view work. This methodology has been pulled together with huge
of literature review is shown in Table 1. data for the identification and transition of objects. The data
Bahraini et al. [23] suggested a hybrid model for the study set utilized was the image called Flicker. In [36] differen-
of twitter tweets by target-based sentiment. A hybrid solution tial neural system model has been proposed. It combines a
is shown to outperform other strategies as well as demon- convolutional neural network with treebank data to guide
strated better results with different features and functions. the conclusion checking task. The model provides various
Freitas et al. [24] recommended a model for extraction of elements such as grammatical information and structural
features through ontology technique, and data sources are information data, and its effect is superior to any other single-
data sets for a movie review and hotel reviews. The analysis aspect model. Authors used Static word embedding , and
of sentiment is remarkably effective with higher efficiency static word embedding can be designed by showing which
and accuracy in their experiments. Zhao et al [25] the au- terms work to express the rating in a better way. We can also
thors proposed a Chinese-language approach that achieved provide prefix and suffix data for sensory extraction. Bouazizi
better results than previous models. It has acquired greater et al. [37] built up a framework for a feeling examination of
accuracy. In [26], author introduced a model in which the twitter tweets with a specific capacity to identify mocking
Iris General Electronics tweets were classified in 3-class explanations that improve the effectiveness of conclusion
sentiment. Using a supervised learning method and lexicon- mining. The capacity to identify wry tweets results at a more
based subjectivity score, 61.6 percent accuracy was achieved. significant level of exactness. Abbasi et al. [38] recommends
Sam et al. [27] simplified models has been developed to a model using multilingual SA to share data over the internet.
evaluate customer comments about electronic products on so- SA in the English and Arabic dialects is achieved by using
cial networking sites. The keyword extraction and electronic style and syntax tools to capture specific element parts. The
product ontology creation is about understanding consumer execution results show the accuracy is above 91%.
behavior online. Ravi et al. [28] and Yadollahi et al. [29] Valdivia et al. [39] introduced a model for the presentation
tended to an investigation of the systems utilized for opinion of a TripAdvisor SA matching between user emotions and
mining. Clavel et al. [30] additionally utilized SA in human- automated emotion detection algorithms. Che et al. [40]
agent interaction since conclusion digging was once in a propose an ABSA method that automatically compresses
while utilized for human-agent interaction prior. Dragoni et emotional sentences through a sentence compression random
al. [31] creating a new ontology of common sense for an field. This significantly improves ABSA performance. Bui
analysis of sentiments. Kim [32] used CNN to distribute et al. [41] performed a cancer survivor’s network study of
exam progress. This course was trained on sentence-level temporal connection interpretation of changes in emotions.
classification tasks based on pretrained word vectors. A ba- It is analyzed by presenting a new structure for the cancer
sic CNN technique with few hyperparameters adjustments survivors machine learning has been used to train sentiment
and static vectors delivers unimaginable results on various classifiers across positions that are manually labeled with
benchmarks. This technique is applied using a single layer sentiment labels to identify sentiment post such as negative
convolutional design. or positive. Wu et al. [42] have implemented a SA approach
Stojanvaski et al. [33] recommends a model having a to decision-making in the online stock platform, which pro-
profound convolutionary neural system. Pre-prepared word motes investor decision-making and the understanding of eq-
vector inserting is accomplished in this work by actualizing uity companies. An analysis of the results shows that investor
unaided learning in huge companies. The informational in- sentiment has a greater impact on value stocks. Comparative
4 VOLUME 4, 2016
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
studies with direct evaluation approaches show the reliability of the most important tasks in ABSA [21].
of the method. In Table 2 shows a comparison of the latest
technologies. IV. PROPOSED MODEL
The steps of our proposed work are data collection, prepro-
III. PROBLEM STATEMENT cessing, semantic feature extraction, word2vec representa-
ABSA aims to predict the polarity of a specific document tion, and CNN implementation stages, as shown in Fig. 2.
for a particular aspect of an entity. The neural network
architecture successfully predicted the overall polarity of the
sentence, however, SA of certain aspects is an open question A. DATA COLLECTION
[14]. Although typical SA focus on predicting positive and Web scraping technique is used for review collection; hotel
negative polarities of a particular sentence. This task works reviews collected from “https://webhose.io/ ”, automobiles
when the specified text has only one aspect and polarity. A reviews are collected from “https://www.cvedia.com/” while
more common and complex task is to predict the aspects movie reviews are fetched from ” https://seedmelab.org/”.
mentioned in a sentence and the emotions associated with The raw view of dataset is shown in Fig. 3 The reviews are
each. In ABSA, this is similar to aspect and document, where splinted into positive and negative categories. This simplifies
the relationship between each word in the document is com- the task and makes it easier to distinguish various aspects
pared to the aspect vector. The relationship between aspect of positive and negative views. In the proposed model, three
and language lacks the ability to learn expressions [18]. types of data set (hotel, automobiles, and movies) are taken
Most existing opinion mining methods are based on text- using web scraping and then preprocessing is performed to
level analysis and can only detect well-expressed opinions. get valuable data from the data set. Data is provided to
The goal of ABSA is to identify aspects of an entity and the classifier, semantic features are obtained from specific
sentiments stated on each aspect. Extracting the aspect terms domains, and the total score is determined based on the
and presenting opinions from user-generated content is one initial features. Word2vec is designed for corpora processed
VOLUME 4, 2016 5
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
A. FEATURE EXTRACTION
Words are converted into feature vectors in the first layer
of the network to extract the semantics and morphological
information about words V words represent word vocabulary
Vchar represent character vocabulary. Given sentence contain
n words (w1, w2, w3 . . . . . . . . . ..wn ) And changing Wn
(every word) into vector, according to Eq. 1.
Vn = rword , rwchar
(1)
While rword words is word level embedding and rwchar is
character level embedding. To capture semantic and syn-
tactic information word-level embedding is intended, while
FIGURE 2: Phases of aspect based sentiment analysis character-level embedding is intended to capture shape and
morphological information.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
F. NEGATIVE TRAINING
The system is trained over a training set A, to limit the
negative probability. A sentence is passed to the system con-
tain parameter set θ and each sentiment label is computing
τ ε T. These scores are converted into giving the label with
conditional probability distribution plus network parameter
set θ and then softmax operation is applied over τ ε T score,
as in Eq. 9.
eSθ (x)i
P (τ |x, θ) = P Sθ (x)i
(9)
∀i∈T e
The stochastic gradient descent is applied to limit the nega-
tive log gradient as in Eq. 10.
X FIGURE 6: Convolutional neural network
θ −→ − log P (v|x, θ) (10)
(u,v)∈A
(U, v) is the sentence compared to corpus training, and V degree of features utilizing filter with size 2. The fundamental
suppresses the corresponding label. aim is to provide sufficient range of features to grab all prob-
able descriptor of the text. Maxpool shall have the highest
G. CONVOLUTIONAL NEURAL NETWORK
output value of the vector on applying the filter. It picks a
The structure of CNN can be seen in Fig. 6, this consists strongest expression aspect in the extracted function from
of the separate layer stack which converts the volume of the output and nothing to do with the length of the word.
the input into target output by differential function. The Every sample is represented as n*1 where n represents the
constituent layers of CNN are length of dimension filter. This filter is practice as a drafting
1) Convolution layer window, for instants 3*1 filter on a sentence. I like the car
2) Max-policy layer very much! would yield (I like this, this car very, car very
3) Rely layer much, very much!) sentence splits up to equal length ahead of
4) Back propagation layer embedding hence all the filter does not result in the identical
The intention is to use sufficient filters (in this case, 128) to dimensions outcome. The area size (2, 3, 4 ) is identical to
catch enough features in a particular sentence. In classifica- 2, 3, 4 –G word and the first filter in this trigram will give
tion of images, different filters integrate different attributes different weight esteems to different words in trigrams. It
for example edges, color density at various spots, transforma- suggest that higher weight are allotted to the first index (0-
tion of colors etc. The problem of text classification stretches index) and lower to the second hence 128 filter will allocate
the similar idea to catch features for example "like" mean respective weight that will be trained to optional weight
positive rather than similarity, "very much" communicate the value after sometimes for precise forecasting. For a particular
8 VOLUME 4, 2016
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
test
80
accuracy
60
40
Mutations occur to maintain diversity within a population 0.0 2.5 5.0 7.5 10.0
Iterations
12.5 15.0 17.5 20.0
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
1.0
LG 82.36 80.5 82.85 81.10
0.5
100
0.0
Iterations
80
20
0
RF CNN DT LM LDA SVM LG
Movies Dataset
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
TABLE 5: Performance evaluation on Movies Dataset TABLE 7: Comparison of proposed model with SVM model
Models Precision Accuracy Recall Fmeasure Parameters Proposed Model SVM
Proposed 91.5 95.2 92.3 94.0 Labels 2 2
DT 75 79 85 82 Data points 200 200
SVM 89.4 92.5 88.8 90.3 Accuracy 88.4 84.8
LDA 89 80 75 85
Label NEG NEG
LM 87 86 88 90
RF 90.8 87.5 84.8 88.0 F-score 89.23% 87.69%
LG 87 88 86 85 TP: 185 FP: 6 TN: 9 TP: 168 FP: 4 TN: 23
FN: 0 FN: 5
Label POS POS
F-score 78% 64%
Fig. 16 and in tabular in Table 5. In Table 5, from an accuracy TP: 185 FP: 6 TN: 9 TP: 166 FP: 6 TN: 25
FN: 0 FN: 3
100
TABLE 8: Performance Evaluation w.r.t iterations on Movie
80 and Hotel Datasets
Iteration loss Accuracy Time Elapsed Base Learning Rate
60 Accuracy 0 9.5881 38.91% 1s 1ms 1.00E-04
Precision 1 0.4045 80.78% 0s 703us 1.00E-04
Recall 2 0.0755 92.34% 0s 586us 1.00E-04
40 F-Measure
3 0.0833 92.34% 0s 622us 1.00E-04
4 0.0851 91.56% 0s 563us 1.00E-04
20 5 0.0692 92.19% 0s 547us 1.00E-04
6 0.0702 92.03% 0s 531us 1.00E-04
7 0.0662 92.34% 0s 531us 1.00E-04
0
RF CNN DT LM LDA SVM LG 8 0.0654 92.03% 0s 531us 1.00E-04
Automobiles Dataset 9 0.0657 92.19% 0s 539us 1.00E-04
10 0.0649 92.19% 0s 531us 1.00E-04
FIGURE 17: Performance on Automobiles Dataset
perspective, the performance of the integrated method is 2% nearly all the provided classifiers for all test parameters. It
better than SVM and 6% better than RF. The accuracy of the can therefore be concluded that the recommended approach
integrated method is 91.5%, while the accuracy of RF and is 93.0% more accurate than all six comparator classifiers.
SVM is 90.8% and 89.4%. In addition, the performance of The result comparison table supports the Ensemble meth-
integrated approach is significantly better to other existing ods given. The comparison results clearly indicate that the
technologies in recall and f-measure. suggested approach produces better results in terms of accu-
racy, precision and f-measure. For certain datasets based on
TABLE 6: Performance Evaluation on Automobiles Dataset positive and negative ratings based on experimental settings,
Models Precision Accuracy Recall Fmeasure SVM has the highest accuracy. .
Proposed 91.6% 93.4% 92.2% 89.23%
DT 70% 70.6% 81.20% 77.59% VIII. CONCLUSION
SVM 85.8% 84.8% 89.9% 87.33%
This article presents an effective classification method for
LDA 76.75% 74.0% 74.15% 81.90%
LM 81.8% 80.8% 88% 87.33% sentiment analysis using Convolutional neural network and
RF 82.3% 85.6% 84.8% 85.33% Genetic Algorithm. Semantic features are mined, and then,
LG 81.8% 82.8% 86% 82.33% several models are trained together with proposed CNN
based ensembler, SVM, maximum entropy, random forest,
Fig. 17 shows the performance outcomes of the latest tech- stabilized discriminant analysis, decision tree, generalized
nologies and integration methods for automobiles datasets. linear model. Using data collected through CNN execution
The proposed method has excellent performance on automo- and analysis, domain-specific reviews reduce false positives
biles dataset in terms of accuracy, precision, recall, and f- and false positives and improve accuracy. we adjust the CNN
measure. The results are described in detail in Table 6. Table hyperparameters using a genetic algorithm to get optimal
7 shows the performance index values in percentage format, values. The experimental results show that the recommended
showing that the proposed method performs well with all method outperforms all other recent methods, with 95.5%,
metrics as compared to SVM. Table 8 shows the detailed 94.3%, 91.1%, and 96.6% for accuracy, precision, recall, and
results for the different iteration levels and the different f-measurement, respectively. Future strategies are to integrate
iterations for accuracy, loss, progress, and Base Learning parallel computing to speed up computation and explore
Rate. For different parameters, we compare the outcomes of metaheuristic-related features. A preferred work is to have
the recommended classifier with all six other classifiers in a web-based ontology framework automation to incorporate
Table 3 and conclude that the proposed approach outperforms sentiment analysis on social sites.
12 VOLUME 4, 2016
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
COMPLIANCE WITH ETHICAL STANDARDS [23] Bahrainian S-A, Dengel A (2013) Sentiment analysis and summarization
of twitter data. In: 2013 IEEE 16th international conference on computational
Conflict of interest: Their is no conflict of interest involved
science and engineering (CSE). IEEE, pp 227–234.
in this research. [24] Freitas LA, Vieira R (2013) Ontology based feature level opinion mining
for Portuguese reviews. In: Proceedings of the 22nd international conference
on World Wide Web. ACM, pp 367–370.
REFERENCES [25] Zhao Y, Niu K, He Z, Lin J, Wang X (2013) Text sentiment analysis
[1] Kumar, Ravindra, Husanbir Singh Pannu, and Avleen Kaur Malhi. "Aspect- algorithm optimization and platform development in social network. In: 2013
based sentiment analysis using deep networks and stochastic optimization." sixth international symposium on computational intelligence and design
Neural Computing and Applications (2019): 1-15 (ISCID), vol 1. IEEE, pp 410–413.
[2] Cambria, Erik, et al. "Sentiment analysis is a big suitcase." IEEE Intelligent [26] Bakliwal A, Foster J, van der Puil J, O’Brien R, Tounsi L, Hughes M
Systems 32.6 (2017): 74-80 (2013) Sentiment analysis of political tweets: towards an accurate classifier.
[3] Al-Smadi, Mohammad, et al. "Human annotated arabic dataset of book Association for Computational Linguistics, pp 49–58.
reviews for aspect based sentiment analysis." 2015 3rd International Con- [27] Sam KM, Chatwin C (2013) Ontology-based sentiment analysis model
ference on Future Internet of Things and Cloud. IEEE, 2015. of customer reviews for electronic products. Int J e-Educ e-Bus e-Manag
[4] Al-Moslmi T, Omar N, Abdullah S, Albared M (2017) Approaches to e-Learn 3(6):477.
cross-domain sentiment analysis: a systematic literature review. IEEE Access [28] Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis:
5:16173–16192 tasks, approaches and applications. Knowl Based Syst 89:14–46.
[5] Jianqiang Z, Xiaolin G (2017) Comparison research on text preprocessing [29] Yadollahi A, Shahraki AG, Zaiane OR (2017) Current state of text senti-
methods on twitter sentiment analysis. IEEE Access 5:2870–2879. ment analysis from opinion to emotion mining. ACM Comput Surv: CSUR
[6] Wei W, Gulla JA (2010) Sentiment learning on product reviews via sen- 50(2):25
timent ontology tree. In: Proceedings of the 48th annual meeting of the [30] Clavel C, Callejas Z (2016) Sentiment analysis: from opinion mining to
association for computational linguistics. Association for Computational human-agent interaction. IEEE Trans Affect Comput 7(1):74–93.
Linguistics, pp 404–413. [31] Dragoni M, Poria S, Cambria E (2018) Ontosenticnet: a commonsense
ontology for sentiment analysis. IEEE Intell Syst 33(3):77–85.
[7] Karagoz, P., Kama, B., Ozturk, M., Toroslu, I. H., & Canturk, D. (2019).
[32] Kim Y (2014) Convolutional neural networks for sentence classification.
A framework for aspect based sentiment analysis on turkish informal texts.
arXiv preprint arXiv:1408.5882
Journal of Intelligent Information Systems, 53(3), 431-451.
[33] Stojanovski D, Strezoski G, Madjarov G, Dimitrovski I (2015) Twitter
[8] Khan, J., Alam, A., Hussain, J., & Lee, Y. K. (2019). EnSWF: effective
sentiment analysis using deep convolutional neural network. In: International
features extraction and selection in conjunction with ensemble learning
conference on hybrid artificial intelligence systems. Springer, pp 726–737.
methods for document sentiment classification. Applied Intelligence, 49(8),
[34] Ouyang X, Zhou P, Li CH, Liu L (2015) Sentiment analysis using convo-
3123-3145.
lutional neural network. In: 2015 IEEE international conference on computer
[9] Abdi, A., Shamsuddin, S. M., Hasan, S., & Piran, J. (2019). Deep learning-
and information technology; ubiquitous computing and communications;
based sentiment classification of evaluative text based on Multi-feature
dependable, autonomic and secure computing; pervasive intelligence and
fusion. Information Processing and Management, 56(4), 1245-1259.
computing (CIT/ IUCC/DASC/PICOM). IEEE, pp 2359–2364.
[10] S. Rezaeinia, R. Rahmani, A. Ghodsi, and H. Veisi, "Sentiment analysis [35] Jindal S, Singh S (2015) Image sentiment analysis using deep convolu-
based on improved pre-trained word embeddings", Expert Systems with tional neural networks with domain specific fine tuning. In: 2015 interna-
Applications, Vol.117, No.44, pp.139-147, 2019. tional conference on information processing (ICIP). IEEE, pp 447–451.
[11] Souma, W., Vodenska, I., & Aoyama, H. Enhanced news sentiment anal- [36] ) Yang T, Li Y, Pan Q, Guo L (2016) Tb-CNN: joint tree-bank information
ysis using deep learning methods. Journal of Computational Social Science, for sentiment analysis using CNN. In: 2016 35th Chinese control conference
1–14 (2019). (CCC). IEEE, pp 7042–7044.
[12] Rezaeinia, S. M., Rahmani, R., Ghodsi, A., & Veisi, H. Sentiment analysis [37] Bouazizi M, Ohtsuki T (2016) Sentiment analysis: from binary to multi-
based on improved pre-trained word embeddings. Expert Systems with class classification: a pattern-based approach for multiclass sentiment anal-
Applications, 117, 139–147 (2019). ysis in twitter. In: 2016 IEEE international conference on communications
[13] Tang, D., & Zhang, M. Deep Learning in Sentiment Analysis. In Deep (ICC). IEEE, pp 1–6.
Learning in Natural Language Processing (pp. 219–253). Springer (2018). [38] Abbasi A, Chen H, Salem A (2008) Sentiment analysis in multiple
[14] Rani, S., & Kumar, P. Deep Learning Based Sentiment Analysis Using languages: Feature selection for opinion classification in web forums. ACM
Convolution Neural Network. Arabian Journal for Science and Engineering, Trans Inf Syst: TOIS 26(3):12
1–10 (2018). [39] Valdivia A, Luzo´n MV, Herrera F (2017) Sentiment analysis in tripadvi-
[15] Heikal, M., Torki, M., & El-Makky, N. Sentiment Analysis of Arabic sor. IEEE Intell Syst 32(4):72–77
Tweets using Deep Learning. Procedia Computer Science, 142, 114– 122 [40] Che W, Zhao Y, Guo H, Su Z, Liu T (2015) Sentence compression
(2018). for aspect-based sentiment analysis. IEEE/ACM Trans Audio Speech Lang
[16] Li, L., Goh, T. -T., & Jin, D. How textual quality of online reviews affect Process 23(12):2111–2124.
classification performance: a case of deep learning sentiment analysis. Neural [41] Bui N, Yen J, Honavar V (2016) Temporal causality analysis of senti-
Computing and Applications, 1–29 (2018). ment change in a cancer survivor network. IEEE Trans Comput Soc Syst
[17] Alharbi, A. S. M., & de Doncker, E. Twitter sentiment analysis with adeep 3(2):75–87.
neural network: An enhanced approach using user behavioral information. [42] Wu DD, Zheng L, Olson DL (2014) A decision support approach for
Cognitive Systems Research, 54, 50–61 (2019). online stock forum sentiment analysis. IEEE Trans Syst Man Cybern Syst
[18] W. Huang, G. Rao, Z. Feng, and Q. Cong, "LSTM with sentence repre- 44(8):1077–1087.
sentations for Document-level Sentiment Classification", Neurocomputing, [43] Rani S, Kumar P (2017) A sentiment analysis system to improve teaching
Vol.308, No.45, pp.49-57, 2018. and learning. Computer 50(5):36–43.
[19] M. Fikri and R. Sarno, “A Comparative Study of Sentiment Analysis using [44] Mouthami K, Devi K.N, Bhaskaran VM (2013) Sentiment analysis and
SVM and SentiWordNet”, Indonesian Journal of Electrical Engineering and classification based on textual reviews. In: 2013 international conference
Computer Science, Vol.13, No.3, pp.902-909, 2019]. on information communication and embedded systems (ICICES). IEEE, pp
[20] Priyantina, R. A., & Sarno, R. (2019). Sentiment Analysis of Hotel 271–276
Reviews Using Latent Dirichlet Allocation, Semantic Similarity and LSTM. [45] Zhang Y, Wallace B (2015) A sensitivity analysis of (and practitioners’
International Journal of Intelligent Engineering and Systems, 12(4), 142-155. guide to) convolutional neural networks for sentence classification. arXiv
[21] Kontopoulos E, Berberidis C, Dergiades T, Bassiliades N (2013) preprint arXiv:1510.03820
Ontology-based sentiment analysis of twitter posts. Expert Syst Appl
40(10):4065–4074.
[22] Ciric M, Stanimirovic A, Petrovic N, Stoimenov L (2013) Comparison of
different algorithms for sentiment classification. In: 2013 11th international
conference on telecommunication in modern satellite, cable and broadcasting
services (TELSIKS), vol 2. IEEE, pp 567–570 .
VOLUME 4, 2016 13
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/ACCESS.2020.3011802, IEEE Access
14 VOLUME 4, 2016
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.