TWITTER SENTIMENT ANALYSIS
A COURSE PROJECT ON BIG DATA ANALYTICS
B Y Te a m 1 7 A
TEAM DETAILS
Team Number : 17
Name USN Roll No
Akash Chobari 01FE18BCS023 119
Akella Sumanth 01FE18BCS024 120
Amit Raj 01FE18BCS031 127
Ayush Utsav 01FE18BCS059 153
Devatha Naga Puneeth 01FE18BCS072 164
2
PRESENTATION OUTLINE
D o m a i n I n f o r m a ti o n
Problem Statement
Te c h n o l o g y a n d t o o l s u s e d
D a t a s e t d e s c r i p ti o n
Proposed Methodology
Result and Conclusion
3
DOMAIN INFORMATION
We are living in the 21st century, which is the age of technology with increasing number
of devices connected to the Internet. This is leading to huge data generation from smart
devices and different social media platforms and many more.
This vast amount of data generated is called Big Data. Big Data is a word for large
datasets that the traditional data handling application software is pitiful to handle.
There are many social media platform and devices producing a large amount of data
some of such social media platforms are- Twitter, Facebook, Instagram, Youtube,
Reddit etc.
Twitter is a social media platform on which user post and interact with messages
known as tweets. Registered user can post, like and retweet tweets but unregistered
user can only read those tweets that are publicly available.
4
PROBLEM STATEMENT
“Twitter Sentiment Analysis”
Sentiment Analysis refers to identifying as well as classifying the sentiments
expressed in the text source.
Tweets are useful in generating a vast amount of sentimental data upon analysis
which is useful in understanding the opinion of the people about a variety of topics.
Here the aim of this project is to implement a sentiment analysis model that helps
in overcoming the challenges of identifying the sentiments of the tweets.
5
TECHNOLOGY AND TOOLS USED
Hadoop Framework
Python
Tweepy API
Text Blob
6
PROPOSED METHODOLOGY
7
RESULTS
Figure 1: Graph comparison between Neural, Positive and
Negative Sentiment
8
Figure 2: Words vs Score Labelled in Negative Sentiment Figure 3: Words vs Score Labelled in Positive Sentiment
9
Figure 4: Words vs Score labelled in Neural Category
10
CONCLUSION
Count of Text in the category of Positive sentiment: 74154
Count of Text in the category of Negative sentiment: 29464
Count of Text in the Neutral category :75490
The Tweets regarding Covid19 contains more neutral and positive sentiment than negative.
11