Research Paper 2.0
Research Paper 2.0
and Captions
Shrikant Ilhe Dhananjay Jadhav
Dr.Radhakrishna Naik
Dept. of Computer Engineering Dept. of Computer Engineering
Dept. of Computer Engineering
Sanjivani College of Engineering Sanjivani College of Engineering
Sanjivani College of Engineering
Kopargaon, India Kopargaon, India
Kopargaon, India
Abstract - Depression, a widespread mental health issue, With the global prevalence of depression on the rise,
presents significant challenges in early detection despite its there is an urgent need for scalable and non-invasive
global impact. This paper introduces a pioneering method that
utilizes Convolutional Neural Networks (CNN) and Logistic methods to identify individuals at risk and provide timely
Regression for sentiment analysis of Instagram posts and support. Traditional approaches to depression detection
captions, aiming to identify indicators of depression. By often rely on subjective self-reporting or clinical
harnessing the textual content from social media platforms like assessments, which can be time-consuming and may
Instagram, this approach offers a non-invasive and scalable
solution for recognizing individuals at risk of depression.
miss subtle signs of the disorder. By leveraging the vast
Furthermore, the development of a Flask web application amount of user-generated content on social media
enhances accessibility and usability, enabling seamless platforms, such as Instagram, this proposed system offers
interaction with the proposed model for enhanced mental a novel and data-driven approach to mental health
health support and intervention. By analyzing the emotional screening. Through the analysis of both image and text
content shared on social media platforms like Instagram, this data, the system aims to capture a holistic view of users'
approach seeks to identify subtle signs of depression,
facilitating timely intervention and support. The development emotional states and behaviors, enabling more accurate
of a Flask web application further democratizes access to and personalized detection of depressive symptoms.
mental health resources, allowing individuals to proactively
monitor their well-being and seek assistance when needed. This
holistic approach holds potential for revolutionizing depression Furthermore, the development of a Flask web
detection and improving outcomes for those affected by this application extends the reach of the system beyond research
pervasive condition. settings, making it accessible to a wider audience. This user-
friendly interface allows individuals to easily input their
Keywords: Depression detection, Convolutional Neural Networks Instagram handles or specific posts for analysis, receiving
(CNN), Logistic Regression, Sentiment analysis, Instagram, Social instant feedback on their mental health status. By
media, Mental health, Early detection, Flask web application. empowering users to take an active role in monitoring their
well-being and seeking support when needed, the system
I. INTRODUCTION promotes early intervention and reduces the stigma
Depression is a prevalent mental health disorder that associated with mental health issues. Through the
affects millions of people worldwide, yet its early detection convergence of cutting-edge technology and user-centered
remains a significant challenge. Timely identification of design, this system represents a significant step towards
depressive symptoms is crucial for initiating appropriate improving mental health outcomes and fostering a more
interventions and improving outcomes. In response to this supportive and inclusive online community.
pressing need, this paper proposes an innovative system
that utilizes advanced machine learning techniques, II. LITERATURE REVIEW
specifically Convolutional Neural Networks (CNN) and
Logistic Regression, for sentiment analysis of Instagram Over the past few years, researchers have actively
posts and captions. By harnessing the rich textual and developed numerous algorithms and techniques for
visual content shared on social media platforms like sentiment analysis, making it a continuing research focus to
Instagram, this system aims to detect subtle indicators of date. This field of study encompasses various inquiries,
depression, facilitating early intervention and support. including image processing, the application of machine
Additionally, the integration of a Flask web application learning methodologies, pattern recognition, computer
enhances accessibility, allowing individuals to interact vision, and the utilization of neural networks.
seamlessly with the model for personalized mental health
assessment and assistance. Through this comprehensive
approach, the system seeks to revolutionize depression
detection and contribute to improved well-being for
individuals at risk.
In their paper [1], Yu Ching Huang, Chieh-Feng Chiang, However, the second trial revealed lower accuracy in image
and Arbee L. P. Chen explore the potential of social media analysis compared to captions, highlighting the need to
data for predicting depression tendencies. Their research enhance image analysis accuracy, extend functionality to
demonstrates the effectiveness of a machine learning public accounts, and improve overall aesthetics. The report
approach that integrates text, image, and social behavior suggests exploring additional libraries, such as openCV and
data. The authors employ transfer learning to pre-train a TensorFlow with Vuforia, to refine the image analysis
Convolutional Neural Network (CNN) model for feature algorithm. The vision includes making the mobile app
extraction from images. These features are combined with compatible with computers and potentially transforming it
text and behavior data to construct a deep learning classifier. into a browser extension. The comprehensive literature
Remarkably, the classifier achieves an F-1 score of 82.3%
review emphasizes the practical challenges faced by the
and an Area Under ROC Curve (AUC) exceeding 0.5,
indicating robust performance in depression detection. system, ranging from accuracy concerns to the need for
broader applicability and enhanced user interface design,
I the research paper [2], In their study titled "Utilizing setting the stage for further improvement and development.
Social Media Data for Depression Detection: A Machine
Learning Approach," Smith et al. investigate the feasibility III. METHODOLOGY
of leveraging social media activities to predict depression Our methodology focuses on developing a Depression
tendencies. The research employs a machine learning Detection System utilizing Convolutional Neural Networks
framework integrating text, image, and behavioral data. (CNN) and Logistic Regression to analyze Instagram posts and
Through the use of advanced algorithms and transfer captions for signs of depression.
learning techniques, the study achieves promising results in
depression detection, with an F-1 score of 80% and an AUC
exceeding 0.7.
1. Data Collection:
The research paper [3] The paper "Deep Learning for
We start by collecting a dataset of Instagram posts containing
Depression Detection: Integrating Text, Image, and Social
captions, specifically targeting users who may exhibit
Behavior Data" by Johnson et al. explores the application of
depressive symptoms in their content. The dataset is gathered
deep learning methods in predicting depression tendencies
using Instagram's API and web scraping techniques, ensuring a
from social media data. By combining text, image, and
diverse range of posts across different demographics and user
behavioral features, the study constructs a deep neural
profiles.
network classifier. Experimental results show strong
performance, with an F-1 score of 85% and an AUC of 0.75,
highlighting the potential of deep learning approaches in
2. Preprocessing:
depression detection..
The collected data undergoes preprocessing to extract text from
The paper [4] In "A Comprehensive Study on
captions and process images for analysis. Text preprocessing
Depression Detection Using Multimodal Social Media
techniques such as tokenization, stemming, and stop-word
Data," Wang et al. investigate the effectiveness of
removal are applied to clean and standardize the text data.
multimodal data analysis for depression detection. The study
Images are resized and normalized to a standard format suitable
utilizes text, image, and behavioral data extracted from
for input into the CNN model.
social media platforms. Through the implementation of
ensemble learning techniques, including CNNs and logistic
regression, the research achieves notable performance
metrics, with an F-1 score of 87% and an AUC exceeding 3. CNN for Image Analysis:
0.8, underscoring the importance of multimodal approaches A CNN model is trained on the preprocessed image data to
in depression detection. Significance of the human face as a analyze visual cues indicative of depression. The model extracts
crucial identifier and communication tool, prompting the features from images to identify patterns associated with
development of a practical solution for managing class depressive content. Transfer learning may be employed using
attendance. Their proposed system aims to aid lecturers in pre-trained CNN architectures such as VGG or ResNet to
automatically detecting students' faces within a classroom leverage existing knowledge and improve model performance.
environment, subsequently recording their attendance based
on the recognized faces.
The paper [5] In their work titled "Detecting Depression 4. Logistic Regression for Caption Analysis:
in Social Media Using Machine Learning," Ruoxi Ding and Simultaneously, a Logistic Regression model is trained on the
Yu Sun present the Intelligent System for Social Media preprocessed text data from captions to analyze linguistic
Depression Detection. Focused on automating the patterns and sentiment indicative of depression. The model
identification of youth depression on Instagram, assigns probabilities to captions, predicting the likelihood of
particularly among students who express their feelings on depressive content based on textual features extracted from the
social media instead of seeking medical help, the system captions.
utilizes AI and Deep Learning techniques. Employing web
scraping and the Instagram private API, the system gathers
captions and images from users' personal profiles to 5. Fusion of Results:
determine the potential presence of depressive content. The outputs from the CNN image analysis and Logistic
Supported by the Google Cloud dataset, the software Regression caption analysis are combined using ensemble
conducts sentiment analysis for caption evaluation and methods or weighted averaging to generate a final prediction for
image classification for content analysis. Implemented in each Instagram post. This fusion of results enhances the overall
Python for the backend and Dart with Flutter for the front- accuracy and robustness of the depression detection system.
end, the system underwent two trials. The first trial,
involving 15 students, demonstrated the algorithm's high
accuracy in identifying depression through captions. 6. Evaluation and Validation:
The performance of the system is evaluated using standard system's sensitivity and specificity.
metrics such as accuracy, precision, recall, and F1-score. The
dataset may be divided into training, validation, and test sets Moreover, ethical considerations regarding user privacy
for model training and validation. Cross-validation techniques and data security remain paramount. Striking a balance
may also be employed to ensure the generalization of the between algorithmic accuracy and user confidentiality is
model to unseen data. essential to ensure the responsible deployment of our
depression detection system in real-world settings.