Sign language emotion and alphabet recognition with hand gestures using convolution neural network
Corresponding Author:
Varsha K. Patil
Department of Electronics and Telecommunication Engineering, AISSMS Institute of Information Technology
Pune, India
Email: [email protected]
1. INTRODUCTION
Nonverbal communication [1] allows individuals to convey a great deal without speaking. The message
in nonverbal communication is sent through facial expressions, poses, and movements of body parts.
These movements are called gestures [2]. When hard-of-hearing people interact with each other, they use a
well-structured form of communication called sign language. American sign language (ASL) is one of the
most popular sign languages, in which emotions and characters are expressed through hand movements that
follow a defined set of rules. Understanding sign language communication normally requires a trained person
who has acquired signing skills. To ease social interaction between hard-of-hearing people and people with
normal hearing, there is a need for a technical implementation that recognizes emotions automatically.
This paper presents the methods, performance factors, and results for automatically
recognizing ASL. The key to this communication lies in understanding the hand gestures involved. This article
aims to automatically recognize emotions and the alphabet from standardized hand gestures and positions. The
highlight of this work is that, to the best of our knowledge, it is the first article to present a method that
recognizes the happy, love, sad, peace, confused, and together emotions with 92% or higher accuracy.
2. LITERATURE SURVEY
This section is divided into two parts. The first part discusses the broad domain of signal
processing for ASL. The second part deals with methods for increasing the accuracy of sign language recognition.
2.2. Literature on methods for increasing the accuracy of ASL processing
This part lists methods for increasing the accuracy of ASL processing. Table 1 lists and briefly
describes the strategies proposed by various authors for improving the accuracy of automatic ASL processing,
including the use of large datasets and techniques such as deep learning and glove-based methods.
3. METHOD
From the literature survey in the previous section, the general pipeline of automatic sign language
recognition starts with capturing hand gesture images, followed by preprocessing. After preprocessing comes
the crucial step of the core algorithm; approaches such as transfer learning and deep learning are used in the
literature. In this article, the method used is a CNN. To interpret the meaning of hand gesture frames from
the input video, we implement sign recognition in three steps: i) dataset creation,
ii) real-time data capturing and feature extraction by the CNN method, and iii) training and testing.
Figure 1 summarizes these steps with a block diagram consisting of preprocessing, feature extraction,
training, and testing. The output is expressed in terms of performance parameters such as the confusion matrix,
score, and accuracy. The other form of system output is the classification of emotions and alphabets. The
system shown in Figure 1 works for the confused, love, together, sad, peace, and happy emotions. The
implementation steps of the proposed system shown in Figure 1 are elaborated as follows.
Step 1: dataset creation: Initially, the input hand gesture images are captured as per ASL guidelines;
a teachable machine is used for data capturing and augmentation. The emotion dataset contains 18,000 hand
gesture images categorized into 6 emotion classes. The total size of the created dataset is 94,000 images; of
these, 18,000 images are related to the 6 emotions and the remaining 76,000 images are related to the English
alphabet, with 3,000 images captured for each alphabet. The main point of dataset creation is that we followed
ASL guidelines for representing the alphabet with hand gestures. The alphabet and emotion images are kept in
separate folders for processing.
Each alphabet class that we created contains 3,000 images. The emotion folder has subfolders for the
confused, sad, happy, together, peace, and love classes. The labeled data in the alphabet folder covers 24 English
letters, excluding J and Z. To mitigate the limitations of this dataset size, we utilized data augmentation through
Keras's ImageDataGenerator, with a width shift range of 0.1, a height shift range of 0.1, a zoom range
of 0.2, a shear range of 0.1, and a rotation range of 10 degrees. These augmentations enhance the diversity of the
dataset and improve model generalization, with augmented image batches generated at a size of 20. These
preprocessing methods address the dataset size constraints and bolster the model's robustness. The
data is labeled into folders containing emotions and alphabets; in this way we created our own dataset.
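As a concrete illustration of this augmentation setup, the following sketch configures Keras's ImageDataGenerator with the parameter values listed above; the rescaling factor, the directory path "dataset/emotions", and the 32×32 target size are assumptions added for completeness rather than details taken from the paper.

```python
# Minimal sketch of the data augmentation described above (Keras ImageDataGenerator).
# Directory name, rescaling, and target size are illustrative assumptions.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

augmenter = ImageDataGenerator(
    rescale=1.0 / 255,        # normalize pixel values (assumed)
    rotation_range=10,        # rotation range of 10 degrees
    width_shift_range=0.1,    # width shift range of 0.1
    height_shift_range=0.1,   # height shift range of 0.1
    shear_range=0.1,          # shear range of 0.1
    zoom_range=0.2,           # zoom range of 0.2
)

# Generate augmented batches of 20 images from the emotion folder,
# assuming one subfolder per emotion class ("dataset/emotions" is hypothetical).
emotion_batches = augmenter.flow_from_directory(
    "dataset/emotions",
    target_size=(32, 32),     # matches the 32x32 CNN input size used later
    batch_size=20,
    class_mode="categorical",
)
```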
Step 2: real-time data capturing and feature extraction by the CNN method: the hand gestures indicating
alphabet and emotion signs are captured from the live video feed. Image preprocessing tasks such
as resizing and zooming are performed, and the features are then extracted by the CNN. A dataset of 94,000 images
(captured and processed by us) is used as a reference. The dataset creation and preprocessing process is
already explained in step 1. Figure 2 shows a representative picture of the proposed system for identifying hand
gestures, where a preprocessed image is provided as input to the CNN and the output image shows
recognition of the peace emotion with 98.95% accuracy.
As shown in Figure 2, we have designed a three-layer CNN for sign language that takes real-time
hand gestures as input. Our system preprocesses the input image by resizing and edge processing. The output
image shows the "Peace" emotion demonstrated by hand gestures, indicated with a bounding box
(region of interest (ROI)) in the emotion identification experiment. A high accuracy of 98.56% is
observed for this experiment.
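A minimal sketch of how such a real-time capture and preprocessing loop could look with OpenCV is given below; the fixed ROI coordinates, the window name, and the 32×32 input size are illustrative assumptions, not the authors' exact implementation.

```python
# Hedged sketch of real-time hand-gesture capture and preprocessing (OpenCV).
# The fixed ROI coordinates and 32x32 input size are assumptions for illustration.
import cv2
import numpy as np

cap = cv2.VideoCapture(0)          # live video feed from the default camera
while True:
    ok, frame = cap.read()
    if not ok:
        break

    # Bounding box (region of interest) around the hand; coordinates are illustrative.
    x, y, w, h = 100, 100, 200, 200
    roi = frame[y:y + h, x:x + w]

    # Preprocessing: resize to the CNN input size and normalize.
    sample = cv2.resize(roi, (32, 32)).astype(np.float32) / 255.0
    sample = np.expand_dims(sample, axis=0)      # shape (1, 32, 32, 3) for prediction

    # prediction = model.predict(sample)         # 'model' is the trained CNN from step 3

    cv2.rectangle(frame, (x, y), (x + w, y + h), (0, 255, 0), 2)
    cv2.imshow("Sign recognition", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break

cap.release()
cv2.destroyAllWindows()
```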
The CNN processes input images of size 32×32 pixels with 3 color channels. In the
preprocessing step, the dataset is imported and the class labels are converted (encoded) for training. The CNN
has three convolutional layers: the first two layers use 32 filters of size 3×3 pixels with ReLU activation,
followed by a layer of 64 filters of size 3×3 pixels.
A MaxPooling2D layer with a pool size of 2×2 pixels is placed after the second convolutional layer;
this reduces the dimensionality of the feature map and helps avoid overfitting. The model thus includes
convolutional and pooling layers, followed by a dense layer with 512 units and a ReLU activation function
and an output dense layer with softmax activation. The network is trained with the
Adam optimizer using a learning rate of 0.001 and the categorical cross-entropy loss function. A dropout layer
with a rate of 0.25 is applied after the second convolutional layer to enhance generalization, and a further
dropout of 0.2 is used with the dense layer. Flattening the feature maps, adding a dense layer with
ReLU activation and dropout, and following it with a softmax layer are the significant design steps that add
nonlinearity and yield the best results. These steps are represented in Figures 3 and 4.
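The following Keras sketch assembles a model along the lines of this description (two 32-filter 3×3 convolutions, a 64-filter 3×3 convolution, 2×2 max pooling and 0.25 dropout after the second convolution, a 512-unit dense layer with 0.2 dropout, and a softmax output, compiled with Adam at a learning rate of 0.001 and categorical cross-entropy). The exact layer ordering and the number of output classes are assumptions where the text leaves them open.

```python
# Minimal sketch of the CNN described in the text; details the paper leaves open
# (e.g. exact dropout placement) follow one reasonable reading.
from tensorflow.keras import layers, models, optimizers

num_classes = 6  # 6 emotion classes (24 for the alphabet model)

model = models.Sequential([
    layers.Input(shape=(32, 32, 3)),                  # 32x32 RGB input images
    layers.Conv2D(32, (3, 3), activation="relu"),     # first conv layer: 32 filters, 3x3
    layers.Conv2D(32, (3, 3), activation="relu"),     # second conv layer: 32 filters, 3x3
    layers.MaxPooling2D(pool_size=(2, 2)),            # 2x2 pooling after the second conv
    layers.Dropout(0.25),                             # dropout of 0.25 for generalization
    layers.Conv2D(64, (3, 3), activation="relu"),     # third conv layer: 64 filters, 3x3
    layers.Flatten(),
    layers.Dense(512, activation="relu"),             # dense layer with 512 units
    layers.Dropout(0.2),                              # dropout of 0.2 before the output
    layers.Dense(num_classes, activation="softmax"),  # softmax output layer
])

model.compile(
    optimizer=optimizers.Adam(learning_rate=0.001),   # Adam with learning rate 0.001
    loss="categorical_crossentropy",
    metrics=["accuracy"],
)
model.summary()
```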
Our proposed model's general and specific features are shown in Figures 3 and 4. The steps are to import
the libraries, design the CNN with its different layers, and apply the feature extraction. As shown in Figure 3, the general
signal processing areas in image processing are manual localization, segmentation, morphology, and feature
extraction. We used the TensorFlow platform for sign language recognition with the CNN
approach. In our experiments with TensorFlow for hand signals, the following parameters are used for detection.
Step 3: training and testing: our model is trained on the hand gesture images for alphabet and emotion
recognition. The model offers two types of results: performance factors such as the confusion matrix and
accuracy, and the classification output itself. The data is split with a ratio of 80:20 for training and testing;
for each class, 2,400 images are used for training and 600 images for testing.
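A hedged sketch of this 80:20 split and the training call is shown below, reusing the model from the previous sketch; the use of validation_split to realize the split, the directory path, and the epoch count are illustrative assumptions.

```python
# Hedged sketch of the 80:20 train/test split and training.
# Directory path, epoch count, and the validation_split mechanism are assumptions.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(rescale=1.0 / 255, validation_split=0.2)  # 80:20 split

train_gen = datagen.flow_from_directory(
    "dataset/emotions", target_size=(32, 32), batch_size=20,
    class_mode="categorical", subset="training",
)
test_gen = datagen.flow_from_directory(
    "dataset/emotions", target_size=(32, 32), batch_size=20,
    class_mode="categorical", subset="validation", shuffle=False,
)

# 'model' is the CNN defined in the earlier sketch; the epoch count is illustrative.
history = model.fit(train_gen, validation_data=test_gen, epochs=25)
```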
Figure 3. General steps flowchart for the sign language approach
Figure 4. Flowchart for CNN-based emotion recognition with the sign language approach
4. RESULTS
The next subsections focus on two key aspects: on-screen results and the confusion matrix.
They highlight the impact of the ROI and the number of epochs on performance. Multiple graphs illustrate their
effects on the accuracy and loss metrics, and these visualizations provide insight into model evaluation.
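To illustrate how these evaluation outputs can be produced, the following sketch plots epoch-versus-accuracy/loss curves from the Keras training history and builds a confusion matrix over the test split; it reuses the model, history, and test_gen names from the earlier sketches and is an assumption-based illustration, not the authors' exact evaluation code.

```python
# Hedged sketch of producing the reported outputs: epoch-vs-accuracy/loss curves
# and a confusion matrix. 'model', 'history', and 'test_gen' come from the earlier sketches.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

# Epoch vs. accuracy and epoch vs. loss (as in Figure 7).
plt.plot(history.history["accuracy"], label="train accuracy")
plt.plot(history.history["val_accuracy"], label="test accuracy")
plt.plot(history.history["loss"], label="train loss")
plt.plot(history.history["val_loss"], label="test loss")
plt.xlabel("epoch")
plt.legend()
plt.show()

# Confusion matrix over the test split (as in Figure 12); test_gen uses shuffle=False
# so predictions align with test_gen.classes.
test_gen.reset()
probs = model.predict(test_gen)
y_pred = np.argmax(probs, axis=1)
y_true = test_gen.classes
cm = confusion_matrix(y_true, y_pred)
ConfusionMatrixDisplay(cm, display_labels=list(test_gen.class_indices)).plot()
plt.show()
```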
Figure 5. Emotions like "happy" (94.45%) and "together" (94.05%) are recognized from hand gestures
Figure 7. Graphs for alphabet and emotion for epoch vs loss and epoch vs accuracy
Figure 8. Alphabet’s sign: accuracy increases with the ROI for 24 alphabets
Figure 9. Emotion sign recognition: accuracy comparison with and without ROI
Figure 10. “R”, “W”, and “Q” alphabets recognized (e.g. “W” accuracy is 99.89%)
Figure 12. Confusion matrix for recognition of alphabets shown by hand gesture
5. DISCUSSION
We conducted a study using a self-created dataset comprising 94,000 hand gesture images to advance
the recognition of sign language alphabets and emotions. This dataset was processed using a CNN. While many
studies focus on alphabet recognition, few address emotion recognition via sign language, making this research
a pioneering effort in the field. Our system analyzes six emotions (confused, happy, sad, love, peace, and
together), evaluating accuracy with and without ROI application. Results show that incorporating the ROI
improves accuracy to up to 100%, while its absence reduces accuracy to as low as 60%. For alphabet recognition,
24 letters (excluding "J" and "Z") were analyzed, with the ROI significantly enhancing performance. Recognition
accuracy exceeded 95% for "P" and "Q", whereas it dropped to about 75% for "D" and "K".
The system achieved overall accuracy rates of 96%-98% for alphabets and 97%-99.8% for emotions.
However, limitations include dependency on factors such as camera angle, distance, and sign presentation
consistency. Future work could involve integrating non-invasive communication methods and advanced signal
processing to further optimize system efficiency and broaden its applicability in diverse real-world scenarios.
6. CONCLUSION
ASL can be understood by trained and skilled individuals; however, ASL emotions and alphabets are
not recognized by the layman. Hence there is a gap in the automatic recognition of the English alphabet using
ASL. In this article, hand gestures and movements are recognized with a CNN. The main contribution of this
work is that our system is capable of identifying emotions such as peace, togetherness, love, and happiness
with significant confidence (greater than 91%). We added the ROI to increase the accuracy.
Graphical and tabular output is presented for sign language-based emotion recognition. The ROI is a signal
processing and localization technique that places a bounding box around the hand. However, the system has
limitations in the repeatability of the readings: the very same accuracy is not observed after repeated readings.
The image characteristics influence the sequence and content of the sign language
processing steps. Additionally, the quality of the segmentation may depend on factors such as lighting
conditions, camera angle, and the complexity of the hand gesture being performed. Despite these
limitations, our system provides an average accuracy greater than 91%.
REFERENCES
[1] E. L. W. Keutchafo, J. Kerr, and O. B. Baloyi, “A model for effective nonverbal communication between nurses and older patients:
a grounded theory inquiry,” Healthcare, vol. 10, no. 11, 2022, doi: 10.3390/healthcare10112119.
[2] E. D. Stefani and D. D. Marco, “Language, gesture, and emotional communication: an embodied view of social interaction,”
Frontiers in Psychology, vol. 10, 2019, doi: 10.3389/fpsyg.2019.02063.
[3] M. Baca, P. Grd, and T. Fotak, “Basic principles and trends in hand geometry and hand shape biometrics,” New Trends and
Developments in Biometrics, 2012, doi: 10.5772/51912.
[4] S. K. Ko, J. G. Son, and H. Jung, “Sign language recognition with recurrent neural network using human keypoint detection,” in
The 2018 Research in Adaptive and Convergent Systems, RACS 2018, 2018, pp. 326–328, doi: 10.1145/3264746.3264805.
[5] T. Starner, J. Weaver, and A. Pentland, “Real-time American sign language recognition using desk and wearable computer based video,”
IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, no. 12, pp. 1371–1375, 1998, doi: 10.1109/34.735811.
[6] W. Sandler, “Prosody and syntax in sign languages,” Transactions of the Philological Society, vol. 108, no. 3, pp. 298–328, 2010,
doi: 10.1111/j.1467-968X.2010.01242.x.
[7] L. Pigou, M. V. Herreweghe, and J. Dambre, “Sign classification in sign language corpora with deep neural networks,” in LREC -
7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining, 2016, pp. 175–178.
[8] Y. Wang and L. Guan, “Recognizing human emotion from audiovisual information,” in ICASSP, IEEE International Conference
on Acoustics, Speech and Signal Processing - Proceedings, 2005, doi: 10.1109/ICASSP.2005.1415607.
[9] M. Kipp, A. Heloir, and Q. Nguyen, “Sign language avatars: animation and comprehensibility,” in Intelligent Virtual Agents, 2011,
pp. 113–126, doi: 10.1007/978-3-642-23974-8_13.
[10] R. K. Pathan, M. Biswas, S. Yasmin, M. U. Khandaker, M. Salman, and A. A. F. Youssef, “Sign language recognition using the
fusion of image and hand landmarks through multi-headed convolutional neural network,” Scientific Reports, vol. 13, no. 1, 2023,
doi: 10.1038/s41598-023-43852-x.
[11] M. Al-Qurishi, T. Khalid, and R. Souissi, “Deep learning for sign language recognition: current techniques, benchmarks, and open
issues,” IEEE Access, vol. 9, pp. 126917–126951, 2021, doi: 10.1109/ACCESS.2021.3110912.
[12] K. Li, Z. Zhou, and C. H. Lee, “Sign transition modeling and a scalable solution to continuous sign language recognition for real-
world applications,” ACM Transactions on Accessible Computing, vol. 8, no. 2, 2016, doi: 10.1145/2850421.
[13] D. Kothadiya, C. Bhatt, K. Sapariya, K. Patel, A. B. Gil-González, and J. M. Corchado, “Deepsign: sign language detection and
recognition using deep learning,” Electronics, vol. 11, no. 11, 2022, doi: 10.3390/electronics11111780.
[14] N. Al Mudawi et al., “Innovative healthcare solutions: robust hand gesture recognition of daily life routines using 1D CNN,”
Frontiers in Bioengineering and Biotechnology, vol. 12, 2024, doi: 10.3389/fbioe.2024.1401803.
[15] P. Vyavahare, S. Dhawale, P. Takale, V. Koli, B. Kanawade, and S. Khonde, “Detection and interpretation of Indian sign language
using LSTM networks,” Journal of Intelligent Systems and Control, vol. 2, no. 3, pp. 132–142, 2023, doi: 10.56578/jisc020302.
[16] I. A. Adeyanju, O. O. Bello, and M. A. Adegboye, “Machine learning methods for sign language recognition: a critical review and
analysis,” Intelligent Systems with Applications, vol. 12, 2021, doi: 10.1016/j.iswa.2021.200056.
[17] B. Subramanian, B. Olimov, S. M. Naik, S. Kim, K. H. Park, and J. Kim, “An integrated mediapipe-optimized GRU model for
Indian sign language recognition,” Scientific Reports, vol. 12, no. 1, 2022, doi: 10.1038/s41598-022-15998-7.
[18] X. Jiang, M. Lu, and S. H. Wang, “An eight-layer convolutional neural network with stochastic pooling, batch normalization and
dropout for fingerspelling recognition of Chinese sign language,” Multimedia Tools and Applications, vol. 79, no. 21–22, pp.
15697–15715, 2020, doi: 10.1007/s11042-019-08345-y.
[19] H. Zahid et al., “A computer vision-based system for recognition and classification of Urdu sign language dataset for differently
abled people using artificial intelligence,” Mobile Information Systems, vol. 2023, pp. 1–17, 2023, doi: 10.1155/2023/1060135.
[20] H. K. Vashisth, T. Tarafder, R. Aziz, M. Arora, and Alpana, “Hand gesture recognition in Indian sign language using deep learning,”
in Engineering Proceedings, 2023, doi: 10.3390/engproc2023059096.
[21] A. Akoum and N. Al Mawla, “Hand gesture recognition approach for ASL language using hand extraction algorithm,” Journal of
Software Engineering and Applications, vol. 8, no. 8, pp. 419–430, 2015, doi: 10.4236/jsea.2015.88041.
[22] S. Mohsin, B. W. Salim, A. K. Mohamedsaeed, B. F. Ibrahim, and S. R. M. Zeebaree, “American sign language recognition based on transfer
learning algorithms,” International Journal of Intelligent Systems and Applications in Engineering, vol. 12, no. 5s, pp. 390–399, 2024.
[23] S. Srivastava, R. Jaiswal, R. Ahmad, and V. Maddheshiya, “Sign language recognition,” SSRN Electronic Journal, 2024, doi:
10.2139/ssrn.4778501.
[24] W. Lin, C. Li, and Y. Zhang, “Interactive application of data glove based on emotion recognition and judgment system,” Sensors,
vol. 22, no. 17, 2022, doi: 10.3390/s22176327.
[25] D. Satybaldina and G. Kalymova, “Deep learning based static hand gesture recognition,” Indonesian Journal of Electrical
Engineering and Computer Science, vol. 21, no. 1, pp. 398–405, 2021, doi: 10.11591/ijeecs.v21.i1.pp398-405.
[26] United Nations, “International day of sign languages,” United Nations. Accessed: Jun. 15, 2024. [Online]. Available:
https://www.un.org/en/observances/sign-languages-day
BIOGRAPHIES OF AUTHORS
Varsha K. Patil received an M.Tech. in electronics in 2009 from the Government
College of Engineering, Pune. She has filed 02 patents and 1 copyright in the technical field. She
has published more than 25 papers. She has been invited as a resource person for 10 faculty
development programs. Her areas of interest are image processing, AIoT, the internet of things,
and electronics in agriculture. She has worked to establish the Center of Excellence of Texas
Innovation Lab and IEEE Affordable Agriculture Lab. She has worked on two research grant-
funded projects and one QIP Grant. She is a life member of ISTE and IETE. She can be contacted
at email: [email protected].
Vijaya R. Pawar completed her M.E. in electronics in 2002 from Shivaji University,
Kolhapur. Bharati Vidyapeeth Deemed University, Pune, awarded her a Ph.D. degree in
electronics engineering in 2015. She has teaching experience of 28 years and research experience
of 10 years. She has filed 02 patents and 2 copyrights in the technical field. She has published
more than 47 papers, of which 28 papers are in international journals. Her 25 papers are indexed
in Scopus and 12 papers are indexed in SCI. She was invited as session chair for 12 national and
international conferences. She has also been invited as a resource person by 06 colleges for
delivering sessions on “Digital signal processing” and “Biomedical signal
processing”. She has received one research grant and 08 development grants. She is a life
member of ISTE, IEI, and IETE. She can be contacted at email: [email protected].
Vinayak Bairagi completed his M.E. in electronics in 2007. The University of Pune
awarded him a Ph.D. degree in engineering in 2013. He has teaching experience of 17 years
and research experience of 12 years. He has filed 12 patents and 5 copyrights in the technical
field. He has published more than 81 papers, of which 38 papers are in international journals.
His 71 papers are indexed in Scopus and 32 papers are indexed in SCI. He was invited as session
chair for 21 national and international conferences. He has also been invited as a resource person
by 31 colleges for invited talks. He has received four research grants. He was Chair of the IEEE
Signal Processing Society Pune chapter (2020-23). He can be contacted at email:
[email protected].