0% found this document useful (0 votes)

72 views5 pages

3D Human Pose Estimation with CNN

This document discusses a method for 3D human pose estimation using machine learning, specifically leveraging convolutional neural networks (CNNs) to analyze depth and ridge data for accurate joint localization. The proposed approach aims to improve pose estimation accuracy by addressing issues like joint drift and loss of 3D information, with applications in fields such as robotics, virtual reality, and healthcare. The study also highlights the importance of a robust dataset and the integration of spatial and temporal information for effective model performance.

Uploaded by

FATHIMA E

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views5 pages

3D Human Pose Estimation with CNN

Uploaded by

FATHIMA E

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

3D HUMAN POSE ESTIMATION USING

MACHINE LEARNING
Pradakshinaa P1,Harini S2, Dhivyamanohar C3, Kiruthika V R4

1
Student, Information Technology, Bannari Amman Institute of Technology, Tamil Nadu, India
2
Student, Information Technology, Bannari Amman Institute of Technology, Tamil Nadu, India
3
Student, Information Technology, Bannari Amman Institute of Technology, Tamil Nadu, India
4
AssistantProfessor, Information Technology, Bannari Amman Institute of Technology, Tamil Nadu,
India

ABSTRACT

Human posе estimation is a fundamental task in computer vision and artificial intеlligеncе that involvеs thе
еstimation of thе spatial configuration of a human body in an imagе or vidеo. Accuratе pose estimation is crucial
for a wide range of applications, including human-computеr intеraction, augmented rеality, virtual rеality,
biomеchanics, and action rеcognition. Whilе 2D posе еstimation can providе valuablе information about thе posе in
imagе spacе, 3D human posе еstimation aims to rеcovеr thе thrее-dimеnsional positions of body joints, offering a
morе complеtе and informative representation of human movеmеnt. We propose a method that uses a convolutional
neural network (CNN) to estimate human pose by analyzing the projection of the depth and ridge data, which
represent local maxima in a distance transform map. To fully utilize the 3D information of depth points, we propose
a method to project the depth and ridge data in various directions. The proposed projection method reduces the loss
of 3D information, stack data can avoid joint drift, and CNN improves localization accuracy. Separate humans from
the background using depth data and extract highlight data from human silhouettes. Project depth and elevation
data to XY, XZ, and ZY planes. ResNet-101 accepts 6 rendered images and uses heatmaps to generate 2D heatmaps
and offsets. Create 2D key points for each plane using the soft-argmax operation. Obtain detailed 3D joint positions
using fully connected layers. In experiments on SMMC-10, EVAL, and ITOP datasets, the proposed method
achieved improved pose estimation accuracy. The proposed method can eliminate the loss of 3D information and
displacement of joint positions that may occur during human pose estimation.

Keyword - Human pose Estimation, Spatial Configurations, three-dimensional space, RGB images

1. INTRODUCTION
Human pose estimation is the task of finding the parameters of a human body model, such as the length and
orientation of body parts (head, trunk, limbs, etc.) that fit an input image. Fast depth imaging devices can extract
rich information from depth images, thereby simplifying human pose estimation. An approach to detect human
joints from observed input images using a pre-trained body part detector. Used a joint detector based on geodesic
features to locate body joints in depth data. A head-torso detector based on rabid candidates and a pattern matching
algorithm for each limb, but required that the upper body and face be visible without obstruction. Human pose
recognition approach that predicts the representation of mid-body parts for human pose estimation, but this
prediction usually requires expensive training steps and large human pose space. We needed a large number of
training examples to cover. Used regression forests to directly identify co-occurrences from the votes of each pixel,
but the votes were modeled, which required more complex training. Described a method to convert depth data into
an average representation without background subtraction. Although most of these detection approaches do not
completely miss body parts, they can only detect visible parts, which suffers from occlusion problems and

21804 [Link] 1898

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

significantly reduces the accuracy of human pose estimation. A productive approach to find human joints by fitting a
predefined human body model to the observed input images. An iterative closest point (ICP) method for human pose
estimation and human body tracking, but it cannot be used due to computational complexity.

2. SCOPE

The motivation to work on 3D human pose estimation using machine learning comes from its transformative
potential in different fields. In fields such as computer vision and robotics, accurate 3D gesture estimation enables
robots and machines to better understand and interact with humans, facilitating safer and more intuitive human-
machine collaboration. In the entertainment industry, it paves the way for more immersive virtual reality experiences
and realistic character animations, increasing user interaction and realism. Additionally, in sports and healthcare, 3D
pose estimation can aid injury prevention, rehabilitation, and performance analysis by providing detailed insight into
human movement. Additionally, applications can be extended to security and surveillance to enhance the tracking
and identification of individuals in crowded or complex environments. Finally, advances in 3D human pose
estimation using machine learning will revolutionize the way we interact with technology, entertainment and
education, and monitor and improve human well-being in various applications.

3. LITERATURE REVIEW

Thе literature survey lеd for this study investigates еxisting works and ongoing еxploration lеd in the fiеld of "3D
Human Posturе Assеssmеnt utilizing AI ''. This part plans to give an exhaustive outline of thе cutting edge strategies
and systеms, recognize holes in thе flow research, and establish thе groundwork for thе proposed arrangement. In
this part, wе arrangе and talk about еxisting chips at "3D Human Posе еstimation using ML". Each word is given its
comparison segment numbеr.
3.1 MONOCULAR 3D HUMAN POSE ESTIMATION
Monocular 3d Human pose estimation in the wild using improved CNN supervision" prеsеnts a significant
advancеmеnt in monocular 3D human posе еstimation using Convolutional Nеural Nеtworks (CNNs). Thе authors
introduced a two-stage framework that lеvеragеs both 2D and 3D information, achieving statе-of-the-art rеsults.
Thе mеthod is sеnsitivе to occlusions and may not pеrform wеll in crowdеd scеnеs. Idеntifiеd Gap: Addrеssing
occlusion handling and improving robustnеss in complеx еnvironmеnts.
3.2 MASK R-CNN
Mask R-CNN addresses the problem of instance segmentation, which involves not only detecting objects in an
image but also segmenting them at the pixel [Link] primary contribution of Mask R-CNN is the integration of
instance segmentation into the existing object detection framework. It achieves state-of-the-art results on instance
segmentation tasks while maintaining competitive object detection performance.
3.3 VOXELPOSE TO GAUGE 3D STANCES

VoxelPose to gauge 3D stances of numerous individuals from different camera sees. VoxelPose straightforwardly
works in the 3D space and, consequently, tries not to settle on mistaken choices in every camera view. In particular,
they initially anticipate 2D posture heatmaps for all perspectives (the stage one), then twist the heatmaps to a typical
3D space and develop a component volume took care of into a Cuboid Proposition Organization (CPN) to confine
all individuals examples (the stage two), and, at last, build a better grained highlight volume and gauge a 3D posture
for every proposition (the last stage).

4. METHODOLOGY
This project will mainly concentrate on creating and assessing a machine learning-based 3D human pose estimation
model. Our project scope encompasses tasks such as gathering data, preparing the data, developing the model,
training it, and evaluating its performance. It's important to recognize that the field of 3D pose estimation is
extensive, and our project may not encompass all its complexities.

21804 [Link] 1899

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

4.1 Convolutional Neural Network

Convolutional neural networks, a kind of artificial neural networks, have completely changed the way that image
processing and computer vision are researched. It is widely utilized in many different applications, including face
identification, image segmentation, and object recognition. The biological framework of the human brain's visual
cortex serves as the foundation for CNN's architectural design. Among the layers that make up this system are the
input layer, convolutional layer, pooling layer, and fully connected layer. The raw data from the input layer, which is
a picture, is subjected to a number of filters in the convolutional layer. A variety of elements in the image, including
edges, corners, and forms, are detected using the filters. The pooling layer is used to lower the output's
dimensionality and improve its computational efficiency after the convolution layer’s output. The classification
process is then carried out using the identified features and the completely linked layer. Convolutional filters are
used in CNN's operating system to extract information from an image. The input image is compressed with the
filters, which are micro matrices, to create a feature map. A specific feature's activation at a certain spot in the image
is represented by the feature map. The best combination of filters to utilize to identify different features in an image
are learned by the CNN during the training phase.

5. PROPOSED WORK

The purpose of 3D human pose estimation using machine learning is to accurately and automatically determine the
3D positions and orientations of joints and parts of the human body from 2D images and video frames. This
technology is especially valuable in fields as diverse as computer vision, robotics, and human-computer interaction,
as it allows machines to understand and interpret human body movements as well as gesture recognition, virtual
reality, and biomechanics. Can be used for applications such as analysis. The steps involved in 3D human pose
estimation usually include several key steps. First, the process begins with data collection, where a dataset of images
or video frames containing people is collected. Next, a preprocessing step may be required to normalize and enrich
the input data. Then, feature extraction techniques are used to identify important parts of the human body, such as
joint locations. A machine learning model, often a deep neural network such as a convolutional neural network
(CNN) or a recurrent neural network (RNN), is trained on this data to learn the relationship between a two-
dimensional state and the corresponding three-dimensional state. Once the model is trained, it is applied to new
images or video frames to predict the 3D pose of a human. Post-processing techniques can be used to adjust and
smooth the estimated state.

5.1 SINGLE-PERSON HUMAN POSE ESTIMATION

Single-person 3D pose estimation is divided into 2-step and 1-step categories. The method is as shown in Figure
3.1a. The two-step method consists of obtaining 2D joint positions using a 2D key point detection model. Convert
2D key points to 3D key points using deep learning techniques. Such an approach in the first stage, suffers from the
ambiguity of depth inherent in the second key stage. A problem that many works aim to solve. One-step method
means three-dimensional regression obtains detailed positions directly from static images. These methods require a
lot of training data 3D annotation is available, but manual annotation is expensive and tedious. A standard method
for single and multi-person 3D pose estimation. From the input 2D image and the predicted 3D human pose in (a)
are obtained from the sample and GT.3.6M human dataset.

21804 [Link] 1900

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

FIGURE 3.1 Single-person 3D estimation

5.2 DIRECT REGRESSION

This method directly maps the joints and joints of the body. Features of the human body model. If the model is
familiar, If you predict the 17 important points of a certain person, the result will be a 2.17 vector containing X and
Y coordinates. All the expected signs are shown in Figure 3.2 below.

FIGURE 3.2 Direct Regression

5.3 HEATMAP REGRESSION

Heat map regression is widely used for 2D humans position estimation and grouping of key locations: for Example -
hands, face, body. With heat map in frameworks, pixel values are usually used as follows: Probability that the
corresponding pixels are mapped milestones in this framework. By adopting this technique, It is easy to practice and
can achieve pixel-level accuracy.

6. ADVANTAGES

This tеchniquе lеvеragеs thе capabilitiеs of dееp lеarning modеls to accuratеly prеdict thе thrее-dimеnsional
positions of human body joints and kеypoints.3D human posе еstimation using machinе lеarning offеrs thе promisе
of highly accuratе and adaptablе rеsults in divеrsе rеal-world scеnarios. Whilе bеnеfiting from еnd-to-еnd lеarning,
tеmporal insights, and scalability, this approach dеmands substantial data, computational rеsourcеs, and
considеration of еthical and anatomical challеngеs. As tеchnology advancеs, thе potеntial for applications in fiеlds
likе sports, hеalthcarе, and animation rеmains compеlling, undеrscoring thе significancе of a thoughtful and contеxt-
awarе approach to implеmеntation.

21804 [Link] 1901

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

7. CONCLUSION

The success of our 3D pose estimation model can be attributed to several important factors. The variety and size of
the dataset played an important role in training a robust model. Larger and more diverse data sets allow the model to
better generalize to different situations and conditions. The combination of CNN and recurrent layers allows the
model to capture both spatial and temporal information, improving pose estimation accuracy. The computational
efficiency of the model is a major advantage. It opens up opportunities for real-time applications in areas such as
gaming and augmented reality. Continuous optimization of model inference speed is beneficial.

6. REFERENCES

[1]. [Link]

[2]. [Link]

[3].[Link]

[4].[Link]

21804 [Link] 1902

Deep Learning in Human Pose Estimation
No ratings yet
Deep Learning in Human Pose Estimation
16 pages
Advances in Human Pose Estimation
No ratings yet
Advances in Human Pose Estimation
5 pages
Human Pose Estimation with GNN Techniques
No ratings yet
Human Pose Estimation with GNN Techniques
5 pages
Human Pose Estimation: Deep Learning Benchmark
No ratings yet
Human Pose Estimation: Deep Learning Benchmark
7 pages
Real-Time 3D Human Posture Estimation
No ratings yet
Real-Time 3D Human Posture Estimation
5 pages
Advances in Human Pose Estimation
No ratings yet
Advances in Human Pose Estimation
11 pages
Advances in Human Pose Estimation
No ratings yet
Advances in Human Pose Estimation
4 pages
3D Pose Estimation Research Proposal
100% (1)
3D Pose Estimation Research Proposal
18 pages
Diplomarbeit Lassner
No ratings yet
Diplomarbeit Lassner
115 pages
2D and 3D Human Pose Estimation Guide
No ratings yet
2D and 3D Human Pose Estimation Guide
84 pages
Comprehensive Survey on Human Pose Estimation
No ratings yet
Comprehensive Survey on Human Pose Estimation
30 pages
Advances in Human Pose Estimation
No ratings yet
Advances in Human Pose Estimation
13 pages
3D Human Pose Estimation with GANs
No ratings yet
3D Human Pose Estimation with GANs
7 pages
3D Human Pose Estimation Using Heatmaps
No ratings yet
3D Human Pose Estimation Using Heatmaps
5 pages
Human Pose Estimation via Body Tracking
No ratings yet
Human Pose Estimation via Body Tracking
35 pages
Human Pose Estimation with CNNs
No ratings yet
Human Pose Estimation with CNNs
10 pages
Human Pose Estimation Project Report
No ratings yet
Human Pose Estimation Project Report
23 pages
Real-Time Human Pose Estimation Techniques
No ratings yet
Real-Time Human Pose Estimation Techniques
27 pages
Virtual Fitness Assistance via Pose Estimation
No ratings yet
Virtual Fitness Assistance via Pose Estimation
7 pages
Joint Learning for Human Pose Estimation
No ratings yet
Joint Learning for Human Pose Estimation
6 pages
Human Pose Estimation with Deep Learning
No ratings yet
Human Pose Estimation with Deep Learning
6 pages
MediaPipe Pose for 3D Human Estimation
No ratings yet
MediaPipe Pose for 3D Human Estimation
21 pages
Human Pose Estimation Overview
No ratings yet
Human Pose Estimation Overview
13 pages
3D Human Pose Recovery from Images
No ratings yet
3D Human Pose Recovery from Images
15 pages
Human Pose Estimation Project Report
No ratings yet
Human Pose Estimation Project Report
10 pages
Depth Imaging for Human Posture Recognition
No ratings yet
Depth Imaging for Human Posture Recognition
18 pages
3D Human Pose Estimation with Multi-Camera
No ratings yet
3D Human Pose Estimation with Multi-Camera
7 pages
3D Human Pose Estimation Review
No ratings yet
3D Human Pose Estimation Review
28 pages
3D Pose Estimation from RGB Images
No ratings yet
3D Pose Estimation from RGB Images
10 pages
Advances in Human Pose Estimation Research
No ratings yet
Advances in Human Pose Estimation Research
8 pages
Multi-Person 3D Pose Estimation Model
No ratings yet
Multi-Person 3D Pose Estimation Model
10 pages
Human Pose Estimation Project Report
No ratings yet
Human Pose Estimation Project Report
14 pages
Human Pose Estimation Overview and Applications
No ratings yet
Human Pose Estimation Overview and Applications
40 pages
Human Modelling & Pose Estimation Insights
No ratings yet
Human Modelling & Pose Estimation Insights
13 pages
Transformer-Based Human Pose Estimation
No ratings yet
Transformer-Based Human Pose Estimation
16 pages
Human Pose Estimation Overview
No ratings yet
Human Pose Estimation Overview
24 pages
Body Posture Detection for Rehabilitation
No ratings yet
Body Posture Detection for Rehabilitation
20 pages
Metric Learning for 3D Pose Estimation
No ratings yet
Metric Learning for 3D Pose Estimation
12 pages
Human Pose Estimation Techniques
No ratings yet
Human Pose Estimation Techniques
12 pages
Survey on Single-Person Pose Estimation
No ratings yet
Survey on Single-Person Pose Estimation
16 pages
DensePose: Human Pose Estimation Overview
No ratings yet
DensePose: Human Pose Estimation Overview
10 pages
AlphaPose: Real-Time Multi-Person Tracking
No ratings yet
AlphaPose: Real-Time Multi-Person Tracking
17 pages
Deep Learning for Human Pose Estimation
No ratings yet
Deep Learning for Human Pose Estimation
11 pages
Enhancing MHFormer for 3D Pose Estimation
No ratings yet
Enhancing MHFormer for 3D Pose Estimation
10 pages
Human Pose Estimation Techniques Review
No ratings yet
Human Pose Estimation Techniques Review
1 page
Multitask Deep Learning for Pose Estimation
No ratings yet
Multitask Deep Learning for Pose Estimation
10 pages
Human Pose Estimation with CNNs
No ratings yet
Human Pose Estimation with CNNs
7 pages
Deep Learning for 2D & 3D Pose Estimation
No ratings yet
Deep Learning for 2D & 3D Pose Estimation
68 pages
MediaPipe Workout Monitoring System
No ratings yet
MediaPipe Workout Monitoring System
8 pages
Human Pose Estimation with AI Techniques
No ratings yet
Human Pose Estimation with AI Techniques
6 pages
Human Pose Estimation in Sports Analysis
No ratings yet
Human Pose Estimation in Sports Analysis
12 pages
Structured Deep Learning Supported With Point Cloud For 3D Human Pose Estimation
No ratings yet
Structured Deep Learning Supported With Point Cloud For 3D Human Pose Estimation
6 pages
Self-Attention Network for Pose Estimation
No ratings yet
Self-Attention Network for Pose Estimation
14 pages
Human3.6M: 3D Human Pose Dataset
No ratings yet
Human3.6M: 3D Human Pose Dataset
15 pages
Lightweight Human Pose Estimator LiPE
No ratings yet
Lightweight Human Pose Estimator LiPE
11 pages
Human Pose Estimation with CNNs Report
No ratings yet
Human Pose Estimation with CNNs Report
8 pages
cv人体姿态识别综述
No ratings yet
cv人体姿态识别综述
37 pages
OpenThermalPose: YOLOv8-Pose Dataset
No ratings yet
OpenThermalPose: YOLOv8-Pose Dataset
8 pages
Data Science Laboratory Course Plan
No ratings yet
Data Science Laboratory Course Plan
10 pages
CSE Paper on Similarity Index Analysis
No ratings yet
CSE Paper on Similarity Index Analysis
6 pages
OOP Principles in Software Engineering
No ratings yet
OOP Principles in Software Engineering
96 pages
Requirements Analysis and Specification
No ratings yet
Requirements Analysis and Specification
24 pages
Foundations of Data Science Overview
No ratings yet
Foundations of Data Science Overview
21 pages
FDS Paper Model Exam Overview
No ratings yet
FDS Paper Model Exam Overview
2 pages
Understanding Pre-training in NLP
No ratings yet
Understanding Pre-training in NLP
5 pages
LF Plus User Manual V1.2
No ratings yet
LF Plus User Manual V1.2
86 pages
Duty by René Le Senne: 1950 Edition
No ratings yet
Duty by René Le Senne: 1950 Edition
14 pages
Hypotheses Testing (JAMOVI) - RM
No ratings yet
Hypotheses Testing (JAMOVI) - RM
16 pages
LAC Report: Resources for Struggling Readers
No ratings yet
LAC Report: Resources for Struggling Readers
3 pages
FCFS and Round Robin Scheduling in C
No ratings yet
FCFS and Round Robin Scheduling in C
11 pages
Main Switchboard Specifications H2451
No ratings yet
Main Switchboard Specifications H2451
146 pages
Inclusive Marketing Strategies for Diversity
No ratings yet
Inclusive Marketing Strategies for Diversity
4 pages
Writing Scientific Articles: AIMRADC Guide
No ratings yet
Writing Scientific Articles: AIMRADC Guide
42 pages
Astronomy Distance Measurement Methods
No ratings yet
Astronomy Distance Measurement Methods
23 pages
KSB Centrifugal Pump Design
100% (6)
KSB Centrifugal Pump Design
47 pages
FRAM MCUS For Dummies Part 2
No ratings yet
FRAM MCUS For Dummies Part 2
5 pages
Chemical Engineering Magazine 2017.08
No ratings yet
Chemical Engineering Magazine 2017.08
68 pages
Livelihood Programs for Disabled Individuals
No ratings yet
Livelihood Programs for Disabled Individuals
13 pages
2018 01 Green Urban Area
No ratings yet
2018 01 Green Urban Area
16 pages
SCCM Current Branch Upgrade Insights
No ratings yet
SCCM Current Branch Upgrade Insights
21 pages
Axial Load Analysis and Deformation
No ratings yet
Axial Load Analysis and Deformation
48 pages
Fast File System for Unix Overview
No ratings yet
Fast File System for Unix Overview
2 pages
DH-XVR1B16H Digital Video Recorder
No ratings yet
DH-XVR1B16H Digital Video Recorder
3 pages
Understanding Anesthesia Vaporizers
100% (1)
Understanding Anesthesia Vaporizers
5 pages
Grade 10 Task 2025 (QP) - Trigonometry
100% (1)
Grade 10 Task 2025 (QP) - Trigonometry
4 pages
Catalase Activity Analysis Worksheet
No ratings yet
Catalase Activity Analysis Worksheet
3 pages
Understanding Human Dignity in Education
No ratings yet
Understanding Human Dignity in Education
6 pages
Front Underrun Protection Standards for Trucks
No ratings yet
Front Underrun Protection Standards for Trucks
14 pages
Batch Purification Procedures Guide
No ratings yet
Batch Purification Procedures Guide
1 page
NIOS 12th Pass Certificate - Jignesh Patel
No ratings yet
NIOS 12th Pass Certificate - Jignesh Patel
1 page
EPC Framework for Battery Storage Projects
No ratings yet
EPC Framework for Battery Storage Projects
5 pages
6th State Level Chemistry Test Answer Key
No ratings yet
6th State Level Chemistry Test Answer Key
21 pages
4As Lesson Plan Template Guide
No ratings yet
4As Lesson Plan Template Guide
2 pages
Student Analytics System Overview
No ratings yet
Student Analytics System Overview
4 pages

3D Human Pose Estimation with CNN

Uploaded by

3D Human Pose Estimation with CNN

Uploaded by

Vol-9 Issue-5 2023 IJARIIE-ISSN(O)-2395-4396

3D HUMAN POSE ESTIMATION USING

21804 [Link] 1898

21804 [Link] 1899

4.1 Convolutional Neural Network

5.1 SINGLE-PERSON HUMAN POSE ESTIMATION

21804 [Link] 1900

FIGURE 3.1 Single-person 3D estimation

5.2 DIRECT REGRESSION

FIGURE 3.2 Direct Regression

5.3 HEATMAP REGRESSION

21804 [Link] 1901

21804 [Link] 1902

You might also like