Chapter 1
1.1 Introduction
Considering that an estimated 83% of the sensory information we receive is derived from visual perception, it is
clear that vision plays a crucial role in human physiology [1].
According to 2011 WHO statistics, around 285 million people worldwide are visually impaired;
of these, 39 million are blind and 246 million have low vision [2].
Object recognition glasses can enhance social inclusion for blind individuals by allowing greater
participation in daily activities. These devices reduce reliance on others, boosting confidence and
independence [3].
To help individuals with visual impairments recognize objects, real-time detection frameworks such as YOLO or
SSD can provide immediate feedback, Mask R-CNN can support scene description, and combining Optical
Character Recognition (OCR) with Text-to-Speech (TTS) enables text reading. LiDAR can further enhance spatial
awareness, and gesture or voice commands can improve user interaction. Compliance with the GDPR and
privacy-preserving AI measures are also essential. Candidate recognition methodologies include feature-based
techniques (such as SIFT and SURF), deep learning models (such as CNNs and Vision Transformers), and hybrid
approaches for greater accuracy; transfer learning can additionally adapt models to personalized environments.
Deep learning is the predominant method in current practice [4].
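As a purely illustrative sketch (not the system proposed here), the following Python loop shows the detection-to-speech pattern recommended above. It assumes a webcam, the ultralytics YOLO package, and the pyttsx3 offline TTS engine; the model file and confidence threshold are arbitrary placeholder choices.

```python
# Illustrative sketch only: real-time object detection with spoken feedback.
# Assumes: pip install ultralytics pyttsx3 opencv-python, a connected webcam,
# and the pretrained "yolov8n.pt" weights (an assumption, not the project's model).
import cv2
import pyttsx3
from ultralytics import YOLO

model = YOLO("yolov8n.pt")    # small pretrained COCO model (placeholder choice)
tts = pyttsx3.init()          # offline text-to-speech engine
cap = cv2.VideoCapture(0)     # default camera

last_spoken = None
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break
    results = model(frame, verbose=False)[0]
    # Keep class names of detections above an arbitrary 0.5 confidence threshold.
    labels = {model.names[int(b.cls)] for b in results.boxes if float(b.conf) > 0.5}
    if labels and labels != last_spoken:
        tts.say(", ".join(sorted(labels)))   # announce what is in front of the user
        tts.runAndWait()
        last_spoken = labels

cap.release()
```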
Various studies using deep learning have produced technologies to assist the visually impaired. For
example, one smart-glasses study used a Raspberry Pi 4, an ESP32-CAM, and YOLOv4, achieving 69.2%
precision with a 3–4 second response time. Another employed DA-Multi-DCGAN on mixed datasets,
achieving 80.21% accuracy. A study combining YOLOv3 with a Raspberry Pi 3 B+ reported 85–95% accuracy,
reaching 100% for specific objects in 50 ms. Another used a Raspberry Pi 3 and CNN models with
PASCAL VOC, achieving 90% accuracy for 20 object classes in 29 ms. Lastly, one study introduced CATNet on
a Raspberry Pi 2, achieving high accuracy for small targets [5].
Detection techniques face challenges in low-light conditions and with overlapping or small objects, and
they consume substantial computing resources. Recognition systems lose accuracy under varying viewing
angles, small changes in lighting or facial expression, unbalanced datasets, and the risk of reverse
attacks. In optical character recognition (OCR), handwritten, small, or distorted text is problematic, as is
handling multiple languages, particularly Arabic. Voice recognition likewise struggles in noisy
environments, with diverse dialects, and with the possibility of voice manipulation, leading to slow
response times and reduced accuracy for quiet speech.
1.2 Research problem
1.3 Goals of the project
1. Develop an affordable and efficient object recognition system using a CNN and the ESP32 to assist blind
individuals in Yemen (a transfer-learning sketch follows this list).
2. Enhance independence and improve daily navigation by enabling real-time identification of objects.
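As a concrete but purely illustrative reading of goal 1, the sketch below shows how transfer learning (mentioned in the introduction) could adapt a pretrained CNN offline to a small, locally collected dataset before deployment. Keras is assumed, and the dataset path, model choice, and hyperparameters are placeholders, not the project's final design.

```python
# Illustrative transfer-learning sketch (assumed approach, not the final design):
# adapt a pretrained MobileNetV2 to a small, locally collected object dataset.
# Assumes TensorFlow/Keras and images sorted into per-class folders under "dataset/".
import tensorflow as tf

train_ds = tf.keras.utils.image_dataset_from_directory(
    "dataset/", image_size=(224, 224), batch_size=16)   # placeholder path
num_classes = len(train_ds.class_names)

base = tf.keras.applications.MobileNetV2(
    input_shape=(224, 224, 3), include_top=False, weights="imagenet")
base.trainable = False                                  # freeze pretrained features

model = tf.keras.Sequential([
    tf.keras.layers.Rescaling(1.0 / 127.5, offset=-1),  # MobileNetV2 expects [-1, 1]
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(num_classes, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy", metrics=["accuracy"])
model.fit(train_ds, epochs=5)                           # hyperparameters are placeholders
```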
1.4 Methodology
1.5 Limitations of the project
The project faces challenges with data availability and quality, as limited datasets may not cover all
relevant objects and environments, and real-world variations in lighting and angles make training difficult.
1.6 Components of the project
1. ESP32
The ESP32 is a versatile, low-power, dual-core microcontroller developed by Espressif Systems,
designed for a wide range of applications, particularly in IoT (Internet of Things). It integrates Wi-Fi and
Bluetooth (both Classic and BLE) connectivity, making it ideal for wireless communication in embedded
systems. The ESP32 is widely used in devices such as smart home systems, wearables, and
automation products[6].
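As a minimal sketch only, the MicroPython snippet below (one of several ways to program the ESP32) shows the board joining a Wi-Fi network, the connectivity feature highlighted above. The SSID and password are placeholders, and MicroPython firmware on the board is an assumption.

```python
# Minimal MicroPython sketch: join a Wi-Fi network so the ESP32 can exchange
# data with a recognition server. Credentials below are placeholders.
import network
import time

wlan = network.WLAN(network.STA_IF)    # station (client) mode
wlan.active(True)
wlan.connect("HOME_SSID", "PASSWORD")  # placeholder credentials

while not wlan.isconnected():          # wait until an IP address is assigned
    time.sleep(0.5)
print("connected, IP:", wlan.ifconfig()[0])
```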
2. ESP32-CAM
The ESP32-CAM is a compact camera module featuring the ESP32-S chip and the OV2640 camera,
costing about $10. It includes a microSD card slot for storing images and files. Measuring
27 × 40.5 × 4.5 mm, with a deep-sleep current of 6 mA, it can operate independently. It suits a variety of IoT
applications, including smart home devices, industrial control, and wireless monitoring [7].
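As a hedged illustration of how a host computer might pull frames from the module, the sketch below assumes the ESP32-CAM runs a web-server firmware (such as the stock CameraWebServer example) exposing a still-capture endpoint; the IP address is a placeholder.

```python
# Illustrative host-side sketch: grab a still image from an ESP32-CAM over Wi-Fi.
# Assumes the board runs a web-server firmware (e.g. the stock CameraWebServer
# example) exposing a /capture endpoint; the IP address is a placeholder.
import requests

ESP32_CAM_URL = "http://192.168.1.50/capture"   # placeholder address

resp = requests.get(ESP32_CAM_URL, timeout=5)
resp.raise_for_status()
with open("frame.jpg", "wb") as f:
    f.write(resp.content)                        # raw JPEG bytes from the OV2640
print("saved", len(resp.content), "bytes")
```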
3. Headphones
Headphones are a pair of small loudspeaker drivers worn on or around the head over a user's ears.
They are electroacoustic transducers, which convert an electrical signal into a corresponding sound [8].
4. Ultrasonic sensor
Ultrasonic sensors measure distance by emitting sound pulses and timing the reflected echoes; in UAVs,
for example, they are used to measure ground distance for altitude control.
They have a range of up to four meters and are generally unaffected by environmental factors, although
noise and airflow can influence them. Despite these limitations, larger versions could help helicopters
detect obstacles such as wires [9].
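The following is a minimal MicroPython sketch of the pulse-and-echo principle described above, assuming an HC-SR04-style sensor and the placeholder pin wiring shown; the distance is half the round-trip echo time multiplied by the speed of sound.

```python
# Minimal MicroPython sketch for an HC-SR04-style ultrasonic sensor on an ESP32.
# Pin numbers and the sensor model are assumptions; distance follows from
# echo time: distance = (time_of_flight * speed_of_sound) / 2.
from machine import Pin, time_pulse_us
import time

trig = Pin(5, Pin.OUT)    # trigger pin (placeholder wiring)
echo = Pin(18, Pin.IN)    # echo pin (placeholder wiring)

def distance_cm():
    trig.off()
    time.sleep_us(2)
    trig.on()                           # 10 us trigger pulse starts a measurement
    time.sleep_us(10)
    trig.off()
    t = time_pulse_us(echo, 1, 30000)   # echo high time in us (30 ms timeout)
    if t < 0:
        return None                     # out of range / timeout
    return (t * 0.0343) / 2             # sound travels ~0.0343 cm/us, out and back

while True:
    print("distance:", distance_cm(), "cm")
    time.sleep(0.2)
```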
Chapter 2
The project on "Smart Glasses" for blind people aimed to assist visually impaired individuals in
education and daily life. The system could scan printed text, convert it to audio, and translate English
to Arabic using the Google Translate API. It also used RFID technology to help users locate specific
places like classrooms, along with ultrasonic sensors for better image capture. However, the project
faced numerous issues. The initial design was bulky and impractical, requiring a complete redesign
for better usability. There were compatibility issues between the camera, NOOBS operating system,
and Raspberry Pi model B+, forcing the team to switch to different hardware. Additionally, the RFID
sensor had a limited range, reducing its effectiveness. Time constraints also hindered the
implementation of all planned features, leaving the project incomplete in certain aspects.
The AI-powered smart glasses for the blind and visually impaired aimed to improve navigation and
social interaction by using deep learning techniques, specifically the Faster R-CNN, for object and face
recognition. The system provided voice-based assistance, enabling users to recognize their
surroundings. However, the project had several shortcomings. It focused only on object and face
recognition, overlooking crucial features like text reading and complex navigation. Moreover, the
absence of user testing raised concerns about its real-world effectiveness. The lack of technical
details on implementation and performance metrics made it difficult to assess its practicality.
Additionally, the project’s oversimplified presentation failed to acknowledge the complexities of
training deep learning models and processing real-time data, which are essential for such a system to
function effectively.
Another project, "My Eyes—Smart Glasses for Blind People," provided a cost-effective wearable
solution for visually impaired individuals. It integrated Raspberry Pi, a camera, and earpieces to assist
users in reading tasks and navigating their environment. The glasses combined text-to-speech
conversion, obstacle detection, and face recognition to enhance accessibility. While the project
showed promise, it had several limitations. The bulky design made prolonged use uncomfortable, and
its reliance on an internet connection posed challenges in areas with poor connectivity. The accuracy
of object recognition was not fully reliable in complex environments, and the system had a learning
curve that might discourage some users. Additionally, the glasses had limited battery life, potential
technical malfunctions, and privacy concerns that were not addressed in the study.
A similar project, "AI-Based Smart Glasses for Visually Impaired Individuals," focused on enhancing
accessibility in shopping environments. It used a Raspberry Pi 4, a camera module, and YOLOv5 deep
learning algorithms for real-time object classification and text recognition. The system provided audio
output in multiple languages to cater to user preferences. However, it had several drawbacks.
Language support was limited to English and Tamil, restricting its usability for a broader audience.
The project was dependent on specific hardware, the EPSON BT-300 smart goggles, reducing its
adaptability. There was also a noticeable delay in speech output, affecting user experience.
Additionally, the system only focused on object recognition and could not assist in navigation or
product comparison. It struggled in dynamic environments, had difficulties in indoor settings due to
poor GPS coverage, and lacked integration with environmental sensors, limiting its overall
effectiveness.
The smart glasses project for visually impaired individuals aimed to provide assistance in various daily
tasks, including text reading. It was designed as a cost-effective, wearable solution using a Raspberry Pi
2. The system offered audio feedback and demonstrated good text recognition accuracy, particularly
with larger fonts. However, it had notable limitations. Despite being designed for multiple tasks, it only
implemented a single reading mode, limiting its practical applications. The text recognition accuracy
was highly dependent on font size, style, and image clarity, making it less effective for general use. The
system lacked user testing, which is crucial for refining usability. Additionally, the project used Matlab
and Simulink for model design but relied on C++ for implementation on Raspberry Pi, increasing
complexity and the risk of errors. The Raspberry Pi 2’s limited processing power further constrained
performance, reducing the feasibility of adding advanced features.
Another project, "IoT-Based Smart Glasses with Facial Recognition for People with Visual
Impairments," proposed a low-cost assistive technology that used a Raspberry Pi 4, a camera module,
and an ultrasonic sensor for facial recognition and obstacle detection. It provided real-time assistance
by identifying people and measuring distances to avoid obstacles. The project had several advantages,
including affordability and the use of widely available technology. However, it was limited in
functionality, supporting only facial recognition and distance detection, without navigation features.
The small size of the Raspberry Pi’s SD card posed challenges in expanding capabilities. There were also
concerns regarding power management, as the project did not detail how it would sustain long-term
use. Additionally, the study lacked information on the accuracy of the recognition and distance
measurement systems, raising questions about its reliability. The absence of a defined user interface
also made it unclear how users would interact with the system effectively.
Lastly, the "AI-Powered Smart Glasses" project aimed to enhance mobility for blind individuals by
integrating computer vision, deep learning, and speech processing. The system could detect obstacles,
recognize faces, and read text using Optical Character Recognition (OCR). It provided voice feedback to
guide users, improving independence and accessibility. The project had several advantages, including
enhanced safety through early obstacle detection, increased mobility, and a user-friendly voice-based
interface. However, it faced some challenges. Language support was limited, restricting usability for
non-English speakers. The accuracy of the ultrasonic sensor was not ideal for detecting objects at short
distances, and the system’s performance was highly dependent on lighting conditions. Additionally, the
Raspberry Pi’s processing power was limited, making it difficult to handle complex tasks efficiently.
Despite these shortcomings, the project showed promise in developing assistive technology for visually
impaired individuals, with potential for future improvements.
Overall, while each of these projects contributed to the advancement of smart glasses for the visually
impaired, they all had notable limitations. Some struggled with hardware constraints, while others
lacked key functionalities such as navigation, user testing, or real-world adaptability. Addressing these
issues in future iterations could significantly enhance their effectiveness and accessibility.
The core of that system is a Faster Region-based Convolutional Neural Network (Faster R-CNN) for
object and face recognition: captured images are analyzed, and the results are converted into audio
for the user. The authors acknowledge that the system is still at the prototype stage but consider it
promising for future development [16].
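Since the cited paper does not publish implementation details, the following sketch only approximates the described recognize-then-speak pipeline, using torchvision's pretrained Faster R-CNN as a stand-in detector and pyttsx3 for the audio output; the input file name and score threshold are placeholders.

```python
# Approximate sketch of the pipeline in [16]: detect objects in a captured image
# with a pretrained Faster R-CNN, then speak the result. Stand-in components only.
import torch
import torchvision
from torchvision.io import read_image
from torchvision.transforms.functional import convert_image_dtype
import pyttsx3

weights = torchvision.models.detection.FasterRCNN_ResNet50_FPN_Weights.DEFAULT
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights=weights)
model.eval()

img = convert_image_dtype(read_image("frame.jpg"), torch.float)  # placeholder image
with torch.no_grad():
    out = model([img])[0]     # dict with "boxes", "labels", "scores"

names = weights.meta["categories"]   # COCO class names bundled with the weights
found = {names[int(l)] for l, s in zip(out["labels"], out["scores"])
         if float(s) > 0.7}          # arbitrary score threshold

message = "I can see " + ", ".join(sorted(found)) if found else "nothing recognized"
tts = pyttsx3.init()
tts.say(message)
tts.runAndWait()
```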
[1] A. Raj, M. Kannaujiya, I. Bhardwaj, A. Bharti, and R. Prasad, "Model for object detection using computer vision and machine learning for decision making," International Journal of Computer Applications, vol. 181, no. 43, Mar. 2019, doi: 10.5120/ijca2019918516.
[2] World Health Organization, "Visual impairment and blindness," Fact Sheet No. 282. http://www.who.int/mediacentre/factsheets/fs282/en. Accessed Oct. 2015.
[3] G. Douglas, C. Corcoran, and S. Pavey, "The role of assistive technology in the lives of blind and partially sighted people," Visual Impairment Research, 2006.
[4] Redmon et al., "You Only Look Once: Unified, Real-Time Object Detection," 2016; Liu et al., "SSD: Single Shot MultiBox Detector," 2016; He et al., "Mask R-CNN," 2017; Smith, "An Overview of the Tesseract OCR Engine," 2007; Apple's iPhone 12 Pro with LiDAR scanner; gesture recognition (Leap Motion, Kinect) and voice commands (Google Assistant, Siri, Alexa); the General Data Protection Regulation (GDPR); McMahan et al., "Communication-Efficient Learning of Deep Networks from Decentralized Data," 2017; Yosinski et al., "How transferable are features in deep neural networks?," 2014.
[6] Espressif Systems, "ESP32." https://www.espressif.com/en/products/socs/esp32
[7] SunFounder, "ESP32-CAM." https://docs.sunfounder.com/projects/galaxyrvr/en/latest/hardware/cpn_esp_32_cam.html
[8] "Headphones," Wikipedia. https://en.m.wikipedia.org/wiki/Headphones
[9] "Ultrasonic sensor," ScienceDirect Topics. https://www.sciencedirect.com/topics/engineering/ultrasonic-sensor
[10] J. Saiteja, "Ultrasonic Smart Goggles for Blind People," Sathyabama Institute of Science and Technology, 2022.
[11] S. G. Gollagi, "An innovative smart glasses for blind people using artificial intelligence," Indonesian Journal of Electrical Engineering and Computer Science, 2023.
[12] E. A. Hassan, "Smart Glasses for the Visually Impaired People," Universiti Teknologi PETRONAS, 2016.
[13] S. Choudhary, "IoT Based Smart Glasses with Facial Recognition for People with Visual Impairments," SSRG International Journal of Electrical and Electronics Engineering, 2023.
[14] R. Sweatha and S. Sathiya Priya, "YOLOv5 driven smart glasses for visually impaired," International Journal of Science and Research Archive, 2024.
[15] S. Jha, N. Shetty, and N. Shinde, "My Eyes - Smart Glasses for Blind People," Electronics Department, Atharva College of Engineering, 2024.
[16] M. Ananthi, R. Bharathi, M. Gayathri, G. Gokul, M. Sivakumar, and B. Vaishnavi, "AI-Powered Smart Glasses for the Blind and Visually Impaired," International Journal of Innovative Technology and Exploring Engineering, vol. 12, no. 9, pp. 4007–4012, 2023.