PPPPR 3
PPPPR 3
on
OBJECT DETECTION FOR BLIND
PERSON
Submitted in partial fulfillment for award of
BACHELOR OF TECHNOLOGY
Degree
In
COMPUTER SCIENCE & ENGINEERING
2022-2023
Under the Guidance of: Submitted By:
Mr.Sanjay Srivastava Sakshi khandelwal(1900330100189)
Assistant Professor Yash Varshney (1900330100254)
Pradumn Singh ( 1900330100161)
Sadik Ali (2000330109009)
5. Details of Students:
S.No. Univ. Roll No. Name Role Signature
6. Project Supervisor:
Mr.Sanjay Srivastava
v
SYNOPSIS
Object detection is a well-known computer technology connected with computer vision and image
processing.With the advent of deep learning techniques, the accuracy for object detection has increased
drastically.It focuses on detecting objects or its instances of a certain class (such as humans, flowers,
animals) in digital images and videos.
Many problems in computer vision were saturating on their accuracy before a decade. However, with
the rise of deep learning techniques, the accuracy of these problems drastically improved. One of the
major problem was that of image classi cation, which is de ned as predicting the class of the image. A
slightly complicated problem is that of image localiza-tion, where the image contains a single object and
the system should predict the class of the location of the object in the image. The more complicated
problem (this project), of object detection involves both classi cation and localization.
An accurate and efficient object detection system has been developed which achieves comparable
metrics with the existing state-of-the-art system. This project uses recent techniques in the eld of
computer vision and deep learning. Custom dataset was created using labelling and the evaluation was
consistent. This can be used in real-time applications which require object detection for preprocessing
in their pipeline. An important scope would be to train the system on a video sequence for usage in
tracking applications. Addition of a temporally consistent network would enable smooth detection and
more optimal than per-frame detection.
The goal of the this project is to model an object detector to detect objects for visually impaired people
and other commercial purposes by recognizing the objects at a particular distance. Available old
techniques for object detection needed large training data it takes more time and its quite complicated
and it’s a difficult task. With the help of object detection system and navigation system the blind person
can easily detect the object through cameras or through some sensors.
Object detection algorithms typically leverage machine learning to produce meaningful results. When
humans look at images or video, we can recognize and locate objects of interest within a matter of
moments. Getting ito iuse imodern iobject idetection imethods iin iapplications iand isystems, ias iwell
ias ibuilding inew iapplications ibased ion ithese imethods iis inot ia istraight iforward itask.
vi
This iproject itook iuse iof iseveral isoftware ilibraries, ipackages iand iprograms ito iutilize imachine
ilearning. iPython iwas ithe ichoice iof iprogramming ilanguage, iand iTensorFlow iwas iused ifor ithe
ideep ilearning icomputations, iwhich iin iturn ihas ia ilist iof idependencies. iTensorFlow ioffers ia
iversion ifor iCPU iusage iand ianother ifor iGPU, ithis iproject iused ithe iGPU iversion. i
Computer vision is still a developing discipline, it has not been matured to that level where it can be
applied directly to real life problems. After few years computer vision and particularly the object
detection would not be any more futuristic and will be ubiquitous. For now, we can consider object
detection as a sub-branch of machine learning.
vii
TABLE OF CONTENTS
SYNOPSIS ii
LIST OF TABLES vii
LIST OF FIGURES viii
1. INTRODUCTION 1
1.1 Problem Statement 1
1.2 Motivation 2
1.3 Objective 2
1.4 Scope 3
1.5 Tools
1.5.1 Visual Studio
1.5.2 MySQL 4
1.6 Background and related work
2. HARDWARE AND SOFTWARE REQUIREMENTS 8
2.1 Hardware Requirements 9
2.2 Software Requirements 9
3. SDLC METHODOLOGIES 10
3.1 Different SDLC Methodologies Used 11
3.2 How It Affects Our Model 13
4. SOFTWARE REQUIREMENT SPECIFICATION AND 14
ANALYSIS
4.1 Introduction 15
4.1.1 Functional Requirements 17
4.1.2 Non- Functional Requirements 18
19
4.1.3 Interface Requirements
4.1.4 Performance Requirements
4.2 Syste
4.3 Overall Description
4.3.1 Product Perspective
5. RISK ASSESSMENT 22
5.1 Outliers 22
5.2 Feature Selection 23
5.3 Risk Assessment Service 25
vii
i
6. Project 26
6.1 ER Diagram 26
7. PROJECT MODULES DESIGN/SOFTWARE FEATURES 27
7.1 Software Features 27
8. TESTING AND EVALUATION 30
9. PROJECT SNAPSHOTS 36
10. LIMITATION 38
11. FUTURE SCOPE 39
12. CONCLUSION 40
13. REFERENCES 42
14 RESEARCH PAPER
ix
LIST OF TABLES
Chapter Table No. Title Page No.
i
LIST OF FIGURES
One of the most difficult problems in image processing is detecting particular objects in an image.
For a human observer it is very easy to identify any object, however it is far more difficult for a
machine. Numerous methods exist for detecting objects of known type in a particular environment
or image.
Videos are the sequences of frames that run fast enough to give an effect of continuity as human
eye perceives the frame sequences moving with a particular speed as a video. However, in many
cases, the visual characteristics of the objects are unknown, or it is necessary to detect objects that
are very different from each other.
As far as object detection is concerned techniques of image processing are applied to the frames
in order to identify any change so as to state the motion of object detected through the change
observed in the two consecutive frames after attaining the results of the image processing
techniques used for identification purpose.
The main goal of object detection is to scan digital images or real-life scenarios to locate instances
of every object, separate them, and analyze their necessary features for real-time predictions.
Object detecting and tracking has a wide variety of applications in computer vision such as video
compression, video surveillance, vision-based control, human-computer interfaces, medical
imaging, augmented reality, military applications, traffic monitoring and robotics.
A surveillance system can be implemented in three steps. The first step consists of detecting the
objects in motion .Then tracking them and finally High-level interpretation of the ongoing events.
With the help of object detection system and navigation system the blind person can easily detect
the object through cameras or through some sensors. For example, it is very difficult for to find a
particular room in an unfamiliar environment. The basic procedure is to combine data-driven
knowledge in the likelihood terms with clinical knowledge in the prior terms to classify an image
into a pre-determined number of classes.
1.1 Problem Statement
The basic concept behind object detection in videos engrosses the verification of the presence of
an object in image sequences and possibly locating it in particular for recognition. This is done by
solving the temporal correspondence problem, the problem of matching the target region in
successive frames of a sequence of images taken at closely spaced time intervals. The general
problem of motion understanding and tracking of the moving objects is one of the most
interestingly used areas of computer vision
1.2 Motivation
The motive of object detection is to recognize and locate all known objects in a scene. Preferably
in 3D space, recovering pose of objects in 3D is very important for robotic control systems.
1.3 Objective
Moving object detection is important in many real-time image processing applications such as
autonomous robotics, traffic control, and driver assistance and surveillance systems. Usually high
resolution gray-scale images must be processed; since each image pixel may belong to a moving
object, pixel-wise processing is required. Now days, video surveillance is an important and
challenging field in computer vision for both indoor and outdoor environments .The project
Moving object tracking from video sequences is an attempt to study some algorithms, which are
robust for the tracking of mobile but non rigid objects from the image sequences precisely called
video.
The goal of the this project is to model an object detector to detect objects for visually impaired
people and other commercial purposes by recognizing the objects at a particular distance.
Available old techniques for object detection needed large training data it takes more time and its
quite complicated and it’s a difficult task. With the help of object detection system and navigation
system the blind person can easily detect the object through cameras or through some sensors.
It also includes pre and post processing sequences as well which includes noise removal, image
enhancement issues, organization and classification etc. but these issues are not of primary
importance as far as this project is concerned so are not addressed in detail in this project.
1.4 Scope
Object detection is a computer vision technique that allows us to identify and locate objects in an
image or video. With this kind of identification and localization, object detection can be used to
count objects in a scene and determine and track their precise locations, all while accurately
labeling them.The main scope of object detection is to scan digital images or real-life scenarios to
locate instances of every object, separate them, and analyze their necessary features for real-time
predictions. Object detection is a part of the overall data architecture of a company.
1.5 Tools
1.5.1 Visual Studio
Visual Studio is anintegrated development environment(IDE) fromMicrosoft. It is used to
developcomputer programsincludingwebsites,web apps,web servicesandmobile apps. Visual
Studio uses Microsoft software development platforms such asWindows API,Windows
Forms,Windows Presentation
Foundation,Windows StoreandMicrosoft Silverlight. It can produce bothnative codeandmanaged
code. Visual Studio supports 36 differentprogramming languagesand allows the code editor and
debugger to support nearly any programming language, provided a language-specific service
exists.
1.5.2 MySQL
MySQL is an open source relational database management system. For WordPress sites, that
means it helps you store all your blog posts, users, plugin information, etc. It stores that
information in separate “tables” and connects it with “keys”. MYSQL used for the application
database. Insertions, Deletions and changes of the data in the system can do straightforward via
the designed GUI without interacting with the table.
5 Oject detection for Rajeshvaree 2021 Object detection for blind person
blind person with Ravindra Karmarkar with voice guidance account but the
voice guidance whole image is divided into grids.
6 Blind Person Aniket Birambole , 2022 Blind person assistant using SSD
Assistant: Object Pooja algorithm and tensorflow APIs. Very
Detection. Bhagat , Bhavesh first step of image processing is
Mhatre , Prof. Aarti readthe training as well as testing
Abhyankar images
M Y Babu et. al [1] Suggest a prototype system of camera-based assistive object reading. This
system consists of three functional components, screen capture, image processing, audio output.
The screen capture component collects the objects of interest in the form of image or video.The
Data processing system is used for our proposed algorithms, including Region-of-interest to
collects the objects from the cluttered backgrounds or from other surroundings, image localization
to attain the object regions to containing grey scale and object recognition to transform the image-
based readable codes. The audio output component is to inform the visually impaired persons of
recognised text codes.The 3.5 mm audio jack is interfaced with the board which receives as a
speech.
S Shaikh et al. [2] Suggest two cameras placed on blind person's glasses, GPS free service, and
ultrasonic sensors are employed to provide information about the surrounding environment.
Object detection is used to find objects in the real world such as faces, bicycles, chairs, doors, or
tables that are common in the scenes of a blind. Here, GPS service is used to create groups of
objects based on their locations, and the sensor detects an obstacle at a medium to long distance.
The descriptor of the Speeded-Up Robust Features (SURF) method is optimized to perform the
recognition. The use of two cameras on glasses can be sophisticated.
H. Jabnoun et al [3] Suggest that visual substitution devices accept input from the user’s
surroundings, decipher it to extract information about entities in the user’s environment, and then
transmit that information to the subject via auditory or tactile means or some combination of these
two They are based on rules for converting images into data sensory substitution tactile or auditory
stimuli. These systems are efficient for mobility and localization of objects which is sometimes
with a lower precision.
M. Rahul et al [4] Suggest a smart vision whose objective is to move anywhere in the environment
through a user-friendly interface system. his project mainly focusses on computer vision module.
In these authors made a system which will able to find the obstacle which are near to his head
specially while entering from door.in short it is made to protected his head from getting injury.
This product is design to navigate blind person in any environment and it guides the user about
that object and provide information about that obstacle using buzzer and vibrater as a two-output
mode of the user.
R.R. Karmarkar et al [5] Suggest about mask region-based convolutional network (Mask-RCNN)
is used for utilization for multi-class object detection for sensing the images. Transfer learning,
fine tuning and data augmentation are used to overcome object scale variability, the density of
object. Also, adaptive Mask-RCNN was compared to deep object detection methods. MaskRCNN
is an extended version to faster RCNN that allow an accurate pixel-based segmentation it consist
of two stages feature pyramid network (FPN) and region proposal network (RPN)
A. Birambole et al [6] Suggest attempts about object detection from the image and represented it
by their name and speech. And it also helps the blind people in location and encoded the audio
into 2 channel audios with the help of 3D binaural sound. In these a video is capture with
portable camera device from client side and it is streamed to a server for real time image
recognition with object detection. which mean it identify and follow the same object in
sequence of video frames sometimes video may have some noise. to remove that noise from
frames noise reduction technique is used that improve the image quality and extraction of
object frame is used to detected the object based on color of the moving frame.
CHAPTER 2
HARDWARE AND SOFTWARE REQUIREMENTS
All computer softwares requires the presence of specific hardware parts or additional software
resources. These prerequisites, also referred to as (computer) system requirements, are frequently
employed as a general norm rather than an unbreakable law. The majority of software specifies a
minimum and recommended set of system requirements. System requirements tend to rise over
time due to the rising demand for more processing power and resources in newer versions of
software.
• Pentium i3 Processor
• Hard Disk : 500 GB
• Ultrasonic sensor
• GSM
SDLC METHODOLOGIES
A software development life cycle model (also termed process model) is a pictorial and
diagrammatic representation of the software life cycle. A life cycle model represents all the
methods required to make a software product transit through its life cycle stages. It also
captures the structure in which these methods are to be undertaken. In other words, a life cycle
model maps the various activities performed on a software product from its inception to
retirement. Different life cycle models may plan the necessary development activities to phases
in different ways. Thus, no element which life cycle model is followed, the essential activities
are contained in all life cycle models though the action may be carried out indistinct orders in
different life cycle models. During any life cycle stage, more than one activity may also be
carried out.
The waterfall model is the oldest of all SDLC methodologies. It’s linear and straightforward
and requires development teams to finish one phase of the project completely before moving
on to the next. Each stage has a separate project plan and takes information from the previous
stage to avoid similar issues (if encountered). However, it is vulnerable to early delays and lead
to big problems arising for development teams later down the road.
• Data Modeling
• Process Modelling
• Application Generation
In this type of SDLC model testing and the development, the step is planned in parallel. So, there
are verification phases on the side and the validation phase on the other side. V-Model joins by
Coding phase.
Figure 3.4. V-Model4
For Our model, Iterative process starts with a simple implementation of a subset of the software
requirements and iteratively enhances the evolving versions until the full system is implemented.
At each iteration, design modifications are made and new functional capabilities are added. The
basic idea behind this method is to develop a system through repeated cycles (iterative) and in
smaller portions at a time (incremental).The following illustration is a representation of the
Iterative and Incremental model –
Deep learning-based detectors usually produce a redundant set of object bounding boxes
including many duplicate detections of the same object. These boxes are then filtered using
non-maximum suppression (NMS) in order to select exactly one bounding box per object of
interest. This greedy scheme is simple and provides sufficient accuracy for isolated objects but
often fails in crowded environments, since one needs to both preserve boxes for different
objects and suppress duplicate detections. In this work we develop an alternative iterative
scheme, where a new subset of objects is detected at each iteration. Detected boxes from the
previous iterations are passed to the network at the following iterations to ensure that the same
object would not be detected twice. This iterative scheme can be applied to both one-stage and
two-stage object detectors with just minor modifications of the training and inference
procedures.
CHAPTER 4
SOFTWARE REQUIREMENT SPECIFICATION AND
ANALYSIS
4.1 Introduction
The objective of this software requirement specification and analysis is to provide a detailed
description of the requirements and specifications of an object detection system for blind
persons. The system is designed to assist the blind by detecting objects in the user's
surroundings using computer vision and image processing techniques. The system will also
provide the user with audio feedback on the detected objects.The requirements were
Functional requirements
Non-functional requirements
Interface requirements
Performance requirements
Non-functional requirements are the requirements which define how the system should
Performance: The system should be able to detect objects accurately and quickly.
Accessibility: The system should be easily accessible and usable by the blind.
Reliability: The system should be reliable and provide consistent results.
Security: The system should be secure and protect the user's privacy.
Scalability: The system should be scalable to handle a large number of users.
The information provided by an object detection system can be conveyed to the user through
various interfaces, such as audio and visual feedback. Audio interfaces might use text-to-speech
engines to describe objects or a series of beeps or tones to indicate an object's location and distance.
Visual interfaces might use a smartphone camera to capture an image of the object and then use
image recognition software to describe its properties.Object detection systems can be helpful for
blind individuals in a variety of settings, such as navigating through unfamiliar environments,
identifying items on store shelves, or recognizing people's faces. These systems can help improve
the independence and quality of life of visually impaired people by providing them with more
information about their surroundings.
There are several system interfaces for object detection that can be helpful for blind individuals.
Here are some options:
Visual interfaces: Visual interfaces use graphics or other visual cues to convey
information about the objects around a person. For example, a system could use a
smartphone camera to capture an image of the object and then use image recognition
software to describe the object's properties.
Wearable interfaces: Wearable interfaces are devices that are worn on the body, such as
smart glasses or a smartwatch. These devices can use a combination of audio, haptic, and
visual feedback to provide information about the objects around a person.
Overall, the choice of interface will depend on the individual's needs and preferences. Some people
may prefer an audio-based interface, while others may prefer a haptic or visual interface. It's
important to consider factors such as ease of use, portability, and cost when choosing an object
detection system for a blind person.
CHAPTER-5
RISK ASSESSMENT
Risk analysis in software testing is an approach to software testing where software risk is analysed
and measured. Traditional software testing normally looks at relatively straight-forward function
testing (e.g. 2 + 2 = 4). A software risk analysis looks at code violations that present a threat to the
stability, security, or performance of the code.
Software risk is measured during testing by using code analyzers that can assess the code for both
risks within the code itself and between units that must interact inside the application. The greatest
software risk presents itself in these interactions. Complex applications using multiple
frameworks and languages can present flaws that are extremely difficult to find and tend to cause
the largest software disruptions.
5.1 Outliers
In the context of object detection for blind individuals, risk outliers refer to unexpected events or
situations that may occur when using the technology that could potentially cause harm or negative
consequences. Here are some possible risk outliers for object detection technology for blind
individuals:
Environmental factors: Object detection technology may not work well in certain
environments, such as low light conditions or areas with many obstacles. In these
situations, users may rely too heavily on the technology and be at risk of injury or accident.
Device malfunctions: Like any technology, object detection devices may malfunction or
fail, leading to incorrect information or no information being provided to the user. This
could potentially put the user at risk if they are relying on the technology to navigate or
identify objects.
Cybersecurity threats: Object detection technology that connects to the internet or other
networks may be vulnerable to cyber attacks or hacking attempts, which could compromise
user data or put the user at risk in other ways.
Legal and regulatory risks: The use of object detection technology may be subject to legal
and regulatory restrictions or requirements, which could pose risks to the user or the
developer if not properly addressed.
To address these risk outliers, developers of object detection technology for blind individuals
should take a proactive approach to risk management. This may include implementing strong
cybersecurity measures, conducting thorough testing and quality control, staying up-to-date with
legal and regulatory requirements, and engaging with diverse stakeholders to address social and
cultural concerns..
When selecting features for object detection technology for blind individuals, it is important to
consider the specific needs and preferences of the target audience. Here are some possible features
to consider for object detection technology for blind individuals:
Overall, feature selection for object detection technology for blind individuals should be guided
by the principles of usability, accessibility, and user-centered design..
A risk assessment service for object detection for blind persons would be a valuable tool to help
improve their mobility and safety. The service could be designed to use computer vision algorithms
to analyze video or images from a camera mounted on a user's cane or worn on their person. The
algorithms would identify and classify obstacles, such as curbs, stairs, or other potential hazards,
and then provide the user with audio or haptic feedback to warn them of the obstacle's presence
and location.
To develop such a service, a team of engineers, computer scientists, and accessibility experts
would need to work together. They would need to create and train a machine learning model to
recognize common obstacles that a blind person might encounter, as well as develop the software
and hardware necessary to provide the user with feedback.The service would also need to be tested
extensively to ensure it is accurate, reliable, and easy to use. It would also be important to consider
the privacy and security implications of such a service, to protect the user's data and prevent
unauthorized access to the device.
In summary, a risk assessment service for object detection for blind persons has the potential to
significantly improve the mobility and safety of blind individuals. However, developing such a
service would require a multidisciplinary team, careful testing, and consideration of privacy and
security concern
CHAPTER 6
APPLICATION ARCHITECTURE
6.1 E-R Diagram:
ER Diagram stands for Entity Relationship Diagram, also known as ERD is a diagram that displays
the relationship of entity sets stored in a database. In other words, ER diagrams help to explain the
logical structure of databases. ER diagrams are created based on three basic concepts: entities,
attributes and relationships.
The software features for an object detection system designed for blind individuals could include:
● Real-time object detection: The system should be able to detect objects in real-time and provide
feedback to the user about the objects present in their surroundings.
● Accurate object recognition: The system should be able to recognize different types of objects
accurately, including people, animals, vehicles, and other obstacles.
● Audio feedback: The system should provide audio feedback to the user to inform them about
the objects present in their surroundings. The audio feedback could include information about
the size, shape, and location of the object.
● Voice commands: The system should be able to take voice commands from the user, allowing
them to ask questions about the objects in their surroundings.
● Customizable settings: The system should have customizable settings, allowing the user to
adjust the sensitivity and range of the object detection system to suit their preferences and
needs.
● Multiple input options: The system should have multiple input options, allowing the user to
interact with the system through touch, voice commands, or other inputs.
● Data privacy: The system should prioritize data privacy and ensure that the user's personal
information is protected.
● User-friendly interface: The system should have a user-friendly interface that is easy to use
and understand, even for those who are not tech-savvy.
CHAPTER 8
TESTING AND EVALUATION
Object detection for blind people is an important and challenging area of research, and testing and
evaluation play a crucial role in ensuring that these systems are accurate and effective.
There are several approaches to evaluating object detection systems for blind people, including:
User Studies: Another approach to evaluating object detection systems for blind people is to conduct
user studies. These studies involve asking blind people to use the system and provide feedback on
its effectiveness, ease of use, and other important factors.
Real-world Testing: Testing object detection systems in real-world environments is also important.
This can involve testing the system in different lighting conditions, with different types of objects,
and in various locations.
Comparison with Existing Systems: Another important approach to evaluating object detection
systems for blind people is to compare them with existing systems. This can help to identify
strengths and weaknesses and highlight areas for improvement.
In addition to these approaches, it is also important to involve blind people in the development and
testing of these systems. By involving blind people in the process, developers can gain valuable
insights into how these systems can be improved to better meet the needs of their intended users.
CHAPTER 9
PROJECT SNAPSHOTS
Object detection is a promising technology that has the potential to enhance the independence of blind
persons. However, there are several limitations of object detection technology that should be considered
when developing a project for blind persons. Here are some of the most important limitations:
Accuracy: Object detection algorithms are not always accurate in identifying objects, particularly
when objects are partially obscured or located in complex environments. Inaccuracies can lead to
false positives or false negatives, which can create confusion or danger for blind persons.
Cost: The cost of object detection technology can be high, making it difficult for many blind
persons to afford. Additionally, some object detection systems require specialized hardware, such
as cameras or sensors, which can further increase costs.
Accessibility: Object detection technology may not be accessible to all blind persons, particularly
those who are not proficient with technology. It is important to ensure that any project utilizing
object detection technology is designed with accessibility in mind.
Dependence: Blind persons may become overly dependent on object detection technology, which
can limit their ability to develop other skills and strategies for navigating the world. For example,
if they become reliant on object detection systems for navigation, they may not develop other
orientation and mobility skills.
Privacy: Object detection technology can raise privacy concerns, particularly if cameras or sensors
are used to collect data about a person's movements or activities. It is important to ensure that any
project utilizing object detection technology is designed with privacy in mind.
Cultural Differences: It is important to consider cultural differences in the use of object detection
technology. Some cultures may view the use of cameras or sensors as intrusive, which can limit
the effectiveness of object detection technology in those contexts.
Training Data Bias: Object detection algorithms rely on large amounts of training data to identify
and classify objects accurately. However, if the training data is biased towards certain types of
objects or certain environments, the object detection system may perform poorly in other
environments or with other objects.
Overall, while object detection technology can be a valuable tool for enhancing the independence of blind
persons, it is important to consider these limitations when designing a project. By taking these limitations
into account, developers can create projects that are more effective, accessible, and respectful of the
privacy and cultural sensitivities of blind persons
CHAPTER 11
FUTURE SCOPE
The future scope of object detection technology for blind persons is promising, as there is a lot of potential
for further development and improvement. Here are some of the potential future directions for object
detection technology for blind persons:
Improved Accuracy: Researchers are constantly working on improving the accuracy of object
detection algorithms, particularly in challenging environments. This could help reduce false
positives and false negatives and make object detection technology more reliable.
More Affordable Technology: As the technology advances and becomes more mainstream, it is
possible that the cost of object detection technology will decrease, making it more accessible to
blind persons.
Improved Accessibility: Developers are working on improving the accessibility of object detection
technology for blind persons, including through the use of audio and haptic feedback.
Integration with Other Technologies: Object detection technology could be integrated with other
technologies, such as artificial intelligence, to provide more advanced features and capabilities.
Increased Privacy Protections: As concerns around privacy continue to grow, developers are
working on improving the privacy protections of object detection technology, such as by
minimizing the collection and storage of personal data.
Adoption by Industry: Object detection technology is being adopted by industries such as retail
and transportation to improve accessibility for customers and passengers. As these industries
continue to embrace the technology, there may be further opportunities for the technology to be
utilized for blind persons.
Overall, the future scope of object detection technology for blind persons is bright. With continued
research and development, it is possible that object detection technology could become an even
more effective and accessible tool for enhancing the independence of blind persons.
Conclusion
In conclusion, object detection technology holds great promise for enhancing the independence of blind
persons. However, as with any technology, there are limitations that must be considered when developing
projects for blind persons. These limitations include accuracy, cost, accessibility, dependence, privacy,
cultural differences, and training data bias. Despite these limitations, the future scope of object detection
technology for blind persons is promising. Researchers are working to improve accuracy and accessibility,
integrate the technology with other systems, and increase privacy protections. As the technology continues
to develop and become more widely adopted, it has the potential to revolutionize the way that blind
persons navigate and interact with the world around them. Ultimately, it is important to continue to
develop and refine object detection technology in a way that is respectful of the unique needs and
perspectives of blind persons.
REFERENCES
[1] Dr. M Y Babu, Akash Jatavath “Object Detection system with voice alert” ,International
research journal of engineering Vol 9 S(1), DOI: 10.17485/ijst/2016/v9iS(1)/108446
ref.December 2021.
[2] Shifa Shaikh, Vrushali Karale, Gaurav “Real time object detection for blind person” in Issue
09,September-2022Vision,,Vol. 9905, pp. 21-37, Sept. 2018
[3] Hanen Jabnoun, Faouzi Benzarti, Hamid Amiri “Object Detection and identification for blind
people” in vedio scene,international conference of computational intelligence,pp 27-34.
[4] Mayuresh Banne , Rahul Vhatkar, Ruchita Tatkare “Object Detection and Translation for
Blind People Using Deep Learning”, international Research Journal of Engineering and
Technology .(IRJET) e-ISSN: 2395-0056
[5] Rajeshwari Ravindra Karmarkar “Object detection system for blind person with voice
guidance” International Journal of Engineering Applied Sciences and Technology, 2021 Vol.
6, Issue 2, ISSN No. (2455-2143).
[6] Aniket Birambole, Pooja Bhagat, Bhavesh Mhatre, Prof. Aarti Abhyankar “Indian Journal of
Marine science” 38, (2009):
[7] Nada Alzahrani, Heyam-H Al Baity “ Object Recognition for the visually Impaired” ref fig
Information Technology Department, Collage of Computer and Information Sciences, King Saud
University, Riyadh P.O. Box 145111, Saudi Arabia.
[8] Manjula S. , Laxmi Krishnamurthy “ Study on Object detection” December 2016 International
journal of pharmacy and technology ISSN: 0975-766X.
[9] Tausif dewaan, Anirudh, Jitendra V. Tembhurne “ Object detection using YOLO ”
Department of Computer Science & Engineering, Indian Institute of Information Technology,
Nagpur, India. 2017.
[10] Udulie H.D.I., Lakshini Kuganandarmurthy “ Real time Object detection Using YOLO”
Approx Algorithms Metaheuristics, pp. 1-1432, 2007
[11] Daniel D, Rekha M S “ Facial Emotion and Object Detection for visually impaired and
Blind person” International Journal of engineering research and technology Volume 10, Issue 09(
September 2021) .
OBJECT DETECTION FOR BLIND
PERSON USING ML
Sanjay Srivastav sir 1, Sakshi Khandelwal 2, Pradumn Singh3, Sadik Ali4, Yash Vashney5
Proposed system on Object Detection for visually Word Recognition: Voice recognition software
impaired System by S. Ketkale, M. Mujawar, and (also known as speech to textual content
S. Londhe suggest the proposal in this study software) allows users to use their voice instead
focuses on providing information on the various of typing on a computer. Voice recognition can
types of barriers in front of the person, their be used to dictate text into a computer or to issue
length, and their distance from the person. The commands to the computer. Voice recognition
camcorder is used to capture video. Following software enables a quick method of writing on a
that, video processing methods are used.. computer. It is also useful for people with
disabilities who find it difficult to operate a
3. Proposed Method keyboard.
REFERENCES