AI-Based Navigator For Hassle-Free Navigation For Visually Impaired
AI-Based Navigator For Hassle-Free Navigation For Visually Impaired
Supriya Patil
Student, Dept. of Computer Engineering,
Vivekanand Education Society’s Institute of Technology,
Chembur, India.
[email protected]
Abstract—There are about 285 million visually impaired people Although it can be very useful in outdoor settings, indoor
in the world. Though they manage to do their daily chores, with environments create different problems.
support but face many difficulties. The proposed solution
provides indoor and outdoor assistance to the visually impaired. In indoor environments, due to lack of environmental cues [eg.
Conventionally they generally use a cane for proper and hassle- landmarks] and randomly placed objects cane would be of little
free navigation. The system would aid them with a routine task use. Some people use accessible maps like tactile maps to help
like walking, aid while writing, going up and down the stairs, navigate indoors. Accessible maps help in improving and
searching for common objects like keys, brushes, chairs, etc. speeding up the process of building mental maps, using tactile
Voice-based aid would be provided to them through speakers and maps is difficult when the person is not familiar with raised
sensors. The ultrasonic sensor helps in recognizing objects within surfaces and tactile representations (e.g., lines, graphs, and
the vicinity which might not be captured by the cameras and Braille) which takes a lot of effort to learn [2].
produce an alerting sound for the user to avoid any sudden
bump. This system will be comparatively economical and Another popular method for navigation is the use of wearable
accessible. computer devices and mobile devices [2]. Since it does not
require the user to be familiar with their surroundings
Keywords- Convolutional Neural Networks (CNN), computer beforehand and is dynamic. The proposed work is based on this
vision, neural networks, Jetson Nano particular way of navigation. The three important features that
the proposed system would provide are (1) distance estimation,
I. INTRODUCTION (2) orientation, and (3) obstacles and hazards.
Considering that a significant portion of the global The proposed navigator intends to help in guiding visually
population has problems with the vision, navigating in the impaired people mainly in indoor environments.
indoor spaces still remains a challenge. While there are
attempts to design our indoor environments to the needs of II. LITERATURE SURVEY
visually impaired people, the cost and reach are some of the
limiting factors. There have been numerous studies conducted using
different sensors, algorithms, and models for assisting visually
Cane is one of the most used aids by sight impaired people impaired people in navigation. While some of them required
for navigation. With the help of the cane, a person can perceive processors with higher processing power others required a
the environment around them by hitting at different positions in good internet connection and their surroundings to be mapped
front of them. beforehand. The existing systems also lacked three features
essential in the mobility of a visually impaired person i.e.
distance estimation, orientation, and obstacles and hazards.
The navigation system [3] consists of 2 parts: a wearable path. The main drawback is that these tags need to be inserted
part and a schematic of the area where the navigation system beforehand and also it does not detect randomly placed objects.
works by guiding the user. The indoor space is divided into It does provide distance estimation but that is limited to the
cells with each cell containing IR transmitters. The location of objects with chips embedded in them [5].
these transmitters is mapped onto a matrix and using the IR
signals received, a processor calculates the position of the user. “Navigation System for Blind - Third Eye” is an AI-based
The drawback is that the area of navigation is limited to the system. It uses sonar sensors for estimating the distance, IR for
number of transmitters. Furthermore, if we intend to increase obstacle avoidance, and cameras for orientation. The model
the range then it would put a load on the processor as it would used for detection and classification falls short on accuracy and
be required to perform more calculations. Finally, it does not it can be improved using other models [7]. This system uses
help the user in avoiding obstacles or in distance estimation ultrasonic sensors, a microcontroller, and aids the user by
[3]. synthetic speech output. It uses vibrations to alert the person of
an obstacle in his/her immediate vicinity. Depending on the
The system “NAVI” assists the blind and visually impaired distance from the obstacle, the vibrations either increase or
by using sound commands. The system uses RGB sensors and decrease in intensity. This system does not help in orienting the
IR sensors to detect objects which can be seen as well as which user in their surroundings but is a good aid for obstacle
cannot be seen. It employs a pathfinding algorithm to help to avoidance [8].
assist in navigation. One of the pitfalls is that the system
expects a robust processor with high-performance capabilities Real-time Object detection [10] uses a YOLO model and
as well as it needs an array of sensors for operation. Although it phone camera/web camera to capture a video feed of the
succeeds in orientation and obstacle avoidance it fails in surroundings. The major drawback of this paper is that it
providing the distance estimation [4]. depends on the phone/ laptop for orienting the camera. The
paper relies on the user for positioning the camera towards
This system uses Radio-frequency identification (RFID) for where they are moving, this can be difficult under given
both indoor and outdoor navigation. It uses RFID tags circumstances.
embedded in objects, these tags store electronic information
and need not be in the line of sight of the user. The user is
given a cane with the receiver and by interacting with the
detectors on the floor, a braille message is generated which
would help in identifying their location or any obstacles in the
TABLE I. COMPARISON OF EXISTING AND PROPOSED NAVIGATION MODELS FOR PEOPLE WITH VISION IMPAIRMENT
Sensors IR & IR & RGB RFID Tag IR sensor, Stereoscopic Camera Cameras
Ultrasonic sensors reader sonar sonar system, and
receivers sensor, accelerometer Ultrasonic
and sensors
camera
IV. IMPLEMENTATION
Jetson Nano is the main processing part of the system.
The cameras and speakers are connected to the processor.
Using the high processing power of GPU, Jetson Nano can
run our model to detect and alert the user. The entire neural
network is written in Pytorch and executed in anaconda
environment. The model was trained on the COCO dataset
for object detection. COCO is a large-scale object detection,
Fig. 3 Object detection steps segmentation, and captioning dataset [9].
For the obstacle avoidance part of our system, an
c) Class probabilities Arduino Uno microcontroller was employed in the
Class probabilities for each grid cell are calculated. This system. It is the microcontroller board that is equipped with
value along with the confidence value of bounding boxes a digital and analog input/output pin. It can be programmed
from earlier is then multiplied. The bounding box has a low in Arduino software. The Arduino is used with an ultrasonic
value and is then removed from the image. We end up with sensor. The Ultrasonic sensor emits ultrasound at 40000Hz
the class probability map as shown in Fig. 4 [8]. The objects which travels through the air and if there is an object its path
detected are then sent to the speech module, where it is will bounce back to the module. The Arduino is
relayed to the user using headphones. programmed to beep when the object comes in the preset
range of ultrasonic sensor which is done using an
2) Obstacle detection using ultrasonic sensor Arduino. AI-based Navigator for hassle-free navigation for
The ultrasonic sensor consists of two parts - a transmitter the visually impaired is developed and integrated as
and a receiver. The transmitter continuously transmits sound mentioned in the design. The system has 4 separate
waves. When an object is in the path of a sound wave, part of components: Jetson nano, camera, Arduino uno, and
it is transmitted back. The microcontroller keeps track of the ultrasonic sensor.
transmitted time and the received time. The time difference
is then used to calculate the distance between the object and V. RESULTS
the sensor. The microcontroller then runs this value into a
program, if the value is found to be less than the threshold AI-based Navigator for hassle-free navigation for the
value, it sends a signal to the buzzer. The buzzer then alerts visually impaired is developed and integrated as mentioned
the user of the presence of an object. in the design. The system has 4 separate components: jetson
nano, camera, Arduino uno, and ultrasonic sensor.
C. YOLO object detection model Figure 5 and 6 shows the output from the object detection
YOLO is a real-time object detection CNN model. It is model using YOLO.
the most effective way to detect objects and it does it in a Following tasks were performed.
single neural network. It is capable of achieving high
accuracy while running in real-time, it can process at a rate • Detecting the distance of the object.
of 45 frames per second.
• Detecting the type of the object.
The neural network has 24 fully connected layers
VI. CONCLUSION
The proposed system can detect objects and guide the
user in indoor environments. The main aim was to assist a
Fig. 5 System detecting multiple objects on table visually impaired person in navigation without any costly
infrastructure or equipment. The proposed system can
The ultrasonic sensor and Arduino were used for
achieve its objectives (orientation, distance estimation,
measuring the distance between the user and the objects. obstacle avoidance) essential for helping them to navigate.
The limitations in the system are the memory and processing
power of the processor. Even though Jetson Nano has high
processing capabilities it still is not enough to run big neural
networks. The number of classes of objects and the speed of
the system could have been better if a large neural network
was trained on a bigger dataset. The larger neural network
would require a more powerful processor, which would
increase the overall cost of the system thereby making it
difficult to distribute among the common populace. Even
with the existing neural network, the system does well in
assisting them for basic navigation around the indoor spaces.
REFERENCES