
A

PROJECT DESIGN REPORT


ON
Android application for real time object detection
using deep learning
For the subject Lab1 Project Phase 1
Submitted in partial fulfillment of the requirement for the award of
Bachelor of Engineering
In
Computer Science and Engineering
Punyashlok Ahilyadevi Holkar Solapur University
By
Name Roll. No. Exam Seat No.
Mr. Tushar Hulle 24 1823590
Mr. Prasanna Khadake 25 1724433
Mr. Faizan Makandar 26 1724661
Mr. Shahid Shaikh 27 1724491
Mr. Sourabh Nare 70 1525348

Under Guidance Of
Prof. V.D.Chavan
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
WALCHAND INSTITUTE OF TECHNOLOGY
SOLAPUR - 413006
(2020-21)

CERTIFICATE
This is to certify that the Project entitled

Android application for real time object detection using deep learning

is submitted by

Name Roll. No. Exam Seat No.


Mr. Tushar Hulle 24 1823590
Mr. Prasanna Khadake 25 1724433
Mr. Faizan Makandar 26 1724661
Mr. Shahid Shaikh 27 1724491
Mr. Sourabh Nare 70 1525348

as a part of the Project Design Report,
studying in BE CSE, for the subject Lab1 Project Phase 1.

(Prof. V.D.Chavan) (Prof. A.R.Kulkarni)


Project Guide Head
Dept of Computer Science & Engg

(Dr. S .A. Halkude)


Principal
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
WALCHAND INSTITUTE OF TECHNOLOGY
SOLAPUR
(2020-21)

INDEX
Sr No. Topic
1 Abstract
2 Introduction
3 Background
4 Technologies Required
5 Objectives
6 Proposed Work
7 UML Diagrams
8 Work Planned for Next Semester
9 Conclusion/Summary
10 References
Abstract

As technology becomes part of everyday life, our project aims to develop an
Android application for blind people. The application will guide users and
notify them about the things around them. When a user goes outside, he or she
can start the app, which opens the mobile camera and begins capturing video.
The video is sent to a server, which converts the video frames into textual
data using an object detection algorithm and then converts that text into
audio. The audio is sent back to the user, who can then recognize the things
near them. The app is especially useful while walking on the road. Google
offers a similar app named Lookout, but it is supported only on Google Pixel,
LG, and some Samsung devices. We are trying to develop an Android app for all
devices that works faster and takes less of the mobile device's memory. By
keeping the processing on an online server, the app can run on low-RAM
devices as well.
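The frame-to-server step described above could be sketched as a small client-side payload builder. This is only an illustrative sketch: the field names and JSON shape are our own assumptions, since the report does not specify the actual server protocol.

```python
import base64
import json

def build_frame_payload(frame_bytes, frame_id):
    """Package one captured video frame as a JSON payload that the app
    could POST to the detection server.

    The field names ("frame_id", "image_b64") are illustrative, not a
    protocol the report defines."""
    return json.dumps({
        "frame_id": frame_id,
        "image_b64": base64.b64encode(frame_bytes).decode("ascii"),
    })

# Example: wrap some fake JPEG bytes for frame number 1
payload = build_frame_payload(b"\xff\xd8fake-jpeg-bytes", frame_id=1)
print(json.loads(payload)["frame_id"])  # → 1
```

On the server side, the payload would be decoded back to image bytes before being passed to the object detection model.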
Introduction

We are focusing on the needs of visually impaired users of mobile devices.

The rapid development of information technology has made many mobile
applications difficult for visually impaired people to access. Designing user
interfaces that are accessible to blind users is a serious issue and has
become a new field of research. Most mobile applications available on Google
Play or the App Store have complex graphical user interfaces that are
impossible for visually impaired people to use. We are therefore developing
an Android application that helps visually impaired people recognize the
things near them. The app has a very simple, easy-to-use interface: when the
user starts it, it opens the device camera and begins recognizing the objects
in front of them.
Background

It has been observed that a handicapped person does not like a helper who
merely shows empathy for his incomplete physique. These are the instances
where technology should bridge this empathy gap by making the handicapped
person self-dependent in whatever way possible. Bridging the gap in this way
can help relationships grow in a healthy way, because there is less
dependence on each other at the physical level. To bring this thought into
reality, we analyzed the problems visually impaired people face when they are
outdoors. The major concern was how to make people trust this technology, as
any malfunction in the system could even result in the user's death.

Essentially, the system is an audio guide for visually impaired people that
verbally warns them of objects coming their way. Blind people are mainly
afraid of the vehicles running on the road, and this system is capable of
informing them about all moving objects in their vicinity. The system's
interaction speed is high because little effort is spent on visual polish.
Technologies Required

Dataset:-

➢ COCO dataset

Frontend :-

➢ Android Studio IDE with Java and SDK tools.
➢ The main implementation of the application is written in Android Studio
using Java as the source language.
➢ SDK toolkit for device-specific configuration.

Backend :-

➢ Python for building the deep learning model.
➢ Jupyter Notebook and PyCharm IDEs for development.
➢ Cloud for deployment of the model.
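To illustrate the backend step of turning detector output into usable labels, here is a minimal sketch. It assumes the model returns (COCO class id, confidence) pairs; the label subset and the 0.5 threshold are our own illustrative choices, not values the report commits to.

```python
# Hypothetical post-processing of object detection output.
# The mapping below is a small illustrative slice of the COCO categories.
COCO_LABELS = {
    1: "person",
    2: "bicycle",
    3: "car",
    4: "motorcycle",
    6: "bus",
    8: "truck",
    10: "traffic light",
}

def filter_detections(detections, threshold=0.5):
    """Keep confident detections and map COCO ids to readable names."""
    names = []
    for class_id, confidence in detections:
        if confidence >= threshold and class_id in COCO_LABELS:
            names.append(COCO_LABELS[class_id])
    return names

# Example: raw model output for one frame
raw = [(1, 0.92), (3, 0.81), (10, 0.30)]
print(filter_detections(raw))  # → ['person', 'car']
```

The resulting list of names is the textual data that would later be converted to audio.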

Objectives

Our work is based on a motive to ease the lives of visually impaired people
who face difficulty in their day to day life. The system is designed to verbally guide
the person by instructing him/her about the objects which comes in his way while
he is outdoor.

The system is optimized to be fast and verbally interactive. Less care has
been taken to improve the visual interface, as the user is blind. Due to system’s less
visual appeal, the system manages to interact with the person at a rate at which he
is walking as the servers won’t have the load of sending highly pixeled web pages.
Proposed Work

Fig.-1

➢ Installation of IDEs.
➢ Installing all the modules and packages required for training the model.
➢ Collecting the COCO dataset.
➢ Studying different object detection algorithms and comparing their accuracy.
➢ Research on speech synthesizer algorithms.
UML diagrams
1. Activity diagram:-

Fig.-2
2. Use case diagram:-

Fig.-3
Work Planned for Next Semester

➢ Completion of training of the object detection model.
➢ Storing the results of detected objects in text format.
➢ Comparing speech synthesizers by their accuracy rates.
➢ Training the speech synthesizer model on the text data.
➢ Final deployment on the cloud and display of the results.
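The "detected objects to text" step above could be sketched as follows. The sentence template is our own illustrative choice, and the pyttsx3 call shown in the comment is just one possible offline text-to-speech option, not the synthesizer the report selects.

```python
from collections import Counter

def detections_to_sentence(labels):
    """Turn a list of detected object names into a short spoken sentence,
    e.g. ['car', 'car', 'person'] -> 'Ahead: 2 cars, 1 person.'
    The phrasing is illustrative."""
    if not labels:
        return "Nothing detected ahead."
    counts = Counter(labels)
    parts = [f"{n} {name}{'s' if n > 1 else ''}" for name, n in counts.items()]
    return "Ahead: " + ", ".join(parts) + "."

if __name__ == "__main__":
    sentence = detections_to_sentence(["car", "car", "person"])
    print(sentence)  # → Ahead: 2 cars, 1 person.
    # One possible way to speak the sentence offline (requires pyttsx3):
    # import pyttsx3
    # engine = pyttsx3.init()
    # engine.say(sentence)
    # engine.runAndWait()
```

In the deployed system, the server would run this conversion and stream the synthesized audio back to the phone.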

Conclusion/Summary

The system is a verbally interactive arrangement to guide blind users on
their way outdoors. Efforts have been taken to keep the system as interactive
as possible. Time is the main constraint kept in mind while developing this
app, as everything depends on how quickly the app makes the person aware of
an object coming his or her way.
References

Idea:
https://brailleworks.com/5-top-mobile-apps-for-the-blind/
https://www.youtube.com/watch?v=gc2_Kva-HeE&t=1s

Object detection:
https://machinelearningmastery.com/object-recognition-with-deep-learning/
https://towardsdatascience.com/object-detection-simplified-e07aa3830954

Speech synthesizer:
https://journals.indexcopernicus.com/search/article?articleId=1756224
