0% found this document useful (0 votes)
19 views

Voice Assistant using python (Jalax)

Uploaded by

0126it201012
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views

Voice Assistant using python (Jalax)

Uploaded by

0126it201012
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

ORIENTAL COLLEGE OF TECHNOLOGY,

BHOPAL

VOICE
ASSISTANT
USING PYTHON
(JALAX)

Akash Malviya (0126IT201012)

Akash kajve (0126IT201011)

Milind Kumeriya (0126IT201055)


Approval Certificate

Major Project work entitled“Voice Assistant using python (Jalax)” being


submitted by Akash Malviya (0126IT201012) , Akash kajve
(0126IT201011) & Milind Kumeriya (0126IT201055) has been
examined by us and is hereby approved.

Supervisor Name: Hod:


Prof. Ruchi jain Prof. Amit Kanskar
INTRODUCTION
• As we know Python is a suitable language for scriptwriters and
developers. Lets write a script for Voice Assistant using Python. The
query for the assistant can be manipulated as per the user’s need.

• Speech recognition is the process of converting audio into text. This
is commonly used in voice assistants like Jalax, Siri, etc. Python
provides an API called Speech Recognition to allow us to convert
audio into text for further processing.

• In this article, we will look at converting large or long audio files into
text using the Speech Recognition API in python.
Project Objectives
• Upcoming trending technologies such as virtual reality, augmented
reality, voice interaction, IOT etc are changing the way people
engage with the world and transforming digital experiences.

• Voice control is one of important development of human-machine


interaction, which was possible because of advancement in
Artificial Intelligence.

• we are able to train our machine to do their tasks by themselves or


to think like humans using technologies like Artificial Intelligence,
Machine Learning, Neural Networks, etc. we can talk to our
machines with the help of virtual assistants.
Project Scope
• Voice assistants are devices/apps that use voice recognition
technology, natural language processing, and AI to respond to
humans. Using the technology, the device synthesises the user’s
message, breaks it down, evaluates it, and offers a meaningful
response in return.

• There are two categories of AI voice assistants: one is general-


purpose, and the other is the bot voice assistants.

• Voice assistant brands such as Siri and Alexa belong to the first
category. Conversely, voice chatbots or voice bots are usually the
second type where the assistant is integrated into an app/website to
help its users navigate the service.
HARDWARE USED
Laptop or pc:-

➢ Window 10 or higher

➢ Ryzan 5 processor system or higher

➢ 4GB RAM or higher

➢ 100GB ROM or higher Contents


SOFTWARE USED

➢ VISUAL STUDIO 2022

➢ PYTTSX3

➢ SPEECH RECOGNITION

➢ WOLFRAMALPHA
Logic & Business rules

• Business logic is the custom rules or algorithms that handle the


exchange of information between a database and user
interface. Business logic is essentially the part of a computer
program that contains the information (in the form of business rules)
that defines or constrains how a business operates.

• Business logic can be seen in the workflows that they support, such
as in sequences or steps that specify in detail the proper flow of
information or data, and therefore decision-making. Business logic is
also known as "domain logic."
Design
Existing System

In existing system, the audio command is taken as input through


microphone of the device. The next task of voice assistant will be to
analyze audio command and give appropriate output to the user.

Data flow diagram (DFD)

DFD is graphical representation of system which give detail information


about data flow between input and output. As level increases it
elaborates detail information about data flow.
Python:
• Python is an interpreted high-level general-purpose programming
language.
• The version python 3.6.0 is used in the development of voice
assistance project.

PyCharm:
• PyCharm is the most popular IDE for Python
• It includes great features such as excellent code completion and
inspection with advanced debugger and support for web
programming and various frameworks.
Screenshrot
Testing
• There are four general steps to running automated tests for
voice recognition software:
• Screen navigation to voice assistant (supported by Appium).
• Activate voice assistant (supported by Appium).
• Say a voice command (requires advanced automation).
• Validate screen/response (requires advanced automation).
Conclusion
• Voice-Controlled Devices uses Natural Language Processing to
process the language spoken by the human and understand the
query and process the query and respond to the human with the
result. The understanding of the device means Artificial
Intelligence needs to be integrated with the device so that the
device can work in a smart way and can also control IoT
applications and devices and can also respond to query which
will search the web for results and process it. It is designed to
minimize the human efforts and control the device with just
human Voice. The device can also be designed to interact with
other intelligent voice-controlled devices like IoT applications
and devices, weather reports of a city from the Internet, send an
email to a client, add events on the calendar, etc
References

1. DOUGLAS O’SHAUGHNESSY, SENIOR MEMBER, IEEE,


“Interacting With Computers by Voice: Automatic Speech Recognition
and Synthesis” proceedings

2. THE IEEE, VOL. 91, NO. 9, SEPTEMBER 2003 2. Kei Hashimoto1,


Junichi Yamagishi2, William Byrne3 Simon King2, Keiichi Tokuda,

3. An analysis of machine translation and speech synthesis in speech-to-


speech.
Thank You

You might also like