0% found this document useful (0 votes)
3 views

Developing-A-Desktop-Voice project

The document discusses the development of a desktop voice assistant inspired by existing virtual assistants like Cortana and Siri, aiming to facilitate user tasks through voice commands. It outlines the project's methodology, including requirements gathering, technical and market feasibility, and system design aspects like class and sequence diagrams. The conclusion emphasizes the role of artificial intelligence in reducing human effort and enhancing efficiency in daily tasks.

Uploaded by

chmaheshkumar98
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Developing-A-Desktop-Voice project

The document discusses the development of a desktop voice assistant inspired by existing virtual assistants like Cortana and Siri, aiming to facilitate user tasks through voice commands. It outlines the project's methodology, including requirements gathering, technical and market feasibility, and system design aspects like class and sequence diagrams. The conclusion emphasizes the role of artificial intelligence in reducing human effort and enhancing efficiency in daily tasks.

Uploaded by

chmaheshkumar98
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

International Journal of Scientific Research in Engineering and Management (IJSREM)

Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

Developing a Desktop Voice Assistant


Dr.PSV.Srinivasa Rao1, A.Prem kumar2, Anas Abdullah3 ,Ch.Rajeshwari4,K.Devika5
1Dr.PSV.Srinivasa Rao (professor)
2A.Prem Kumar Department of Computer Science and Engineering (Joginpally b.r. Engineering College)
3Anas Abdullah Department of Computer Science and Engineering (Joginpally b.r. Engineering College)
4Ch.Rajeshwari Department of Computer Science and Engineering (Joginpally b.r. Engineering College)
5K.Devika Department of Computer Science and Engineering (Joginpally b.r. Engineering College)

---------------------------------------------------------------------***---------------------------------------------------------------------
ABSTRACT the concept of AI in every field is decreasing human
effort and saving time.
The project aims to develop a personal-assistant for
Laptops. Krishna draws its inspiration from virtual
assistants like Cortana for Windows, and Siri for iOS but 2. LITERATURE SURVEY
Cortana extends its services to some extent. Our project Voice Assistant has the long history. It has been in the
has been designed to provide a userfriendly interface for phase of evolution since 1880.
carrying out a variety of tasks by employing certain well- In 1880 Alexander Graham Bell implemented further
defined activities which made it easier to send emails operations over Edison’s phonograph, which his Volta
without typing any word, searching on Google without Graphophone Company patented in 1886. Instead of foil
opening the browser, and performing many other daily graphophone was used, which allowed for longer
tasks like playing music, opening your favorite IDE with recordings and higherquality playback. Edison also
the help of a single voice command. Users can interact developed a wax version of the phonograph and both
with the assistant either through voice commands or using devices were used primarily for dictating letters and other
keyboard inputs. Python is an emerging language so it documents.
becomes easy to write a script for Voice Assistant in In 1961 IBM introduced the IBM Shoebox, it’s the first
Python. The instructions for the assistant can be handled digital speech recognition tool. It recognized 16 words
as per the requirement of user. In Python there is an API and digits 0 to 9. It was able perform mathematical
called Speech Recognition which allows us to convert functions and perform speech recognition.
speech into text. It was an interesting task to make our In 1996 Microsoft introduces Clippy. Microsoft Clippy,
own assistant. By making this project, we realized that the it’s also known as Clippit and officially recognized as
concept of AI in every field is decreasing human effort Office Assistant, it was an intelligent user interface for
and saving time. Microsoft Office. It assisted the users in a number of
interactive ways by appearing as a visualized character on
the Office applications and offering help related to
Key Words: NLP(natural language processing),Speech to text
various operations of the Office Software. It was made
conversion
available in the Microsoft Office for Windows in 1997
1. INTRODUCTION and in 2003 it was discontinued.
Artificial Intelligence when used with machines, it shows
us the capability of thinking like humans. In this, a 3.PROBLEM STATEMENT
computer system is designed in such a way that typically
requires interaction from human. As we know Python is To develop a software application that allows users to
an emerging language so it becomes easy to write a script interact with their computer through voice commands,
for Voice Assistant in Python. The instructions for the eliminating the need for manual keyboard and mouse
assistant can be handled as per the requirement of user. input, providing a more efficient and accessible way to
Speech recognition is the Alexa, Siri, etc. In Python there perform common desktop tasks like opening applications,
is an API called Speech Recognition which allows us to searching the web, managing files, and controlling
convert speech into text. It was an interesting task to make system settings, particularly for users who may have
my own assistant. It became easier to send emails without difficulty with traditional input methods
typing any word, searching on Google without opening
the browser, and performing many other daily tasks like 4. METHODOLOGY
playing music, opening your favorite IDE with the help A methodology for a desktop voice assistant project
of a single voice command. In the current scenario, typically involves: defining project requirements,
advancement in technology is such that they can perform choosing a development platform and libraries,
any task with same effectiveness or can say more implementing speech recognition, natural language
effectively than us. By making this project, I realized that processing (NLP) to understand user commands,

© 2025, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM41135 | Page 1


International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

integrating with system functionalities, building a user The class diagram is the main building block of object-
interface, and testing and refining the assistant. oriented modeling. It is used both for general conceptual
modeling of the systematic of the application, and for
4.1.Project Definition and Requirements Gathering: detailed modeling translating the models into
programming code. Class diagrams can also be used for
Understand who will primarily use the assistant and data modeling. The classes in a class diagram represent
what tasks they need it to perform (e.g., opening both the main objects, interactions in the application and
applications, setting reminders, controlling media, the classes to be programmed. In the diagram, classes are
searching the web). represented with boxes which contain three parts:

4.2Technical Feasibility • The upper part holds the name of the class.
The current state of NLP and speech recognition • The middle part contains the attributes of the
technology allows for accurate interpretation of user class.
commands. Training machine learning models for
personalized user experiences is feasible with available • The bottom part gives the methods or operations
datasets. Compatibility with major platforms, including the class can take or undertake.
iOS and Android, is technically feasible. APIs provided
by these platforms enable seamless integration of the
voice assistant into various devices.

4.3 Market Feasibility

Consumer trends indicate an increasing preference for


voice-activated services. The integration of voice
assistants in smartphones, smart speakers, and other
devices demonstrates a substantial market opportunity.
While competition exists, the uniqueness of the proposed
voice assistant, coupled with its personalized features,
can differentiate it in the market. Continuous updates and
Fig 1.system design
improvements will be crucial to staying competitive.
5.2 Sequence Diagram
4.4 Economic Feasibility
A sequence diagram is a kind of interaction diagram that
The cost of developing a personal voice assistant
shows how processes operate with one another and in
depends on factors such as software development, AI
what order. It is a construct of a Message Sequence Chart.
model training, and user interface design. However, the
A sequence diagram shows object interactions arranged
potential return on investment, given the demand for
in time sequence. It depicts the objects and classes
voice assistants, justifies the initial costs. Revenue can be
involved in the scenario and the sequence of messages
generated through partnerships with device
exchanged between the objects needed to carry out the
manufacturers, subscription models for premium
functionality of the scenario. Sequence diagrams are
features, and potentially through data analytics insights
typically associated with use case realizations in the
derived from user interactions (ensuring strict privacy
Logical View of the system under development.
compliance).
Sequence diagrams are sometimes called event diagrams,
event scenarios, and timing diagrams.
4.5 Legal and Ethical Feasibility

Adhering to data protection regulations is paramount.


Implementing robust security measures, obtaining user
consent, and anonymizing data ensure compliance with
privacy laws.

5.SYSTEM DESIGN

5.1 Class Diagram

© 2025, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM41135 | Page 2


International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

Fig 4.Use case diagram

Fig 2.Sequence Diagram 5.5 E-R Diagram

ER Model stands for Entity Relationship Model is a


5.3 Collaboration Diagram high-level conceptual data model diagram. ER model
helps to systematically analyze data requirements to
A collaboration diagram describes interactions among produce a well-designed database. The ER Model
objects in terms of sequenced messages. Collaboration represents real-world entities and the relationships
diagrams represent a combination of information taken between them. Creating an ER Model in DBMS is
from class, sequence, and use case diagrams describing considered as a best practice before implementing your
both the static structure and dynamic behavior of a database
system.

Fig 3.Collaboration Diagram

5.4 Use case diagram


Fig 5.E-R diagram
A use case diagram at its simplest is a representation
of a user's interaction with the system and depicting the
specifications of a use case. A use case diagram can 6. RESULTS
portray the different types of users of a system and the
various ways that they interact with the system. This type
of diagram is typically used in conjunction with the
textual use case and will often be accompanied by other
types of diagrams as well.

Fig 5. Output results

© 2025, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM41135 | Page 3


International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

Fig 6.shutting down system

Fig 7.Telling battery percentage


Fig 9. Volume up down and mute

7.CONCLUSION

Voice-Controlled Devices uses Natural Language


Processing to process the language spoken by the human
to understand and process the query and respond to the
human with the result. It eases most of the tasks of the
user like searching the web and opening applications on
desktop In the process of making this project we realized
that the concept of Artificial Intelligence in every field is
decreasing human efforts and saving a lot of time.
Fig 8. Telling IP address

REFERENCES

[1] Rabiner Lawrence, Juang Bing-Hwang.


Fundamentals of Speech Recognition Prentice Hall ,
New Jersey, 1993, ISBN 0-13-015157-2 2.
[2] Deller John R., Jr., Hansen John J.L., Proakis
John G. ,Discrete-Time Processing of Speech Signals,
IEEE Press, ISBN 0-7803-5386-2 3.

© 2025, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM41135 | Page 4


International Journal of Scientific Research in Engineering and Management (IJSREM)
Volume: 09 Issue: 01 | Jan - 2025 SJIF Rating: 8.448 ISSN: 2582-3930

[3]Hayes H. Monson,Statistical Digital Signal Processing


and Modeling, John Wiley & Sons Inc. , Toronto, 1996,
ISBN 0-471-59431-8 4.
[4]Proakis John G., Manolakis Dimitris G.,Digital
Signal Processing, principles, algorithms, and
applications, Third Edition, Prentice Hall , New
Jersey, 1996, ISBN 0-13- 394338-9 5.
[5]Ashish Jain,Hohn Harris,Speaker identification using
MFCC and HMM based techniques,university Of
Florida,April 25,2004.

[6]http://www.cse.unsw.edu.au/~waleed/phd/html/n
ode38.html , downloaded on 2 Oct 2012.

© 2025, IJSREM | www.ijsrem.com DOI: 10.55041/IJSREM41135 | Page 5

You might also like