Automated Extraction and Augmentation of Key Information From Audio Using Speech Recognition and Text Summarization
Automated Extraction and Augmentation of Key Information From Audio Using Speech Recognition and Text Summarization
ISSN No:-2456-2165
Abstract:- This Audio lectures and speeches contain a format, allowing for accurate translation of verbal
wealth of valuable information, but reviewing and communication into textual form. It is important to note that
extracting the key points can be tedious and time- ASR differs from voice recognition, which focuses on
consuming. This paper presents an automated system identifying an individual's specific voice.
that uses speech recognition and text summarization
techniques to identify and summarize the most salient Meanwhile, text summarization condenses lengthy
content from spoken presentations. Audio is first documents into concise overviews by extracting main points.
transcribed to text via a speech recognition engine. The By generating summarized content, it becomes easier to
resulting text is then processed by an extractive review online lectures and workshops.
summarization algorithm based on term frequency-
inverse document frequency (TF-IDF) to extract the most By integrating ASR and text summarization, this paper
important points. These summarized points can leverages NLP to mitigate the difficulties of online learning.
optionally be used to generate relevant supplementary Automated transcription of classes combined with
URLs that provide additional context or resources related summarized notes provides students and professionals with
to the topics covered. This system was developed to streamlined, digestible information. This assists
enable quick review of lectures and speeches by comprehension and retention as education continues adapting
automatically delivering condensed, relevant summaries. to a virtual landscape.
Keywords:- Speech Recognition, Extractive Summarization, The paper exemplifies how NLP and AI can enhance
TF-IDF, URLs. remote collaboration and learning during an unprecedented
shift to online platforms. With customized tools to target
I. INTRODUCTION unique challenges, technology can facilitate engagement,
understanding, and memory despite the limitations of distance
The COVID-19 pandemic had compelled a widespread learning.
transition from traditional in-person classrooms to online
education platforms. However, many students struggle to There are two main methods in order to summarize a
attend or consistently stay focused during virtual classes. given text, that is
Missing sessions or having difficulty concentrating can hinder
learning and retention when classes are held remotely. The Abstractive Summarization
unique challenges posed by online learning environments In abstractive summarization, the given source text
require adapted solutions to support students as education document is paraphrased and shortened as required. With an
moved online amid the pandemic. abstraction algorithm, grammatical inconsistencies can be
avoided as compared to extractive summarization methods.
Amidst the global shift to online education, the impact The abstractive summarization uses trained data to create new
has been significant, affecting approximately 1.2 billion phrases and sentences that provide the most important
children across 186 countries, as highlighted by a report from information from the text.
UNESCO. To address these issues, this paper presents a
system that utilizes natural language processing (NLP) Extractive Summarization
techniques to enhance the online education experience. Extractive summarization algorithms include extracting
the key phrases from the source text document and then
NLP allows computer algorithms to analyze and integrating them to generate a summary. There are no changes
comprehend human language. Its applications range from made to the words or phrases in the source text and the
handwriting recognition and speech recognition to creating summary is generated according to the given metrics.
chatbots and automatic text summarization. Extractive summarization algorithms do not require an
exhaustive set of training data and are comparatively less
In the context of online education, automatic speech complex than abstractive summarization techniques and thus
recognition (ASR) plays a crucial role. ASR algorithms are widely popular.
possess the ability to convert spoken speech into a written