Text Summarization Using NLP
Published in 2020, this paper explores how maintaining irregular data is a major challenge for
organizations, which has increased the demand for text classification tools built on both
machine learning and deep learning models. The paper discusses three text classification methods:
*Supervised
*Unsupervised
*Semi-supervised
The authors also elaborate on specific approaches such as graph-based methods, transductive SVM,
self-cleaning, and co-cleaning. The paper concludes by explaining why it is essential to
categorize text using text mining within a semi-supervised learning approach to boost accuracy.
[2] Xiaoyu Luo. 2021. “Efficient English text classification using selected machine learning
techniques” (June 2021)
This recent paper, published in 2021, details the implementation of the support vector machine
(SVM) method and other ML techniques for classifying English text and documents.
The author applied the following methods to classify texts from three different data sets:
*Naive Bayes algorithm
*SVM method
*Logistic regression
*Logistic regression cross-validation (LRCV)
SVM performed best when simulated on the Weka platform, reaching a precision of around 90 percent
on one of the data sets and the highest precision on all three.
The Naive Bayes algorithm, by contrast, recorded the lowest precision, 12 percent on one of
the data sets.
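To make the reported metric concrete, the sketch below computes precision for a single class and a macro average over all classes. It is an illustration only: the paper's exact averaging scheme is not stated here, and the function names and the sample labels are hypothetical.

```python
def precision(y_true, y_pred, positive):
    """Precision for one class: true positives / all predictions of that class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if p == positive and t == positive)
    fp = sum(1 for t, p in pairs if p == positive and t != positive)
    if tp + fp == 0:
        return 0.0  # the class was never predicted
    return tp / (tp + fp)


def macro_precision(y_true, y_pred):
    """Unweighted mean of the per-class precision values (assumed averaging)."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(precision(y_true, y_pred, c) for c in labels) / len(labels)


if __name__ == "__main__":
    # Hypothetical labels, not data from the paper.
    actual = ["sport", "sport", "politics", "politics", "tech"]
    predicted = ["sport", "politics", "politics", "politics", "tech"]
    print(precision(actual, predicted, "politics"))
    print(macro_precision(actual, predicted))
```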
The author also presents a straightforward approach for categorizing the data using text mining,
attribute abstraction, stop-word removal, stemming, and vector space representation of documents.
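The preprocessing steps named above can be sketched roughly as follows. This is a simplified illustration, not the paper's implementation: the stop-word list is a tiny hypothetical sample, `naive_stem` is a crude suffix stripper standing in for a real stemmer such as Porter's, and the vector space model is reduced to raw term counts.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not the paper's list).
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "and", "to", "in", "on", "for"}
SUFFIXES = ("ing", "ed", "es", "s")


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def naive_stem(word):
    """Strip one common suffix if at least three characters remain."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Tokenize, drop stop words, then stem what is left."""
    return [naive_stem(t) for t in tokenize(text) if t not in STOP_WORDS]


def to_vectors(documents):
    """Map each document to a term-count vector over a shared vocabulary."""
    processed = [preprocess(d) for d in documents]
    vocabulary = sorted({term for doc in processed for term in doc})
    vectors = []
    for doc in processed:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocabulary])
    return vocabulary, vectors
```

The resulting vectors can be passed to any of the classifiers listed earlier; a weighting such as TF-IDF is a common refinement over raw counts.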
PROBLEM DESCRIPTION:
In the modern Internet age, the volume of textual data is ever increasing. We need a way to
condense this data while preserving its information and meaning, and text summarization
serves that purpose.
Text summarization is the process of automatically generating natural language summaries from an
input document while retaining the important points.
The summary is customizable per user's request, allowing varying extents of summarization.
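As an illustration of a user-adjustable summary length, the sketch below implements a simple frequency-based extractive summarizer with a `ratio` parameter. Note that this is a stand-in technique: the project itself uses BART, which is abstractive. The stop-word list, the function names, and the `ratio` parameter are assumptions made for this example.

```python
import math
import re
from collections import Counter

# Small illustrative stop-word list (an assumption for this sketch).
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "of", "and", "to", "in", "it"}


def split_sentences(text):
    """Split on whitespace that follows '.', '!' or '?'."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def content_words(text):
    """Lowercased words with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOP_WORDS]


def summarize(text, ratio=0.3):
    """Keep the top-scoring fraction of sentences, in their original order."""
    sentences = split_sentences(text)
    if not sentences:
        return ""
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        if not tokens:
            return 0.0
        # Average word frequency, so long sentences are not favored by length alone.
        return sum(freq[w] for w in tokens) / len(tokens)

    keep = max(1, math.ceil(len(sentences) * ratio))
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:keep])
    return " ".join(sentences[i] for i in chosen)
```

A larger `ratio` keeps more sentences, so the same function can serve both short and detailed summaries.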
Key points from the summarization process, together with corresponding time-stamps, are then
presented to the user through a small UI next to the video feed.
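One possible way to pair key points with time-stamps is sketched below. It assumes the transcript is available as a list of segments, each a dict with a `text` string and a `start` time in seconds, and it matches every key point to the segment sharing the most words with it. Both the segment shape and the matching rule are assumptions made for illustration, not the project's confirmed method.

```python
import re


def format_timestamp(seconds):
    """Render seconds as M:SS, or H:MM:SS once an hour is reached."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def word_set(text):
    """Set of lowercased words and numbers in the text."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


def attach_timestamps(key_points, segments):
    """Pair each key point with the start time of its best-matching segment."""
    if not segments:
        return []
    results = []
    for point in key_points:
        point_words = word_set(point)
        # max() returns the earliest segment when several overlap equally.
        best = max(segments, key=lambda seg: len(point_words & word_set(seg["text"])))
        results.append((format_timestamp(best["start"]), point))
    return results
```

The resulting (time-stamp, key point) pairs can then be rendered in the UI next to the video.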
Users search for and read many kinds of text on the Internet, such as articles, case studies,
documents, emails, stories, and messages.
This text is often so long that it is hard to read,
and reading it in full consumes a large amount of time.
MODULES:
● Module 1:
Extracting input from the input panel or the YouTube link panel
● Module 2:
Youtube-Transcript-API
● Module 3:
Summarization using BART
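The glue between the modules above might look like the following sketch, which uses only the Python standard library. `extract_video_id` pulls the video ID out of a pasted YouTube link (the value a transcript lookup needs), and `chunk_words` splits a long transcript into pieces before summarization, since BART checkpoints accept only a limited input length. Both function names, the supported URL shapes, and the 400-word default are assumptions for illustration; a word count is only a rough proxy for the model's token limit.

```python
from urllib.parse import parse_qs, urlparse


def extract_video_id(url):
    """Return the video ID from a common YouTube URL shape, or None."""
    parsed = urlparse(url.strip())
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    if host == "youtu.be":
        # Short links carry the ID as the first path component.
        return parsed.path.lstrip("/").split("/")[0] or None
    if host in ("youtube.com", "m.youtube.com"):
        if parsed.path == "/watch":
            return parse_qs(parsed.query).get("v", [None])[0]
        for prefix in ("/embed/", "/shorts/"):
            if parsed.path.startswith(prefix):
                return parsed.path[len(prefix):].split("/")[0] or None
    return None


def chunk_words(text, max_words=400):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


if __name__ == "__main__":
    # Hypothetical link used only to demonstrate the call.
    print(extract_video_id("https://www.youtube.com/watch?v=abc123XYZ_-&t=10s"))
    print(chunk_words("one two three four five", max_words=2))
```

Links without a scheme (for example a bare `youtu.be/...`) are not recognized by this sketch and yield `None`, so the input panel would need to validate or normalize them first.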
HARDWARE AND SOFTWARE REQUIREMENTS:
Software Requirements:
Spyder
Google Colaboratory
Text Editor: Notepad++
Front End: HTML and CSS
Python
Hardware Requirements: