Classification of Holy Quran Translation

This document summarizes techniques for feature reduction when classifying text documents with neural networks. It discusses removing stop words, stemming words to their roots, and using term frequency to assign weights and select important features. Term frequency counts how often each word occurs and treats more frequent words as more significant features for classification. Feature reduction of this kind can enhance the effectiveness of a neural network classifier by removing irrelevant features from its input, and can lead to a better understanding of the classification learning process.

J. Eng. Applied Sci., 13 (12): 4468-4475, 2018

Fig. 2: String to word vector filter

methods of feature reduction should be applied to remove the irrelevant features from the initial feature set in order to enhance the effectiveness of the NN classifier by increasing the susceptibility of this classifier: feeding only these attributes to the classifier leads to a better understanding of the learning process of classifying verses. One method of feature reduction is feature selection, which is performed by applying a feature weighting scheme.

The feature weighting scheme applied in this research is the Term Frequency (TF) technique (Babu et al., 2014). It is used to rank the features in the initial feature vector and then choose a number of the highest-scoring features as a new feature subset (the vector representation of the verses), which is considered to hold the distinctive attributes for classifying the verses. These feature vectors are then used to train the neural network. Based on the outcomes of the experiments conducted in this study, and according to the best result attained in classifying the verses, the following tasks in the string to word vector filter have been adjusted as illustrated in Fig. 2.

Tokenizer: Tokenization is the process of dividing a sequence of text in the verses into words and phrases, based on the n-gram technique.

Lower case tokens: All the word tokens are converted to lowercase before being added to the feature set. Based on the experiments, this normalization technique improved the classification accuracy (Patki and Kelkar, 2013; Uysal and Gunal, 2014).

Stop words handler: Many words in the verses are repeated frequently and essentially do not carry any information (Aggarwal and Zhai, 2012). Their presence during verse classification therefore hinders a proper understanding of the content of the verses.

Stemmer: The aim of stemming is to reduce words to their roots, which can then be easily used to differentiate the words.

Term Frequency (TF) transform: This technique is the feature selection method responsible for generating the new feature set from the initial feature set. It depends on the word frequencies and calculates the weight of each word based on Eq. 1:

(1)

where f_ij is the frequency of word i in verse j (instance). According to the significance of the terms in the verses, this process assigns a weight to each term that indicates the relative importance of the term in a verse, and this weight depends on the word frequency in the verses. Therefore, the most repeated words in a document (those whose term frequency is high) are regarded as more significant in that document (Wang et al., 2012). If a word (other than a stop word) appears frequently within a particular category, the word should be considered a feature or discriminator of this category. For example, "Fasting" or "Ramadan" frequently appears in the Fasting category.
