Debapriya Sengupta, Goutam Saha - Identification of The Major Language Families of India and Evaluation of Their Mutual Influence 2016
Debapriya Sengupta, Goutam Saha - Identification of The Major Language Families of India and Evaluation of Their Mutual Influence 2016
influence
Author(s): Debapriya Sengupta and Goutam Saha
Source: Current Science , 25 February 2016, Vol. 110, No. 4 (25 February 2016), pp. 667-
681
Published by: Current Science Association
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected].
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
is collaborating with JSTOR to digitize, preserve and extend access to Current Science
20 ms,
cepstral coefficient (SFCC) is a newer method passed
and is through Hamming window and then its
power
known to work well in speech recognition. SDC hasspectral
been density (PSD) is calculated. The resulting
signal
successfully used for language identification. GMM is passed
is through the mel scale filter bank. Loga
rithm of
extensively used in speaker or language identification the output of the mel filter bank is taken and its
for
discretetool,
modelling purpose. SVM is a powerful classification cosine transform (DCT) is evaluated. These con
particularly suitable for two-class problems. stitute the mel coefficients". We have used frame size of
20 ms
The rest of the article is arranged as follows: the nextwith an overlap of 10 ms. The number of filter
banks is 20.
section gives a brief description of the feature extraction
and modelling techniques that have been used in the
study. Next, the database and experimentalSpeech set-upsignal-based
are frequency cepstral coefficients:
described. The results of the experiments are SFCC
thenis apre
frequency warping technique based purely on
sented and discussed, followed by conclusion. the properties of the acoustic speech signal. The filter
bank width depends on the data. Train data are divided
Feature extraction
Frequency (Hz)-»
Figure
Figure 1.1.Mel
Melfrequency
frequencycepstral
cepstral
coefficient
coefficient
filterfilter
bank.bank. Figure 3. SFCC filter
filter bank.
bank.
% 1
RASTA filter
V ZZ
SDC feature extraction
r
Forty-nine SDC coefficients
r
Seven MFCC/SFCC
coefficients
Fifty-six MFCC/SFCC +
SDC coefficients
Figure
Figure 4. Steps
4. Steps
required
required
to derive
toMFCC/SFCC
derive MFCC/SFCC
+ shifted delta +
coefficient
shifted (SDC)
delta
features
coefficient
of speech.
(SDC) features of speech
into frames of 20 ms with 10 ms overlap and Figure 4 shows the steps required to get MFCC/
passed
through Hamming window. PSD of each frame SFCC +is calcu
SDC features from speech. SDC is discussed in
lated and averaged over all frames, and itsthe literature713'14.
logarithm is
computed. Average energy is computed by summing up
the log PSD and dividing it by the number of filter banks.
Modelling
We have used 20 filter banks for our experiments. The
upper cut-offs for each filter bank are chosen to be such
For modelling the feature vectors, we have used GMM
that the log energy of the filter banks is equal to the aver
and SVM.
age energy12. The rest of the procedure is the same as that
used in case of MFCC. Figure 2 shows the steps involved
Gaussian mixture models: In GMM, given the feature
in computing SFCC filter banks, whereas Figure 3 shows
the SFCC filter bank.
vector of train data, the aim is to estimate the parameters
of the GMM A, which best represents the distribution of
the feature vectors. A Gaussian mixture density is a
Shifted delta coefficients: Delta cepstral features are
weighted sum of M component densities. It can be repre
computed across multiple speech frames and stacked
sented by
together to give shifted delta cepstral features. Four
parameters are used to specify these features: N, d, P and M
Experimental set-up
Database preparation
mA
mn
Feature
extraction
■H MAP adaptation—► m
Speech input
mk,
GMM
GMMSupervector
Supervector
Figure
Figure 5. The
5. hyperplane
The hyperplane
which ensures
which
maximum
ensures
separation
Figure maximum
Figure6.6.be
Formation
Formation
of of separation
Gaussian
Gaussian
mixture
mixture
modelmodelbe
(GMM)(GMM)
super super
tween the two classes. vectors.
670
CURRENT SCIENCE, VOL. 110, NO. 4, 25 FEBRUARY 2016
There are various reasons for which we have selected tion of silence and unwanted voices are deleted as is done
this website as our data source. The quality of speech is for training speech. From these segments, we select 100
good as it contains very little noise. Speech is sufficiently utterances for each family and for each duration. These
loud and is of sufficiently long duration. A large number come from all the languages belonging to the particular
of speakers (news readers) are available in each language. family. For example, since Dravidian family has 4 lan
This reduces speaker bias. Both male and female speakers guages, there are 25 utterances from each language and
are present in almost all the available languages, ensuring Indo-European family has nearly 6 utterances from each
little or no gender bias. Speech covers a large variety of language. These 25 or 6 utterances from each language
topics that helps capture details of acoustic information. are equally divided between male and female speakers
To prepare the database, we have downloaded all news (where exact equal division is not possible, nearly equal
files that are available in the 22 languages mentioned ear division is taken). This is how one test set is prepared.
lier (18 Indo-European and 4 Dravidian languages). We have prepared three such test sets to verify our re
After listening to each of the news files, we have rejected sults. These three test sets are again mutually exclusive,
those of poor quality. Almost all news bulletins have a i.e. the 100 utterances contained in set A are different
short music of 10 sec at the beginning and the end. These from the 100 utterances in sets B or C. Figure 7 shows
have been removed in our experimental database. The the structure of train and test sets.
speech is originally in ,mp3 format. It is converted to
.wav format to enable further processing.
Influence of Dravidian family on Indo-European
languages
Language family identification
Two languages spoken in the neighbouring regions may
We train the systems to identify speech from each of the be influenced by each other. We have tried to test this
two families (Indo-European and Dravidian). Then we experimentally. Also, the extent of influence can be
test them by giving speech samples from both the fami indirectly measured from the accuracy of correct identifi
lies as input and find the percentage of correct identifica cation.
tion. Since languages belonging to the same family sound
similar (have similar acoustic features), it is expected that
Training: For each family, 4 h of speech is taken for a system trained with a language family can identify
training. This 4 h contains speech from all the languages whether a language belongs to that family or not even if
belonging to the families, i.e. Indo-European training set that language is not used during training of the system.
contains news files from all the 18 languages in equal Based on this idea, we trained the systems by leaving out
proportion. Since each news file is nearly of 10 min dura one of the languages of Indo-European family. Then
tion, we have taken two files from each language on an when we test these systems with the language left out,
average. Similarly, since Dravidian family has four lan languages having more Dravidian influence are expected
guages, eight files from each language have been taken to give less identification accuracy compared to those
on an average. This is done so that the systems do not get having less Dravidian influence.
biased towards any particular language. Also, duration of
male and female speech is nearly equal in each family to Influence on neighbouring Indo-European languages:
reduce gender bias. After the train data are arranged in Indo-European languages are spoken by most of Central
this format, each speech file is broken into segments of and North Indian states. Dravidian languages are spoken
30 sec duration. Each segment is listened to and segments in the southern states of Karnataka (Kannada), Kerala
containing music, long duration of silence or unwanted (Malayalam), Tamil Nadu (Tamil) and Andhra Pradesh
sounds are deleted. At the end, we have two sets of (Telugu). The neighbouring Indo-European speaking
speech files, one for each family, each having 4 h of states are Chhattisgarh, Goa, Maharashtra and Odisha
speech segmented into 30 sec duration. This is the speaking Chhattisgarhi, Konkani, Marathi and Odia
training set. respectively. The neighbouring languages are expected to
have more Dravidian influence than non-neighbouring
Testing: This is done for three different utterance dura languages. We tested the influence of Dravidian languages
tions, short (3 sec), medium (10 sec) and long (30 sec). on each of the neighbouring languages. Figure 8 shows
For each utterance duration, we have 100 test utterances. the Dravidian speaking states and their neighbouring
The train and test speaker sets are mutually exclusive. Indo-European speaking states.
News files from test speakers are chosen taking care of
the fact that the number of male and female-speaking Training: In order to test the influence of Dravidian
files is nearly equal. These news files are then broken in family on neighbouring Indo-European languages, we
to 3 sec, 10 sec and 30 sec segments. Music, long dura have prepared a train dataset which is similar to that
Testing
Training Separate set of speakers
Lang 2
Lang 1
Lang L
3s 3 s 3 s
10 s 10 s 10s
30 s
30 s 30 s
ft t tt t t *
3 s data 10 s data 30 s data
Figure7. 7.
Figure Train
Train and and test dataset.
test dataset.
Testing: The test set contains speech of only the left out
language. We select some news files from the left out
8 / V" language keeping nearly equal proportion of male and
female speakers. These files are broken into segments of
v
/w' 1) Tamil Nadu 3 sec, 10 sec and 30 sec duration. We select 100 segments
2) Kerala from each duration to prepare the test dataset. Similar
3) Karnataka
procedure is carried out for all the four neighbouring
4) Andhra Pradesh
5) Goa Indo-European languages. The only exception is Chhat
6) Maharashtra tisgarhi, where 100 test utterances could not be collected
7) Odisha
for 30 sec utterance duration due to shortage of data. So
8) Chhattisgarh
tests are done with 86 segments.
Dravidian-speaking states
Influence on non-neighbouring Indo-European langua
ges: In order to compare the results of Dravidian influ
ence on neighbouring Indo-European languages, we have
Figure 8. Dravidian-speaking states and their neighbours.
done similar experiments on non-neighbouring Indo
European languages as well. For this, we have randomly
described in the section on 'Training'; but with a little
selected four Indo-European languages which do not have
change. As in the earlier case, we take 4 h of speech for
a Dravidian-speaking neighbour. The languages we
each language family. But here the Indo-European dataset
selected are Bengali, Gujarati, Hindi and Punjabi.
contains 17 languages, unlike 18 in the previous case. We
leave out one neighbouring Indo-European language each Training: The data are prepared in the same way as des
time. For example, to test the influence of Dravidian on cribed earlier. Each system is trained four times, each
Chhattisgarhi, we train the system with all 17 Indo time leaving out one of the four selected Indo-European
European languages, except Chhattisgarhi. So, Chhattis languages and compensating it with other Indo-European
garhi news files are removed from the training set of the languages of equal duration and equal male-female pro
earlier section and replaced by news of any other Indo portion.
Table
Table 1. Language
1. Language
family identification
family identification
accuracy of threeaccuracy
test datasets
ofusing
three
MFCCtest
+ SDCdatasets
+ GMM andusing
SFCC +MFCC + SDC + GMM and S
SDC + GMM
MFCC
MFCC + SDC++ GMM
SDC + GMM SFCC + SDC
SFCC + + GMM
SDC + GMM
Test duration M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128
MFCC
MFCC + SDC+
+ SVM
SDC + SVM SFCC + SDC
SFCC + +
SDCSVM
+ SVM
Test duration M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128
@ 1Q0 100
100
□
03s
3 sBIOs
B10sB30
B30 s
100
++ 90
90
tÎ 80
|70
O
o
a
o
E
S560
60 ■Iii
ill
£ 50
50 j
| 4040
ro
30
e 30
'■£
% 20
20 c
a>
-a
10
Model order 64 Model order 128 Model order 64 Model order 128
□ 3s BIOs B30 s 03
□ 3s
s BBIOs
10 s B30
B30 s
03s ■ 10 s B30s □ 33s
s ■
BIOs
10 s B30
B 30 s
s
t
£
1aI
Set A
A Set B
B Set
Set IC
Set A Set B Set C Set A Set B Set C Set A Set B Set C
Test
Test sets—►
sets—*
Test sets —»
Test sets —* Test sets —>
Model order 64 Model order 128 Model order 64 Model order 128
Figure 9 a d. Graphical representation of language family identification accuracy of three sets of test data using the f
Table
Table 3. Identification
3. Identification
accuracy accuracy
of Indo-European
of Indo-European
languages which arelanguages
neighbouringwhich
to Dravidian
are family,
neighbouring
using MFCCto
+ SDC
Dravidian
+ GMM and
family, using MFCC
SFCC + SDC + GMM
MFCC
MFCC +GMM
+ SDC + SFCC +
SDC + GMM SFCC + SDC + SDC
GMM+ GMM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 42 41 69 72 75 79 32 32 49 41 75 78 78 74 26 28
10 41 37 69 76 90 90 34 39 52 43 78 78 89 87 35 38
30 47.6744 36.0465 63 73 94 95 25 29 48.8372 43.0233 67 69 92 93 23 25
MFCC
MFCC + SDC ++SVM SFCC ++SDC
SDC + SVM SFCC + SDC + SVM
SVM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 38 28 68 67 81 80 61 64 32 27 62 65 80 77 58 51
10 38 37 72 80 91 88 65 73 39 40 80 84 89 90 55 64
30 40.6977 48.8372 63 72 95 95 75 77 45.3488 45.3488 69 77 95 97 60 64
MFCC
MFCC + SDC +
+GMM SFCC +
SDC + GMM SFCC + SDC + SDC
GMM+ GMM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O,
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 89 90 59 63 94 91 81 79 88 87 63 64 90 95 76 74
10 95 95 74 72 98 99 89 88 94 94 77 75 99 100 79 83
30 99 100 64 64 100 100 93 91 100 100 62 64 100 100 75 71
MFCC
MFCC + SDC ++SVM
SDC + SVM SFCC + SDC
SFCC ++SDC
SVM
+ SVM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 76 72 79 77 83 90 74 77 76 79 73 70 83 83 73 65
10 90 91 91 90 94 97 89 88 87 91 87 87 98 97 84 82
30 98 95 90 92 98 99 98 94 93 95 84 81 99 99 89 83
110 s 130 s
□ 3s BIOs ■ 30 s (g) 03 s H 10 s »30 s □ 3s BIOs B30 s
/ </
J* ** if #
SBP
Languages
Languages —»—» Languages —»
—* Languages
Languages » Languages•
Languages■
Model order 64 Model order 128 Model order 64 Model order 128
© <2)
□
□ 3s
3s ■BIOs
10 s B
■ 30
30 s □
□ 3s
3s BBIOs
10 s B
■ 30
30 s □ 3s
3s BIOs
B10s B30 s
130s □ 3s
□ 3s BIOs
B10sB30
«30s
s
O"
Figure 10 a d. Graphical representation of identification accuracy of Indo-European languages which are neighbouring to D
ing the four systems.
□ 3s ■ 110s
10 s ■
B30
30 s 03s
03s ■
BIOs
10 s «30
130 s □ 3s 30 s □
□ 3s
3s ■
■ 10
10s
s 130
130ss
100
100 100
90
| 90
90
BO 80
70
_ 80
70
£ 70
60
60 5" 60
ra
50
50 3 50
0
40
40
CO 40
30
30
.§ 30
20
20
S 20
10
0
10
1 10
0
I 0
& ê & s #
M # # #
<$>
Languages
Languages —>
•rj
<toe■"V
^
Languages »•
: Languages
Languages a» Languages
Languages ■
Model order 64 Model order 128 Model
Model order
order
64 64 Model Model order
order 128
128
MFCC SDC + GMM SFCC + SDC + GMM
90
| 90
80 ^ 80
£
70 & 70
60
§ 60
50 3 50
40 ra 40
30 § 30
20
10 10
0 0
#J-
### > > ,#
V<?>
&UjS»<?
-
Languages
Languages — — Languages
Languages *> Languages Languages
Languages *
Model order 64 Model order 128 Model order 64 Model order 128
Figure 11 a-d. Graphical representation of identification accuracy of Indo-European languages which are non-neighb
family, using the four systems.
Table
Table 7.
7. Identification
Identificationaccuracy
accuracy
of of
Dravidian
Dravidian
languages
languages
which
which
are neighbouring
are neighbouring
to Indo-European
to Indo-European
family, family,
using theusing
four
MFCC
MFCC + SDC + GMM
+ SDC + GMM SFCC
SFCC ++SDC
SDC + GMM MFCC
+ GMM + +SDC
MFCC + SVM
SDC + SVM SFCCSFCC
+ SDC + +SVM
+ SDC SVM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 40 44 42 59 54 51 53 51 77 69 54 58 78 77 49 55
10 43 47 52 56 55 47 54 56 83 76 42 42 88 84 45 41
30 42 50 54 57 64 57 52 52 85 83 42 38 90 88 39 34
□ 3s
□ 3 s ■ 10 s 130 s Bios B30s
□ 3s BIOs
&
5"
2
9
8
CO
c
a>
T)
Kannada
Kannada Telugu
Telugu Kannada Telugu Kannada Telugu Kannada
Kannada Telugu
Telugu
Languages
Languages - Languages
Languages » Languages *
» Languages >
Model order 64 Model order 128 Model order 64 Model order 128
Figure 12 a-d. Graphical representation of identification accuracy of Dravidian languages which are neighbouring to Indo-European family, us
ing the four systems.
Unlike Kerala, Tamil Nadu was ruled mostly by Indian Malayalam, which is in accordance with historical evi
rulers like the Pallavas, the Rashtrakutas, the Cholas and
dence.
the Pandyas. So colonial influence is not expected inTelugu is the only Dravidian language which consis
Tamil. The Tamil results show better accuracy thantently gives low accuracy. This is because Andhra Pradesh
Table
Table8. 8.
Identification
Identification
accuracy
accuracy
of Dravidian
of Dravidian
languages which
languages
are non-neighbouring
which are non-neighbouring
to Indo-European family,
to using
Indo-European
the four systems
family, using th
MFCC
MFCC + SDC + GMM
+ SDC + GMM SFCC
SFCC ++SDC
SDC + GMM MFCC
+ GMM + +SDC
MFCC + SVM
SDC + SVM SFCCSFCC
+ SDC + +SVM
+ SDC SVM
Test duration M.O. M.O. M.O M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O. M.O.
(sec) 64 128 64 128 64 128 64 128 64 128 64 128 64 128 64 128
3 65 65 73 76 58 61 67 73 80 81 77 80 70 79 79 79
10 59 63 79 82 69 60 76 85 77 72 79 82 74 71 81 81
30 66 74 87 95 73 63 80 86 76 76 77 86 76 71 83 83
Malayalam
MalayalamTamil Tamil Malayalam Tamil
Malayalam Tamil Malayalam
MalayalamTamil
Tamil
Languages
Languages
—* —* Languages—*■ Languages *
> Languages
Languages >:
Model order 64 Model order 128 Model order 64 Model order 128
MFCC + SDC + GMM SFCC + SDC + GMM
□ 3s 110 s 130 s
100 □
□ 3s
3s■B10s
10 s
I 90
• 80
>. 70
o
| 60
8 50
(C
c 40
o
130
| 20
-o 10
Malayalam
Malayalam Tamil
Tamil Malayalam
Malayalam Tamil
Tamil Malayalam
Malayalam Tamil
Tamil Malayalam
MalayalamTamil
Tamil
Languages
Languages —— Languages Languages
Languages —*
—> Languages—>
Model
Model order 64
order 64 ModelModel
orderorder128
128 Model order 64 Model order 128
Figure
Figure13
13a-d.
a d.
Graphical
Graphical
representation
representation
of identification
of identification
accuracy accuracy
of Dravidian
of languages
Dravidianwhich
languages
are non-neighbouring
which are non-neighbouring
to Indo-European family,
to Indo-Eur
using
usingthe
thefour systems.
four systems.
princely
was under Mughal rule from 14th to 18th century. During state of Hyderabad. This heralded an era of Pe
this time, Telugu was highly influenced by Urdu. In the
sian/Arabic influence on Telugu30. Figure 14 shows tha
latter half of the 17th century, Mughal rule Telugu
extended has descended from the Proto-South-Central
further south culminating in the establishment of thebranch, whereas the other three Dravidian
Dravidian
CURRENT SCIENCE, VOL. 110, NO. 4, 25 FEBRUARY 2016 679
Conclusion
languages have descended from Proto-South-Dravidian
branch3. This separates out Telugu from the other three
Dravidian languages. This study uses machine learning for automatic recogn
tion of
A look at all the systems indicates that Kannada the major Indian language families. MFCC and
results
show significant improvement when SVM SFCC in conjunction
is used. For with SDC have been used as featur
the rest of the languages, results do not showextraction tools. For modelling of feature vectors, GMM
much varia
tion from system to system. and SVM are used. It is found that all the four system
can identify the language families with high accuracy
We have evaluated the influence of one language famil
Analysis
on the other. We see that in most of the cases the neig
bouring languages are influenced more by the other family
There are certain notable facts in the results. Marathi, in
Also, among the neighbouring languages, some show
spite of being a neighbouring Indo-European language,
higher or lower influence than others. This can be linked
shows high accuracy indicating low Dravidian influence.
to certain known historical facts. The work opens new
This can be possibly due to non-porosity among the
scope of study which will enable us to know our histor
border states. Chhattisgarhi, Odia and Telugu give con better.
sistent low results. A look at the geographical location of
the three states speaking these languages shows that they
1. Ishtiaq, M., Language Shifts Among the Scheduled Tribes in India:
are neighbouring states lying on the eastern region of
A Geographical Study, Motilal Banarsidass Publ., 1999.
India. There should be some specific historical reason 2. CIA World Factbook; https://www.cia.gov.
behind this high intermingling of languages in these re 3. Ethnologue: Languages of the World; http://www.ethnologue.com
gions apart from the ones stated above, which is yet to be 4. Encyclopedia Britannica; http://www.britannica.com
found out. 5. Zissman, M. A., Automatic language identification of telephon
speech. Lincoln Lab. J., 1995, 8(2), 115-144.
Overall accuracy of non-neighbouring Indo-European
6. Torres-Carrasquillo, P. A., Reynolds, D. A. and Deller Jr, J. R.
languages is more than overall accuracy of non Language identification using Gaussian mixture model tokeniz
neighbouring Dravidian languages. One reason for thistion. In International Conference Spoken Language Processing,
could be the geographical extent of the Indo-EuropeanDenver, Colorado, United States, September 2002.
7. Torres-Carrasquillo, P. A., Singer, E., Kohler, M. A., Green
speaking states. For example, Punjabi or Bengali (even
R. J., Reynolds, D. A. and Deller Jr, J. R.. Approaches to languag
Hindi in a few cases) is spoken in states which are far
identification using Gaussian mixture models and shifted delt
away from the Dravidian states. So it is unlikely that cepstral features. International Conference Spoken Languag
Dravidian influence would reach these states. Whereas Processing, Denver, Colorado, United States, 2002.
even the farthest Dravidian-speaking states are geographi
8. Campbell, W. M., Campbell, J. P., Reynolds, D. A., Singer, E. an
cally more closer to the Indo-European-speaking states. Torres-Carrasquillo, P. A., Support vector machines for speaker
and language recognition. Computer Speech and Language, 18. Reynolds, D. A., Quatieri, T. F. and Dunn, R. B., Speaker verifica
Elsevier, 2006, pp. 210-229. tion using adapted Gaussian mixture models. Digital Signal Proc
9. Li, H., Ma, B. and Lee, K. A., Spoken language recognition: ess.,
from2000,10(1-3), 19-41.
fundamentals to practice. Spoken Language Recogn., Proc. IEEE,Bharati; http://newsonair.nic.in
19. Prasar
December 2012. 20. Majumdar, R. C., Raychaudhuri, H. C. and Datta, K., An
10. Davis, S. B. and Mermelstein, P., Comparison of parametric repre
Advanced History of India, Macmillan, 1946.
sentations for monosyllabic word recognition in continuously
21. http://orissa.gov.in
spo
ken sentences. IEEE Trans. Acoust., Speech Signal Process.,
22.1980,
http://chhattisgarh.nic.in
28(4), 357-366. 23. Diffie, B. W. and Winius, G. D., Foundations of the Portuguese
11. Vergin, R., Shaughnessy, D. O. and Farhat, A., Generalized Empire,
mel 1415-1850, University of Minnesota Press, Minnesota
frequency cepstral coefficients for large-vocabulary speaker Archive Editions, 1977.
independent continuous-speech recognition. IEEE Trans. 24. Shastry, B. S. and Borges, C. J., Goa-Kanara Portuguese Rela
Speech
Audio Process., 1999, 7(5), 525-532. tions, 1498-1763, The Xavier Centre of Historical Research,
2000.
12. Paliwal, K., Shannon, B., Lyons, J. and Wojcicki, K., Speech
signal-based frequency warping. IEEE Signal Process. Lett.,
25. 2009,
http://www.goakonkaniakademi.org
16(4), 319-322. 26. http://www.mu.ac.in
27. Ravindran,
13. Matejka, P., Bürget, L., Schwarz, P. and Cernocky, J., Brno Uni P. N.. Black Pepper: Piper Nigrum, CRC Press,
versity of Technology System for NIST 2005 Language Recogni 2004.