Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network
91-95
ABSTRACT
About 300 million people in India speak Hindi and write in the Devnagari script. Research in Optical Character
Recognition (OCR) is popular because of its application potential in banks, post offices, defense organizations,
library automation, etc. However, most available OCR systems target European texts. In this paper, we propose
an OCR technique for printed Devnagari script in five different fonts and sizes using an Artificial Neural
Network. The recognition rate of the proposed OCR system on image documents of Devnagari script has been
found to be quite high.
Keywords: OCR, Preprocessing, Segmentation, Feature Extraction, Classification, ANN, Skew Detection and Correction
including height, width, density, loops, lines, stems and other character traits. Feature extraction is a
perfect approach for OCR of magazines, laser print and high quality images.

Structural Analysis: Structural analysis identifies characters by examining their sub-features: the shape of
the image and its sub-vertical and horizontal histograms. Its character repair capability is well suited to
low quality text and newsprint.

Neural Networks: This strategy simulates the way the human neural system works. It samples the pixels in
each image and matches them to a known index of character pixel patterns. The ability to recognize
characters through abstraction is well suited to faxed documents and damaged text. Neural networks are ideal
for specific types of problems, such as processing stock market data or finding trends in graphical patterns.

Fig. 1: Diagrammatic Structure of the OCR System.

2.2. Stages in Design of OCR Systems
The various stages of OCR system design are given in figure 2.

2.3. Reasons for Poor Performance of OCR Systems
Existing OCR systems generally show poor performance on documents such as old books (print and paper
quality inferior due to aging), copied materials (photocopies or faxed documents, where print quality is
inferior to the original) and newspapers (generally printed on low quality paper).

For such degraded documents, the recognition accuracy of the system comes down to 80-90%. If we want to use
the OCR system in the banking and corporate sectors, this accuracy rate is not up to the mark.

Devnagari is the most popular script for writing Hindi and, with minor modifications, the Sanskrit, Marathi,
Sindhi and Nepali languages.

3.1. Preprocessing
In the proposed OCR system, text digitization is done by a flatbed scanner with a resolution between 100 and
600 dpi. The digitized images are usually in gray tone, and for a clear document, a simple histogram-based
threshold approach is sufficient for converting them to two-tone images. The histogram of gray values of the
pixels shows two prominent peaks, and a middle gray value located between the peaks is a good choice for
the threshold.

For salt and pepper noise we generally use a median filter. A median filter replaces the value of a pixel by
the median of the gray levels in the neighborhood of that pixel (the original value of the pixel is included
in the computation of the median). Median filters provide excellent noise reduction capabilities.
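The two preprocessing steps of Section 3.1 can be illustrated with a short sketch. This is not the authors' implementation: it assumes an 8-bit gray image held in a NumPy array, a 3x3 neighborhood for the median filter, and that the dark and bright histogram peaks fall in the lower and upper halves of the gray range. The function names are hypothetical.

```python
import numpy as np

def two_peak_threshold(gray):
    """Pick a gray value midway between the two most prominent histogram peaks."""
    hist = np.bincount(gray.ravel(), minlength=256)
    # Assumption: the ink peak lies in [0, 127] and the paper peak in [128, 255].
    dark_peak = int(np.argmax(hist[:128]))
    bright_peak = 128 + int(np.argmax(hist[128:]))
    return (dark_peak + bright_peak) // 2

def binarize(gray, threshold):
    """Return a two-tone image: 1 for ink (dark) pixels, 0 for background."""
    return (gray < threshold).astype(np.uint8)

def median_filter_3x3(gray):
    """Replace each pixel by the median of its 3x3 neighborhood (pixel included)."""
    padded = np.pad(gray, 1, mode="edge")
    h, w = gray.shape
    # Stack the nine shifted copies of the image and take the median across them.
    windows = np.stack([padded[r:r + h, c:c + w]
                        for r in range(3) for c in range(3)])
    return np.median(windows, axis=0).astype(gray.dtype)
```

In this sketch the median filter is applied before thresholding, so that isolated salt and pepper pixels do not survive into the two-tone image. A 3x3 window also erases strokes that are only one pixel wide, so the window size would have to be matched to the scanning resolution.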
3.3. Classification
Classification is performed based on the extracted features. Here we use the ANN approach.

For initial classification of characters, we consider three features as follows:
• Mean distance;
• Histogram of projection based on spatial position of pixel;
• Histogram of projection based on pixel value.

ANN Approach for Classification: An Artificial Neural Network approach has been used for classification and
recognition. It is a computational model widely used in situations where the problem is complex and the data
is subject to statistical variation. The training and recognition phases of the ANN have been performed using
the conventional back propagation algorithm with two hidden layers. The architecture of a neural network
determines how the network transforms its input into output. This transformation can be viewed as a
computation.

3.4. Feature Extraction
Feature extraction is one of the most important steps in developing a classification system. This step
describes the various features selected by us for classification of the selected characters.

Classification based on the above three features is shown in figures 9(a), 9(b) and 9(c).

Fig 9(b): Output of Mean Feature Vector of Kha
Fig 9(c): Histogram of KA Rowwise

4. RESULTS AND DISCUSSIONS
The experiments have illustrated that the artificial neural network concept can be applied successfully to
solve the Devnagari optical character recognition problem. Many factors affect the performance of an OCR
system for Devnagari script. It is concluded that an input matrix of size 48X57 gives better results than
other choices. The recognition rate of the OCR system on image documents of Devnagari script is quite high,
as shown in the output.

However, other kinds of preprocessing and neural network models may be tested for a better recognition rate
in future research on OCR systems. The character segmentation method incorporated in this paper
could be improved to handle the large variety of touching characters that often occur in images obtained
from inferior-quality documents. The test set used in this experiment consists of 77 characters in five
different fonts. This can be increased for better results. The toughest phase of the experiment is getting
a good set of characters for classification.

5. FUTURE SCOPE OF WORK
Future enhancements to this work include the use of a dictionary of words to correct the output [8]. Using
a dictionary may improve the performance of the OCR system. One can also extend the project to classify
hand-written text. Segmentation of characters in hand-written documents is very complex compared to printed
documents. A multifactorial fuzzy system can be used for segmenting the characters in hand-written documents.

REFERENCES
[1] S. Mori et al., “Historical Review of OCR Research and Development”, Proceedings of the IEEE, 80, no. 7,
July 1992, pp. 1029-1058.
[2] A. A. Chaudhary, E. A. S. Ahmad, S. Hossain, C. M. Rahman, “OCR of Bangla Character Using Neural
Network: A Better Approach”, 2nd International Conference on Electrical Engineering (ICEE 2002), Khulna,
Bangladesh.
[3] Utpal Garain and Bidyut B. Chaudhuri, “Segmentation of Touching Character in Printed Devnagari and
Bangla Script Using Fuzzy Multifactorial Analysis”, IEEE Transactions on Systems, Man and Cybernetics -
Part C: Applications and Reviews, 32, November 2002, pp. 449-459.
[4] B. B. Chaudhuri and U. Pal, “OCR Error Detection and Correction of an Inflectional Indian Language
Script”, Pattern Recognition, 1996, Proceedings of the 13th International Conference on, IEEE, 25-29 Aug.
1996, 3, pp. 245-249.
[5] Nallasamy Mani and Bala Srinivasan, “Application of Artificial Network Model for Optical Character
Recognition”, Systems, Man and Cybernetics, 1997, “Computational Cybernetics and Simulation”, 1997 IEEE
International Conference on, 12-15 Oct. 1997, 3, pp. 2517-2520.
[6] Veena Bansal and R. M. K. Sinha, “A Complete OCR for Printed Hindi Text in Devnagari Script”, Sixth
International Conference on Document Analysis and Recognition, IEEE Publication, Seattle, USA, 2001,
pp. 800-804.
[7] Veena Bansal and R. M. K. Sinha, “A Devnagari OCR and A Brief Overview of OCR for Indian Script”,
Proc. Symposium on Translation Support Systems (STRANS 2001), Feb. 15-17, 2001, Kanpur, India.
[8] V. Bansal and R. M. K. Sinha, “Partitioning and Searching Dictionary for Correction of Optically Read
Devnagari Character Strings”, Document Analysis and Recognition, 1999, ICDAR '99, Proceedings of the Fifth
International Conference on, 20-22 Sept. 1999, pp. 653-656.
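As an illustration of Section 3.3, the three features and a back propagation network with two hidden layers might be sketched as follows. Several details are assumptions, since the paper does not define them: "mean distance" is taken as the mean distance of ink pixels from their centroid, the "projection based on pixel value" is taken as a gray-level histogram, and the sigmoid units, squared-error loss, hidden layer sizes and learning rate are illustrative choices. All function names are hypothetical.

```python
import numpy as np

def mean_distance(binary):
    """Mean distance of ink pixels from their centroid (an assumed definition)."""
    rows, cols = np.nonzero(binary)
    if rows.size == 0:
        return 0.0
    return float(np.hypot(rows - rows.mean(), cols - cols.mean()).mean())

def feature_vector(gray, threshold=128, value_bins=8):
    """Concatenate the three feature groups, each scaled to the range [0, 1]."""
    binary = (gray < threshold).astype(np.uint8)   # 1 = ink, 0 = background
    height, width = binary.shape
    dist = mean_distance(binary) / np.hypot(height, width)
    row_proj = binary.sum(axis=1) / width          # row-wise ink counts
    col_proj = binary.sum(axis=0) / height         # column-wise ink counts
    # Assumed reading of "projection based on pixel value": gray-level histogram.
    value_hist, _ = np.histogram(gray, bins=value_bins, range=(0, 256))
    return np.concatenate([[dist], row_proj, col_proj, value_hist / gray.size])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, weights, biases):
    """Return the activations of every layer, input included."""
    acts = [X]
    for W, b in zip(weights, biases):
        acts.append(sigmoid(acts[-1] @ W + b))
    return acts

def train(X, Y, hidden=(16, 16), lr=0.5, epochs=3000, seed=0):
    """Full-batch back propagation on squared error with two hidden layers."""
    rng = np.random.default_rng(seed)
    sizes = [X.shape[1], *hidden, Y.shape[1]]
    weights = [rng.normal(0.0, 0.5, (m, n)) for m, n in zip(sizes[:-1], sizes[1:])]
    biases = [np.zeros(n) for n in sizes[1:]]
    for _ in range(epochs):
        acts = forward(X, weights, biases)
        # Output-layer error term; a * (1 - a) is the sigmoid derivative.
        delta = (acts[-1] - Y) * acts[-1] * (1.0 - acts[-1])
        for i in reversed(range(len(weights))):
            grad_W = acts[i].T @ delta / len(X)
            grad_b = delta.mean(axis=0)
            if i > 0:  # propagate the error before this layer's weights change
                delta = (delta @ weights[i].T) * acts[i] * (1.0 - acts[i])
            weights[i] -= lr * grad_W
            biases[i] -= lr * grad_b
    return weights, biases

def mse(X, Y, weights, biases):
    """Mean squared error of the network output against the targets."""
    return float(np.mean((forward(X, weights, biases)[-1] - Y) ** 2))
```

Each row of `X` would be the `feature_vector` of one character image normalized to the 48X57 input size reported in Section 4, and each row of `Y` a one-hot label. The predicted class is the index of the largest output activation.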