International Journal of Computer Science & Communication, Vol. 1, No. 1, January-June 2010, pp. 91-95

Optical Character Recognition (OCR) for Printed Devnagari Script Using Artificial Neural Network
Raghuraj Singh (1), C. S. Yadav (2), Prabhat Verma (3), Vibhash Yadav (4)
(1, 3) Department of Computer Science & Engineering, Harcourt Butler Technological Institute, Kanpur-208002, India
(2) Department of Computer Science and Engineering, Noida Institute of Engineering and Technology, Greater Noida-201306, India
(4) Pranveer Singh Institute of Technology, Kanpur-208016, India
Email: [email protected], [email protected], [email protected], [email protected]

ABSTRACT
There are about 300 million people in India who speak Hindi and write in the Devnagari script.
Research in Optical Character Recognition (OCR) is popular for its application potential in
banks, post offices, defense organizations, library automation and similar areas. However, most
of the available OCR systems are for European texts. In this paper, we propose a technique for
an OCR system for five different fonts and sizes of printed Devnagari script using an Artificial
Neural Network. The recognition rate of the proposed OCR system on document images of
Devnagari script has been found to be quite high.
Keywords: OCR, Preprocessing, Segmentation, Feature Extraction, Classification, ANN, Skew Detection and Correction

1. INTRODUCTION

Optical Character Recognition is a process by which we convert a printed document or scanned
page into ASCII characters that a computer can recognize. The document image itself can be
machine printed, handwritten, or a combination of the two. A computer system equipped with
such an OCR system can improve the speed of input operations and reduce some possible human
errors. Recognition of printed characters is itself a challenging problem, since the same
character varies with changes of font and with the introduction of different types of noise.
Differences in font and size make the recognition task difficult if preprocessing, feature
extraction and recognition are not robust. Noise pixels may be introduced when the image is
scanned. Besides, the same font and size may have bold-face characters as well as normal ones;
thus, the width of the stroke is also a factor that affects recognition. Therefore, a good
character recognition approach must eliminate the noise after reading the binary image data,
smooth the image for better recognition, extract features efficiently, train the system and
classify patterns. To date, there is no complete OCR system for printed Devnagari script that
achieves a 100% success rate.

In this paper, we present a scheme to develop a complete OCR system for five different fonts
and sizes of Devnagari characters, so that the system can be used in the banking and corporate
sectors. We have implemented the steps of the OCR system: preprocessing, segmentation, feature
extraction and classification. The preprocessing step is expected to include noise removal and
skew detection and correction. After the features of the segmented characters are found, an
artificial neural network (ANN) [1], [3] and [4] will be used for classification. Efforts have
been made to improve the performance of character recognition using artificial neural network
techniques. The proposed OCR system shall be capable of accepting document images from a file
or directly from a scanner. Recognized characters can also be displayed and edited.

2. DESIGN OF OCR

Various approaches used for the design of OCR systems are discussed below:

Matrix Matching: Matrix matching converts each character into a pattern within a matrix, and
then compares the pattern with an index of known characters. Its recognition is strongest on
monotype and uniform single column pages.

Fuzzy Logic: Fuzzy logic is a multi-valued logic that allows intermediate values to be defined
between conventional evaluations like yes/no, true/false, black/white etc. An attempt is made
to bring a more human-like way of logical thinking into the programming of computers. Fuzzy
logic is used when answers do not have a distinct true or false value and there is uncertainty
involved.

Feature Extraction: This method defines each character by the presence or absence of key
features,
including height, width, density, loops, lines, stems and other character traits. Feature
extraction is a good approach for OCR of magazines, laser print and high quality images.

Structural Analysis: Structural analysis identifies characters by examining their
sub-features: the shape of the image and its sub-vertical and horizontal histograms. Its
character repair capability is well suited to low quality text and newsprint.

Neural Networks: This strategy simulates the way the human neural system works. It samples the
pixels in each image and matches them to a known index of character pixel patterns. The
ability to recognize characters through abstraction is well suited to faxed documents and
damaged text. Neural networks are ideal for specific types of problems, such as processing
stock market data or finding trends in graphical patterns.

2.1. Structure of OCR Systems

A diagrammatic representation of the structure of an OCR system is given in figure 1.

Fig. 1: Diagrammatic Structure of the OCR System.

2.2. Stages in Design of OCR Systems

The various stages of OCR system design are given in figure 2.

Fig 2: Stages in OCR Design

2.3. Reasons for Poor Performance of OCR Systems

Existing OCR systems generally show poor performance for documents such as:
• Old books: print and paper quality are inferior due to aging;
• Copied materials: documents like photocopies or faxed documents, where print quality is
  inferior to the original;
• Newspapers: generally printed on low quality paper.

For such degraded documents, the recognition accuracy of the system comes down to 80-90%. For
use in the banking and corporate sectors, this accuracy rate is not up to the mark.

Devnagari is the most popular script for writing Hindi and, with minor modifications, the
Sanskrit, Marathi, Sindhi and Nepali languages.

3. PROPOSED OCR SYSTEM

The following steps have been followed in the design of the proposed OCR system:
• Preprocessing;
• Segmentation;
• Feature Extraction;
• Classification.

3.1. Preprocessing

In the proposed OCR system, text digitization is done by a flatbed scanner with a resolution
between 100 and 600 dpi. The digitized images are usually in gray tone, and for a clear
document, a simple histogram based threshold approach is sufficient for converting them to
two tone images. The histogram of gray values of the pixels shows two prominent peaks, and a
middle gray value located between the peaks is a good choice for the threshold.

For salt and pepper noise we generally use a median filter. A median filter replaces the value
of a pixel by the median of the gray levels in the neighborhood of that pixel (the original
value of the pixel is included in the computation of the median). Median filters provide
excellent noise reduction capabilities, with considerably less blurring than linear smoothing
filters of similar size, as shown in figure 3 & 4.
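The two preprocessing operations described above can be sketched as follows. This is a minimal illustration in Python with NumPy, not the implementation used in the experiments. The function names are hypothetical, the neighborhood is assumed to be 3x3, and the threshold rule assumes that the ink peak lies in gray levels 0-127 and the paper peak in 128-255.

```python
import numpy as np


def midpoint_threshold(gray):
    """Return a threshold halfway between the dark and bright histogram peaks.

    Assumption: the ink peak lies in gray levels 0-127, the paper peak in 128-255.
    """
    hist = np.bincount(gray.ravel(), minlength=256)
    dark_peak = int(np.argmax(hist[:128]))
    bright_peak = 128 + int(np.argmax(hist[128:]))
    return (dark_peak + bright_peak) // 2


def binarize(gray, threshold):
    """Two tone image: True marks ink (dark) pixels, False marks background."""
    return gray < threshold


def median_filter3(gray):
    """Replace each pixel by the median of its 3x3 neighborhood (itself included)."""
    height, width = gray.shape
    # Repeat border pixels so that every pixel has nine neighbors.
    padded = np.pad(gray, 1, mode="edge")
    windows = [padded[r:r + height, c:c + width] for r in range(3) for c in range(3)]
    return np.median(np.stack(windows), axis=0).astype(gray.dtype)


# Tiny synthetic page: bright paper, one dark stroke, one isolated pepper pixel.
page = np.full((6, 6), 220, dtype=np.uint8)
page[2:4, 1:5] = 30
page[0, 5] = 0

clean = median_filter3(page)
ink = binarize(clean, midpoint_threshold(clean))
```

On a stroke this thin the 3x3 median also trims the stroke ends, so the neighborhood size would have to be chosen relative to the stroke width at the scanning resolution in use.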

Fig 3: Image with Salt and Pepper Noise

Fig 4: Image without Salt and Pepper Noise

A derivative operator enhances edges and other discontinuities (noise) and de-emphasizes areas
with slowly varying gray level values.

3.2. Segmentation

Segmentation is one of the most important phases of an OCR system. By applying good
segmentation techniques we can increase the performance of OCR. Segmentation subdivides an
image into its constituent regions or objects. In segmentation, we try to extract the basic
constituents of the script, which are the characters. This is needed because our classifier
recognizes these characters only.

The segmentation phase is also a significant source of recognition error because of touching
characters, which the classifier cannot properly handle. Even in good quality documents, some
adjacent characters touch each other due to inappropriate scanning resolution. The numbers of
constituent characters touching each other in Devnagari and Bangla scripts are shown in table 1.

Table 1
Constituent Characters Touching Each Other

Script      Touching      An Image of Touching Characters Consists of
            Characters    Two              Three          Four
Devnagari   11577         11183 (96.6%)    394 (3.4%)     nil
Bangla      16714         15277 (91.4%)    953 (5.7%)     484 (2.9%)

To tackle the touching characters in Devnagari documents, we first attempt to identify the
touching characters. Next, they are segmented into their constituent characters using a fuzzy
decision making approach.

Vowel modifiers and basic characters of Devnagari are shown in figures 5 and 6, respectively.

Fig 5: Vowel Modifiers of Devnagari

Fig 6: Basic Characters of Devnagari

As shown in figure 7, in Devnagari script, a text word may be partitioned into three zones.
The upper zone denotes the portion above the headline, the middle zone covers the portion of
basic and compound characters below the headline, and the lower zone is where some vowel and
consonant modifiers can reside. For a large number of characters (basic as well as compound)
there exists a horizontal line at the upper part, called "shirorekha" or headline in Hindi.
The imaginary line separating the middle and lower zones may be called the base line.

Fig 7: Partitioning of a Text Word into Zones

Line, Word and Character Segmentation: Once the text blocks are detected, the OCR system
automatically finds individual text lines, segments the words, and then separates the
characters accurately.

Segmentation of Line: Text lines are detected by horizontal scanning. To segment a line, we
scan the scanned document page horizontally from the top and find the last row containing all
white pixels before a black pixel is found. Then we find the first row containing all white
pixels just after the end of the black pixels. We repeat this process over the entire page to
find all lines.

Segmentation of Words: After finding a particular line, we separate the individual words. This
is done by vertical scanning.

Segmentation of Individual Characters: Once we have the words, we segment them into individual
characters. Before segmenting a word into individual characters, we locate the headline. This
is done by finding the rows having the maximum number of black pixels in the word. After
locating the headline we remove it, i.e. convert it to white pixels. After removing the
headline, the word is divided into three horizontal parts known as the upper zone, middle zone
and lower zone. Individual characters are separated from each zone by applying vertical
scanning.

Output of the segmentation algorithm is shown in figure 8.
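The horizontal and vertical scanning steps described above amount to finding runs of non-empty rows or columns in a projection profile. The following Python/NumPy sketch illustrates that idea on a toy binary image in which True marks a black pixel. All function names are hypothetical, and the fuzzy handling of touching characters and the separate treatment of the three zones are not covered.

```python
import numpy as np


def runs(profile):
    """Return (start, end) pairs, end exclusive, for each run of non-zero entries."""
    spans, start = [], None
    for index, count in enumerate(profile):
        if count and start is None:
            start = index
        elif not count and start is not None:
            spans.append((start, index))
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment_lines(ink):
    """Horizontal scanning: rows with ink, bounded by all-white rows, form a text line."""
    return runs(ink.sum(axis=1))


def segment_columns(ink):
    """Vertical scanning: columns with ink, bounded by all-white columns, form a unit."""
    return runs(ink.sum(axis=0))


def remove_headline(word):
    """Clear the row(s) with the maximum ink count, taken here to be the headline."""
    row_counts = word.sum(axis=1)
    headline_rows = np.flatnonzero(row_counts == row_counts.max())
    body = word.copy()
    body[headline_rows, :] = False
    return headline_rows, body


# Toy page: one word whose two characters are joined by a headline, then a second line.
page = np.zeros((7, 9), dtype=bool)
page[1, 0:7] = True
page[2:4, 1] = True
page[2:4, 5] = True
page[5, 2:4] = True

lines = segment_lines(page)
top, bottom = lines[0]
first_line = page[top:bottom]
words = segment_columns(first_line)
_, body = remove_headline(first_line)
characters = segment_columns(body)
```

Because the headline joins the characters of a word, vertical scanning of a line yields whole words, and the same scan separates characters only after the headline rows have been cleared.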

Fig 8: Output of Segmentation

3.3. Classification

Classification is performed based on the extracted features. Here we are using the ANN
approach. For the initial classification of characters, we consider the following three
features:
• Mean Distance;
• Histogram of projection based on spatial position of pixel;
• Histogram of projection based on pixel value.

Fig 9(a): Output of Vertical Projection of Kha

Fig 9(b): Output of Mean Feature Vector of Kha
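The features listed above are not defined formally in the text, so the following Python/NumPy sketch shows only one possible reading of the first two. It assumes that "mean distance" is the mean Euclidean distance of the black pixels from their centroid, and that the projection histogram based on spatial position is the count of black pixels per row and per column. The histogram based on pixel value is left out because the text does not specify it. All function names are hypothetical.

```python
import numpy as np


def projection_histograms(glyph):
    """Count black pixels in every row and every column of a binary character image."""
    return glyph.sum(axis=1), glyph.sum(axis=0)


def mean_distance(glyph):
    """Mean Euclidean distance of black pixels from their centroid (assumed definition)."""
    ys, xs = np.nonzero(glyph)
    if ys.size == 0:
        return 0.0
    return float(np.hypot(ys - ys.mean(), xs - xs.mean()).mean())


def feature_vector(glyph):
    """Concatenate the mean distance with the row and column projections."""
    rows, cols = projection_histograms(glyph)
    return np.concatenate(([mean_distance(glyph)], rows, cols)).astype(float)


# Toy 3x3 "plus" shaped character; True marks a black pixel.
glyph = np.array([[0, 1, 0],
                  [1, 1, 1],
                  [0, 1, 0]], dtype=bool)
features = feature_vector(glyph)
```

For the vector length to be the same for every character, as a neural network input layer requires, each segmented character would first have to be scaled to a fixed matrix size.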
ANN Approach for Classification: The Artificial Neural Network approach has been used for
classification and recognition. It is a computational model widely used in situations where
the problem is complex and the data is subject to statistical variation. The training and
recognition phases of the ANN have been performed using the conventional back propagation
algorithm with two hidden layers. The architecture of a neural network determines how the
network transforms its input into output. This transfer can be viewed as a computation.

3.4. Feature Extraction

Feature extraction is one of the most important steps in developing a classification system.
This step describes the various features selected by us for classification of the selected
characters.

Classification based on the above three features is shown in figure 9(a), 9(b) & 9(c).

Fig 9(c): Histogram of KA Rowwise

4. RESULTS AND DISCUSSIONS

The experiments have illustrated that the artificial neural network concept can be applied
successfully to solve the Devnagari optical character recognition problem. There are many
factors that affect the performance of an OCR system for Devnagari script. It is concluded
that an input matrix of size 48X57 gives better results than other choices. The recognition
rate of the OCR system on document images of Devnagari script is quite high, as shown in the
output.

However, other kinds of preprocessing and neural network models may be tested for a better
recognition rate in future research on OCR systems. The character segmentation method
incorporated in this paper
could be improved to handle a large variety of touching characters that often occur in images
obtained from inferior-quality documents. The test set used in this experiment consists of 77
characters in five different fonts. This can be increased for better results. The toughest
phase in the experiment is getting a good set of characters for classification.

5. FUTURE SCOPE OF WORK

Future enhancements to this work include the use of a dictionary of words to correct the
output [8]. Using a dictionary may improve the performance of the OCR system. One can also
extend the project to classify handwritten text. Segmentation of characters in handwritten
documents is very complex compared to printed documents. A multi-factorial fuzzy system can be
used for segmenting the characters in handwritten documents.

REFERENCES

[1] S. Mori et al., “Historical Review of OCR Research and Development”, Proceedings of the
    IEEE, 80, no. 7, pp. 1029-1058, July 1992.
[2] A. A. Chaudhary, E.A.S. Ahmad, S. Hossain, C. M. Rahman, “OCR of Bangla Character Using
    Neural Network: A better Approach”, 2nd International Conference on Electrical
    Engineering (ICEE 2002), Khulna, Bangladesh.
[3] Utpal Garain and Bidyut B. Chaudhary, “Segmentation of Touching Character in Printed
    Devnagari and Bangla Script Using Fuzzy Multi factorial Analysis”, IEEE Transaction on
    System, Man and Cybernetics - Part C: Applications and Reviews, 32, November 2002,
    pp. 449-459.
[4] B. B. Chaudhary and U. Pal, “OCR Error Detection and Correction of an Inflectional Indian
    Language Script”, Pattern Recognition 1996, IEEE Proceeding of 13th International
    Conference on 25-29 Aug., 3, 1996, pp. 245-249.
[5] Nallasamy Mani and Bala Srinivasan, “Application of Artificial Network Model for Optical
    Character Recognition”, System, Man and Cybernetics, 1997, “Computational Cybernetics and
    Simulation”, 1997 IEEE International Conference on 12-15 Oct. 1997, 3, pp. 2517-2520.
[6] Veena Bansal and R.M.K. Sinha, “A Complete OCR for Printed Hindi Text in Devnagari
    Script”, Sixth International Conference on Document Analysis and Recognition, IEEE
    Publication, Seattle USA, 2001, pp. 800-804.
[7] Veena Bansal and R.M.K. Sinha, “A Devnagari OCR and A Brief Overview of OCR for Indian
    Script”, PROC Symposium on Transaction support System (STRANS 2001), Feb. 15-17, 2001,
    Kanpur, India.
[8] Bansal, V., Sinha, R.M.K., “Partitioning and Searching Dictionary for Correction of
    Optically Read Devnagari Character Strings”, Document Analysis and Recognition, 1999.
    ICDAR’99, Proceedings of the Fifth International Conference on 20-22 Sept. 1999,
    pp. 653-656.
