OCR for printed Kannada text to machine editable format using database
approach
Authors:
Sagar B M, Bangalore University
G. Shobha, Rashtreeya Vidyalaya College of Engineering
Ramakanth Kumar P., Rashtreeya Vidyalaya College of Engineering
All content following this page was uploaded by Sagar B M on 25 October 2016.
Abstract: This paper describes an Optical Character Recognition (OCR) system for printed text
documents in Kannada, a South Indian language. The proposed system can handle all types of
Kannada characters. It first extracts an image of the Kannada script, segments the image into
lines, and then segments the words into sub-character-level pieces. A database approach is used
for character recognition. The reported accuracy is 100%.
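The line segmentation step mentioned in the abstract can be illustrated with a short sketch. The segmentation method itself is not described in this excerpt, so the code below assumes a common technique, a horizontal projection profile, in which a text line is any run of consecutive pixel rows that contain at least one ON pixel. The function name `segment_lines` and the 0/1 list-of-lists image format are hypothetical choices made for illustration.

```python
from typing import List, Tuple


def segment_lines(image: List[List[int]]) -> List[Tuple[int, int]]:
    """Split a binary page image into text lines.

    `image` is a list of pixel rows, where 1 marks an ON (ink) pixel.
    Returns (start_row, end_row) pairs with end_row exclusive.
    Assumption: text lines are separated by at least one blank row.
    """
    lines: List[Tuple[int, int]] = []
    start = None  # first row of the line currently being scanned
    for y, row in enumerate(image):
        has_ink = any(row)
        if has_ink and start is None:
            start = y                    # a new text line begins
        elif not has_ink and start is not None:
            lines.append((start, y))     # a blank row closes the line
            start = None
    if start is not None:                # the line runs to the bottom edge
        lines.append((start, len(image)))
    return lines


page = [
    [0, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 0, 0],
    [0, 0, 1],
]
line_spans = segment_lines(page)
```

The same scan applied to the columns of a single line image would give a rough split into words or smaller pieces, although sub-character segmentation of Kannada script generally needs more than blank-gap detection.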
6. Experimental Results
Figure 4.1 shows the input to the system; once recognition is started, the
output appears at the bottom.
Because character recognition uses a database approach, we get 100%
accuracy. The limitation of this approach is that each character needs a
stored record containing its ASCII value, name, BMP image, width, length,
and total number of ON pixels in the image. This takes a lot of space, and
recognizing a character involves a lot of computation.
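A minimal sketch of what a lookup against such a character database might look like is given below. It is an illustration under stated assumptions, not the authors' implementation: the record fields follow the list in the text (code value, name, bitmap, width, length, ON-pixel count), "length" is interpreted as glyph height in pixels, and the names `CharRecord`, `make_record`, and `recognize`, as well as the toy glyphs, are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple

Bitmap = Tuple[Tuple[int, ...], ...]  # pixel rows, 1 = ON pixel


@dataclass(frozen=True)
class CharRecord:
    code: int        # stored character code
    name: str        # character name
    bitmap: Bitmap   # reference image of the character
    width: int       # columns in the bitmap
    length: int      # rows in the bitmap (assumed meaning of "length")
    on_pixels: int   # total number of ON pixels


def make_record(code: int, name: str, rows: Sequence[Sequence[int]]) -> CharRecord:
    """Build a database record from a non-empty rectangular 0/1 bitmap."""
    bitmap = tuple(tuple(r) for r in rows)
    return CharRecord(code, name, bitmap, len(bitmap[0]), len(bitmap),
                      sum(sum(r) for r in bitmap))


def recognize(glyph: Sequence[Sequence[int]],
              database: List[CharRecord]) -> Optional[CharRecord]:
    """Return the record whose stored bitmap equals the glyph, else None."""
    bitmap = tuple(tuple(r) for r in glyph)
    if not bitmap or not bitmap[0]:
        return None
    key = (len(bitmap[0]), len(bitmap), sum(sum(r) for r in bitmap))
    for rec in database:
        # Cheap filters first: width, length and ON-pixel count must agree.
        if (rec.width, rec.length, rec.on_pixels) != key:
            continue
        # Full pixel-by-pixel comparison only for the surviving candidates.
        if rec.bitmap == bitmap:
            return rec
    return None


# Toy 3x3 glyphs standing in for real character images.
db = [
    make_record(65, "toy-A", [[0, 1, 0], [1, 1, 1], [1, 0, 1]]),
    make_record(66, "toy-B", [[1, 1, 0], [1, 1, 1], [1, 1, 0]]),
]
match = recognize([[0, 1, 0], [1, 1, 1], [1, 0, 1]], db)
miss = recognize([[1, 1, 1], [1, 1, 1], [1, 0, 1]], db)
```

Exact matching of this kind shows both sides of the trade-off described above: a glyph that is in the database is always identified correctly, but every font, size, and character form needs its own stored record, and each lookup scans the stored records.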
Figure 4.1: Interface for viewing and editing documents in Kannada.

8. Conclusion & future work
In this paper, we have presented a database approach for recognizing
Kannada characters. Kannada is a widely used language in South India, and
many applications need a Kannada OCR that gives 100% accuracy. The database
approach delivers the required accuracy, subject to the limitation described
above. Recognition can also be carried out using neural networks or support
vector machines, but not with the required accuracy; a dictionary approach
can be used to increase the accuracy.

Reference:
[1] R Sanjeev Kunte and R D Sudhaker Samuel, "A simple and efficient optical
character recognition system for basic symbols in printed Kannada text"

[2] Bharath A and Sriganesh Madhvanath, "Hidden Markov Models for Online
Handwritten Tamil Word Recognition", HP Laboratories India, HPL-2007-108,
July 6, 2007

[3] T V Ashwin and P S Sastry, "A font and size-independent OCR system for
printed Kannada documents using support vector machines", Department of
Electrical Engineering, Indian Institute of Science, Bangalore 560 012, India

VTU. His research interests are Pattern Recognition. He has guided more than
25 undergraduate projects. He has presented and published papers at national
and international conferences.

Dr. Shobha G., Professor of Computer Science & Engg. She was awarded a Ph.D.
for her thesis titled “Knowledge Discovery in Transactional Database
Systems” from Mangalore University, Mangalore. She obtained her M.S. degree
in Software Systems from BITS, Pilani and her BE in Computer Science from
Gulbarga University. Her research interests are Data Mining, DBMS, and
Operating Systems & Networking. She has guided more than 30 undergraduate
and 9 postgraduate projects.