OCR Model
Bachelor of Engineering
In
Computer Science and Engineering
Proposed By
Raj Singh
1. Text Extraction:
Aim: The primary goal is to accurately extract text from images or scanned documents.
Objectives: Develop algorithms that can identify and recognize characters, words, and
sentences within images with high precision and recall.
2. Accuracy Improvement:
Aim: Enhance the overall accuracy of OCR by minimizing errors in character recognition.
Objectives: Employ advanced machine learning and deep learning techniques to improve the
model's ability to correctly identify characters, even in challenging scenarios such as distorted
text or low-quality images (a minimal sketch of such a character-recognition network is given
after this list).
3. Language Support:
Objectives: Train the OCR model to recognize and process text in various languages, ensuring a
broader application range and accessibility for users globally.
4. Document Structure Preservation:
Objectives: Implement features that understand the organization of text in documents, including
headers, footers, paragraphs, tables, and other structural elements, to maintain document
integrity during the OCR process.
5. Continuous Improvement:
Objectives: Establish mechanisms for continuous learning and improvement, incorporating user
feedback and updating the model with new data to adapt to evolving patterns and challenges.
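As a concrete illustration of objective 2, the following is a minimal sketch of the kind of deep-learning model that could be used for character recognition: a small convolutional network that classifies 32x32 grayscale character images into character classes. PyTorch, the layer sizes, and the class count are illustrative assumptions, not the project's final design.

import torch
import torch.nn as nn

class CharacterCNN(nn.Module):
    """Toy convolutional classifier for single character images."""
    def __init__(self, num_classes: int = 62):  # e.g. digits + upper/lower-case letters (assumption)
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(64 * 8 * 8, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)                  # (batch, 64, 8, 8) for 32x32 inputs
        return self.classifier(x.flatten(1))  # class scores for each character image

# Usage: a batch of 16 grayscale 32x32 character crops.
scores = CharacterCNN()(torch.randn(16, 1, 32, 32))
print(scores.shape)  # torch.Size([16, 62])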
BACKGROUND STUDY
Deep learning solutions have taken the world by storm, and organizations of all kinds, from tech
giants and established companies to startups, are now trying to incorporate deep learning (DL)
and machine learning (ML) into their workflows. One such solution that has gained considerable
popularity over the past few years is the OCR engine.
OCR (Optical Character Recognition) is a technique for reading textual information directly
from digital and scanned documents without any human intervention. These documents can be in
any format, such as PDF, PNG, JPEG, or TIFF. Using OCR systems offers several advantages:
It increases productivity, as it takes far less time to process (extract information from)
the documents.
It saves resources, since an OCR program does the work and no manual effort is required.
It eliminates the need for manual data entry.
The chances of error are reduced.
Extracting information from digital documents is relatively easy, because they carry metadata
and an embedded text layer that provide the text directly, as sketched below. For scanned copies,
however, a different solution is required, since metadata does not help there. This is where deep
learning comes in, providing solutions for extracting text information from images.
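A minimal sketch of reading the embedded text of a digital (non-scanned) document, where no OCR is needed; the pypdf package and the file name are illustrative assumptions:

from pypdf import PdfReader

# Open a born-digital PDF; its text layer can be read directly without OCR.
reader = PdfReader("digital_document.pdf")  # hypothetical file name
for page in reader.pages:
    print(page.extract_text())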
This report discusses lessons for building a deep learning-based OCR model, so that common
issues faced during the development and deployment of such use cases can be avoided.
OCR has become very popular nowadays and has been adopted by several industries for faster
reading of text data from images. While techniques such as contour detection, image
classification, and connected component analysis work for documents with comparable text size
and font, ideal lighting conditions, and good image quality, they are not effective for irregular,
heterogeneous text, often called wild text or scene text. Such text may come from a car's license
plate, a house number plate, poorly scanned documents (with no predefined conditions), and so
on. For these cases, deep learning solutions are used. Using DL for OCR is typically a three-step
process: detecting the regions of an image that contain text, recognizing the characters within
each detected region, and post-processing the recognized text into a structured output. A minimal
pipeline sketch follows.
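A minimal sketch of such a deep-learning OCR pipeline, using the open-source EasyOCR library as one possible choice (the library and the input image are assumptions; the synopsis does not name a specific tool for this step):

import easyocr

# The Reader loads pretrained detection and recognition networks for the
# requested languages (which also illustrates the multi-language objective).
reader = easyocr.Reader(['en'])

# readtext runs text detection followed by text recognition and returns
# (bounding_box, recognized_text, confidence) for every detected region.
results = reader.readtext("license_plate.jpg")  # hypothetical input image
for bbox, text, confidence in results:
    print(f"{text} ({confidence:.2f}) at {bbox}")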
METHODOLOGY
In this work, a complete OCR methodology for recognizing historical documents, either printed
or handwritten, without any prior knowledge of the font is presented. The methodology consists
of three steps: the first two create a character database for training from a set of documents,
while the third recognizes new document images.
First, a pre-processing step that includes image binarization and enhancement takes place. In the
second step, a top-down segmentation approach is used to detect text lines, words, and
characters. A clustering scheme is then adopted to group characters of similar shape. This is a
semi-automatic procedure, since the user can interact at any time to correct possible clustering
errors and assign an ASCII label. After this step, a database is created to be used for recognition.
Finally, in the third step, the same segmentation approach is applied to every new document
image, and recognition is based on the character database produced in the previous step.
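A minimal sketch of the top-down line segmentation step, assuming the page has already been binarized with text as white pixels on a black background (the file name and the minimum line height are illustrative assumptions):

import cv2
import numpy as np

# Load a binarized page (text = white, background = black); the file name is hypothetical.
page = cv2.imread("binarized_page.png", cv2.IMREAD_GRAYSCALE)

# Horizontal projection profile: number of text pixels in every row of the page.
row_profile = np.count_nonzero(page > 0, axis=1)

# Runs of non-empty rows form text lines; empty rows separate them.
lines, start = [], None
for y, count in enumerate(row_profile):
    if count > 0 and start is None:
        start = y                           # a text line begins here
    elif count == 0 and start is not None:
        if y - start >= 5:                  # skip very thin noise bands
            lines.append(page[start:y, :])  # crop one text line
        start = None
if start is not None:
    lines.append(page[start:, :])           # line touching the bottom edge

print(f"Detected {len(lines)} text lines")
# The same idea applied to vertical profiles within each line yields words and characters.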
Working:
A scanner reads documents and converts them to binary data. The OCR software analyzes the
scanned image and classifies the light areas as background and the dark areas as text.
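A minimal sketch of that thresholding step, using Otsu's method in OpenCV (the file names are illustrative assumptions):

import cv2

# Read the scanned page as a grayscale image; the file name is hypothetical.
image = cv2.imread("scanned_page.png", cv2.IMREAD_GRAYSCALE)

# Otsu's method picks the threshold automatically; THRESH_BINARY_INV maps the
# dark areas (text) to white and the light areas (background) to black.
_, binary = cv2.threshold(image, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)

cv2.imwrite("binarized_page.png", binary)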
[Flow chart of the methodology]
TOOLS AND TECHNIQUES TO BE USED
PYTHON
BOTO3
AWS
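Since AWS and Boto3 are listed as tools, one possible way to use them is to call Amazon Textract for text detection; the specific service, region, and file name below are assumptions, not choices stated in this synopsis, and valid AWS credentials are assumed to be configured:

import boto3

# Create a Textract client; the region is an illustrative assumption.
textract = boto3.client("textract", region_name="us-east-1")

# Send a scanned page to the service; the file name is hypothetical.
with open("scanned_page.png", "rb") as f:
    response = textract.detect_document_text(Document={"Bytes": f.read()})

# Textract returns BLOCK objects; LINE blocks carry the recognized text lines.
for block in response["Blocks"]:
    if block["BlockType"] == "LINE":
        print(block["Text"])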
PROPOSED WORK
The large Amount of documents, either modern or historical, that we have in our possession
nowadays, due to the expansion of digital libraries, has pointed out the need for reliable and
accurate systems for processing them. Historical documents are of more importance because
they are a significant part of our cultural heritage. During the last decades a lot of research has
been done in the field of Optical Character Recognition (OCR). Numerous commercial
products have been released that convert digitized documents into text files, usually in ASCII
format. Although these products process machine printed documents successfully, when it
comes to handwritten documents the results are not satisfactory enough. Moreover, such
products are unable to process historical documents due to their low quality, lack of standard
alphabets and presence of unknown fonts.To this end, recognition of historical documents is
one of the most challenging tasks in OCR.