0% found this document useful (0 votes)

64 views

Application Example: Photo OCR Problem Description and Pipeline

The document discusses photo optical character recognition (OCR) and methods to improve machine learning models for photo OCR tasks. It describes the photo OCR pipeline, which includes text detection, character segmentation, and character classification. It then discusses using artificial data synthesis and introducing distortions to generate additional training data when the amount of real data is limited. Finally, it provides examples of using ceiling analysis to determine which part of the machine learning pipeline should be focused on for improvement.

Uploaded by

PravinkumarGhodake

0% found this document useful (0 votes)

64 views

Application Example: Photo OCR Problem Description and Pipeline

Uploaded by

PravinkumarGhodake

You are on page 1/ 29

Application example:

Photo OCR
Problem description
and pipeline

Machine Learning
The Photo OCR problem

Andrew Ng
Photo OCR pipeline
1. Text detection

2. Character segmentation

3. Character classification
A N T

Andrew Ng
Photo OCR pipeline

Character Character
Image Text detection
segmentation recognition
Application example:
Photo OCR
Sliding windows

Machine Learning
Text detection Pedestrian detection

Andrew Ng
Supervised learning for pedestrian detection
pixels in 82x36 image patches

Positive examples Negative examples

Andrew Ng
Sliding window detection

Andrew Ng
Text detection

Positive examples Negative examples

Andrew Ng
Text detection

[David Wu] Andrew Ng

1D Sliding window for character segmentation

Positive examples Negative examples

Andrew Ng
Photo OCR pipeline
1. Text detection

2. Character segmentation

3. Character classification
A N T

Andrew Ng
Application example:
Photo OCR
Getting lots of
data: Artificial
data synthesis
Machine Learning
Character recognition

A N T

I Q A

Andrew Ng
Artificial data synthesis for photo OCR

Abcdefg
Abcdefg
Abcdefg
Abcdefg
Abcdefg
Real data

[Adam Coates and Tao Wang] Andrew Ng

Artificial data synthesis for photo OCR

Real data Synthetic data

[Adam Coates and Tao Wang] Andrew Ng

Synthesizing data by introducing distortions

[Adam Coates and Tao Wang] Andrew Ng

Synthesizing data by introducing distortions: Speech recognition

Original audio:

Audio on bad cellphone connection

Noisy background: Crowd

Noisy background: Machinery

[www.pdsounds.org] Andrew Ng
Synthesizing data by introducing distortions
Distortion introduced should be representation of the type of
noise/distortions in the test set.
Audio:
Background noise,
bad cellphone connection
Usually does not help to add purely random/meaningless noise
to your data.
intensity (brightness) of pixel
random noise
[Adam Coates and Tao Wang] Andrew Ng
Discussion on getting more data
1. Make sure you have a low bias classifier before expending the
effort. (Plot learning curves). E.g. keep increasing the number
of features/number of hidden units in neural network until
you have a low bias classifier.
2. “How much work would it be to get 10x as much data as we
currently have?”
- Artificial data synthesis
- Collect/label it yourself
- “Crowd source” (E.g. Amazon Mechanical Turk)

Andrew Ng
Discussion on getting more data
1. Make sure you have a low bias classifier before expending the
effort. (Plot learning curves). E.g. keep increasing the number
of features/number of hidden units in neural network until
you have a low bias classifier.
2. “How much work would it be to get 10x as much data as we
currently have?”
- Artificial data synthesis
- Collect/label it yourself
- “Crowd source” (E.g. Amazon Mechanical Turk)

Andrew Ng
Application example:
Photo OCR
Ceiling analysis: What
part of the pipeline to
work on next
Machine Learning
Estimating the errors due to each component (ceiling analysis)

Character Character
Image Text detection
segmentation recognition

What part of the pipeline should you spend the most time
trying to improve?
Component Accuracy
Overall system 72%
Text detection 89%
Character segmentation 90%
Character recognition 100%
Andrew Ng
Another ceiling analysis example
Face recognition from images
(Artificial example)

Camera Preprocess
image (remove background)

Eyes segmentation

Face detection Nose segmentation Logistic regression Label

Mouth
segmentation

Andrew Ng
Another ceiling analysis example
Camera Preprocess
image (remove background)

Eyes segmentation

Logistic regression Label

Face detection Nose segmentation
Component Accuracy
Mouth Overall system 85%
segmentation Preprocess (remove
85.1%
background)
Face detection 91%
Eyes segmentation 95%
Nose segmentation 96%
Mouth segmentation 97%
Logistic regression 100%
Andrew Ng

Aluminum 6063 T5
100% (1)
Aluminum 6063 T5
3 pages
Applica'on Example: Photo OCR Problem Descrip'on and Pipeline
No ratings yet
Applica'on Example: Photo OCR Problem Descrip'on and Pipeline
29 pages
18: Application Example OCR: Problem Description and Pipeline
No ratings yet
18: Application Example OCR: Problem Description and Pipeline
6 pages
01_Problem_Description_and_Pipeline_7_min
No ratings yet
01_Problem_Description_and_Pipeline_7_min
4 pages
Object Detection and Recognition: Final Project Title
No ratings yet
Object Detection and Recognition: Final Project Title
6 pages
c2390573-3bbf-436a-9f9d-053ac5b9d8cd
No ratings yet
c2390573-3bbf-436a-9f9d-053ac5b9d8cd
30 pages
Project Basket
No ratings yet
Project Basket
388 pages
Pyimagesearch Gurus Syllabus PDF
0% (1)
Pyimagesearch Gurus Syllabus PDF
30 pages
ML Unit V
No ratings yet
ML Unit V
46 pages
Optical Character Recognizer: Team Member
No ratings yet
Optical Character Recognizer: Team Member
7 pages
Mini Project
No ratings yet
Mini Project
30 pages
Theoretical and Practical Analysis On CNN, MTCNN and Caps-Net Base Face Recognition and Detection PDF
No ratings yet
Theoretical and Practical Analysis On CNN, MTCNN and Caps-Net Base Face Recognition and Detection PDF
35 pages
1.Thesis Book Omar
No ratings yet
1.Thesis Book Omar
55 pages
Bachelor of Technology
No ratings yet
Bachelor of Technology
39 pages
Optical Character Recognition Using Convolutional Neural Network
No ratings yet
Optical Character Recognition Using Convolutional Neural Network
5 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
02_Sliding_Windows_15_min
No ratings yet
02_Sliding_Windows_15_min
8 pages
Chapter 1: INTRODUCTION: 1.1 Problem Definition
No ratings yet
Chapter 1: INTRODUCTION: 1.1 Problem Definition
26 pages
Intro Ai Group3
No ratings yet
Intro Ai Group3
35 pages
Research Project
No ratings yet
Research Project
5 pages
Final Report On Facial Emotion Detection Using Machine Learning
No ratings yet
Final Report On Facial Emotion Detection Using Machine Learning
12 pages
Project-Human Emotion Detection
No ratings yet
Project-Human Emotion Detection
28 pages
NullClass Report
No ratings yet
NullClass Report
6 pages
TS Project - Submission
No ratings yet
TS Project - Submission
21 pages
AIA 6600 Module 5
No ratings yet
AIA 6600 Module 5
14 pages
Selected Topics in Computer Science CH
No ratings yet
Selected Topics in Computer Science CH
24 pages
Jatin Shinde ANN MINIPROJECT
No ratings yet
Jatin Shinde ANN MINIPROJECT
13 pages
AI Training2024Haile
No ratings yet
AI Training2024Haile
37 pages
Presentation REPORT Nd
No ratings yet
Presentation REPORT Nd
29 pages
Image Classification Using Cnn
No ratings yet
Image Classification Using Cnn
15 pages
W01 PracticalProblemsProjects
No ratings yet
W01 PracticalProblemsProjects
27 pages
Summary
No ratings yet
Summary
36 pages
Image Summarizer: Seeing Through Machine Using Deep Learning Algorithm
No ratings yet
Image Summarizer: Seeing Through Machine Using Deep Learning Algorithm
7 pages
Dl Mini Project
No ratings yet
Dl Mini Project
26 pages
Helmet and Vehicle License Plate Detection System
No ratings yet
Helmet and Vehicle License Plate Detection System
26 pages
Facial Emotion and Object Detection For Visually Impaired Blind Persons IJERTV10IS090108
No ratings yet
Facial Emotion and Object Detection For Visually Impaired Blind Persons IJERTV10IS090108
4 pages
Autonomous Car
No ratings yet
Autonomous Car
12 pages
Convolutional Neural Networks-CNN PDF
No ratings yet
Convolutional Neural Networks-CNN PDF
95 pages
Choosing and Implementing Hugging Face Models _ by Stephanie Kirmer _ Towards Data Science
No ratings yet
Choosing and Implementing Hugging Face Models _ by Stephanie Kirmer _ Towards Data Science
15 pages
Project Report On Emotion Aware Smart Music Recommended System Using CNN
No ratings yet
Project Report On Emotion Aware Smart Music Recommended System Using CNN
11 pages
Final Project Report
No ratings yet
Final Project Report
18 pages
Machine Learning 600 - Chapter 6
No ratings yet
Machine Learning 600 - Chapter 6
26 pages
Satellite Image Segmentation With Convolutional Neural Networks (CNN)
100% (1)
Satellite Image Segmentation With Convolutional Neural Networks (CNN)
4 pages
Presentation 1
No ratings yet
Presentation 1
14 pages
Paper 1
No ratings yet
Paper 1
13 pages
Final Project Report
50% (2)
Final Project Report
27 pages
Deep Learning Hardware
No ratings yet
Deep Learning Hardware
82 pages
Image Edge Detection Based On Fpga: Sree Vidyanikethan Engineering College
No ratings yet
Image Edge Detection Based On Fpga: Sree Vidyanikethan Engineering College
38 pages
IEEE_Conference_Template__2_ (2)
No ratings yet
IEEE_Conference_Template__2_ (2)
4 pages
Report (78,81,93)
No ratings yet
Report (78,81,93)
68 pages
Thesis_Research_Proposal
No ratings yet
Thesis_Research_Proposal
5 pages
Java SB
No ratings yet
Java SB
84 pages
Mathworks - Yann Debray - GPT-4o
No ratings yet
Mathworks - Yann Debray - GPT-4o
17 pages
Theories, Detection Methods, and Opportunities of Fake News Detection
No ratings yet
Theories, Detection Methods, and Opportunities of Fake News Detection
4 pages
Lab05 ML
No ratings yet
Lab05 ML
7 pages
[email protected]
No ratings yet
[email protected]
4 pages
Facial Emotion Detection: 1) Background/ Problem Statement
No ratings yet
Facial Emotion Detection: 1) Background/ Problem Statement
6 pages
1 AI_Introduction and ML
No ratings yet
1 AI_Introduction and ML
32 pages
Emotion Detection-Final
No ratings yet
Emotion Detection-Final
31 pages
OpenCV 3 Blueprints
From Everand
OpenCV 3 Blueprints
Joseph Howse
No ratings yet
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
From Everand
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
Fouad Sabry
No ratings yet
Tutorial 11a PlyFailure
No ratings yet
Tutorial 11a PlyFailure
15 pages
Large Scale Machine Learning
No ratings yet
Large Scale Machine Learning
24 pages
Support Vector Machines Optimization Objective: Machine Learning
No ratings yet
Support Vector Machines Optimization Objective: Machine Learning
31 pages
Docs Slides Lecture15
No ratings yet
Docs Slides Lecture15
37 pages
Tutorial 4
No ratings yet
Tutorial 4
8 pages
Recommender Systems Problem Formulation: Machine Learning
No ratings yet
Recommender Systems Problem Formulation: Machine Learning
22 pages
Tutorial 5
No ratings yet
Tutorial 5
12 pages
Tutorial 9
No ratings yet
Tutorial 9
6 pages
Tutorial 8 - Bolts
No ratings yet
Tutorial 8 - Bolts
5 pages
Fretting Simulation For Crankshaft-Counterweight Contact: A. Mäntylä and C. Lönnqvist
No ratings yet
Fretting Simulation For Crankshaft-Counterweight Contact: A. Mäntylä and C. Lönnqvist
17 pages
Tutorial 2
No ratings yet
Tutorial 2
12 pages
Tutorial 14 - Importing Implicit Into Explicit
No ratings yet
Tutorial 14 - Importing Implicit Into Explicit
6 pages
Tutorial 22 - Frequency Analysis
No ratings yet
Tutorial 22 - Frequency Analysis
8 pages
Aluminum 6063 T831
No ratings yet
Aluminum 6063 T831
2 pages
Aluminum 6063 T835
No ratings yet
Aluminum 6063 T835
2 pages
Aluminum 6063 T1
No ratings yet
Aluminum 6063 T1
3 pages
Aluminum 6063 T4
No ratings yet
Aluminum 6063 T4
2 pages
Sloshing in A Tank Modelled Using SPH As An Example in Abaqus
100% (1)
Sloshing in A Tank Modelled Using SPH As An Example in Abaqus
11 pages
Aluminum 6061 T8
No ratings yet
Aluminum 6061 T8
2 pages
Aluminum 6063 T83
No ratings yet
Aluminum 6063 T83
2 pages
Modeling Damage
No ratings yet
Modeling Damage
15 pages
Aluminum 6061 T91
No ratings yet
Aluminum 6061 T91
2 pages
Tutorial 6
No ratings yet
Tutorial 6
8 pages
Mapping Ai 2021 v2 PDF
No ratings yet
Mapping Ai 2021 v2 PDF
1 page
SRM Valliammai Engineering College (An Autonomous Institution)
No ratings yet
SRM Valliammai Engineering College (An Autonomous Institution)
12 pages
8.01 Machine Learning Basics
No ratings yet
8.01 Machine Learning Basics
6 pages
DS - NLP
No ratings yet
DS - NLP
39 pages
Technology - Wikipedia
No ratings yet
Technology - Wikipedia
24 pages
A Robot Is A Mechanical or Virtual Artificial Agent That Is
No ratings yet
A Robot Is A Mechanical or Virtual Artificial Agent That Is
7 pages
Emerging Technologies for Academic Libraries in the Digital Age 1st Edition Lili Li (Auth.) - The ebook in PDF/DOCX format is available for instant download
100% (1)
Emerging Technologies for Academic Libraries in the Digital Age 1st Edition Lili Li (Auth.) - The ebook in PDF/DOCX format is available for instant download
52 pages
CSE352 MIDSemAssignment2021-22 EvenSem
No ratings yet
CSE352 MIDSemAssignment2021-22 EvenSem
1 page
1.2.5. Machine Learning With Python Lab
No ratings yet
1.2.5. Machine Learning With Python Lab
2 pages
Animal Classification Using Facial Images With Score-Level Fusion
No ratings yet
Animal Classification Using Facial Images With Score-Level Fusion
7 pages
A Small Intro of AI
No ratings yet
A Small Intro of AI
55 pages
DIP3E - Chapter05 - Art - Image Segmentation
No ratings yet
DIP3E - Chapter05 - Art - Image Segmentation
34 pages
PH 401
No ratings yet
PH 401
9 pages
Ass 3
No ratings yet
Ass 3
2 pages
Department of Computer Science and Engineering: Chettinadtech Dept of Cse
No ratings yet
Department of Computer Science and Engineering: Chettinadtech Dept of Cse
8 pages
Navigation of Mobile Robots Based On Deep Reinforcement Learning: Re-Ward Function Optimization and Knowledge Transfer
No ratings yet
Navigation of Mobile Robots Based On Deep Reinforcement Learning: Re-Ward Function Optimization and Knowledge Transfer
12 pages
Kuka Brochure Iiwa
No ratings yet
Kuka Brochure Iiwa
2 pages
Artificial Intelligence Essay
No ratings yet
Artificial Intelligence Essay
3 pages
Artificial Intelligence Reading Comprehension
0% (1)
Artificial Intelligence Reading Comprehension
4 pages
Efficient Net B0
No ratings yet
Efficient Net B0
4 pages
Indoor Autonomous Drones For Inventory Management: Kaushik Gala
No ratings yet
Indoor Autonomous Drones For Inventory Management: Kaushik Gala
26 pages
SN ML 4gi
No ratings yet
SN ML 4gi
2 pages
Data Science Road Map
No ratings yet
Data Science Road Map
1 page
Difference Between ANN, CNN and RNN
100% (1)
Difference Between ANN, CNN and RNN
5 pages
Artificial Intelligence AI Timeline Infographic
No ratings yet
Artificial Intelligence AI Timeline Infographic
1 page
Accenture Total Enterprise Reinvention
No ratings yet
Accenture Total Enterprise Reinvention
62 pages
Presentation On New Technologies ICT
No ratings yet
Presentation On New Technologies ICT
3 pages
Image Processing With Python
No ratings yet
Image Processing With Python
21 pages
Tugas AI Search Algoritm
No ratings yet
Tugas AI Search Algoritm
8 pages
21AI502 Syllbus
No ratings yet
21AI502 Syllbus
5 pages