0% found this document useful (0 votes)

50 views

MP Final Report

This document is a project report on using machine learning techniques to denoise dirty documents. It describes using random forests and neural networks to clean synthetic noisy images from a dataset. Random forests were able to remove stains but not creases very well. A simple neural network was also tested that takes pixels from the noisy image as input and predicts cleaned pixel values one by one. Various image processing techniques like thresholding and filtering were also experimented with to denoise real world images. The results of different methods are compared based on their root-mean-squared-error scores.

Uploaded by

Trần Công

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

MP Final Report

Uploaded by

Trần Công

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Denoising Dirty Documents

Project Final Report

2017 DS/NC/ESD 863 Machine Perception

Sriveda Reddy Udbhav Vats Simran Dokania

IMT2013047 IMT2013055 IMT2013044
Rishabh Manoj
IMT2013035

{SrivedaReddy.Chevuru, Udbhav.Vats, Simran.Dokania, Rishabh.Manoj

}@iiitb.org
May 15, 2017

Contents

1 Problem Statement 3

2 Motivation 3

3 Dataset 3

4 Methods 3
4.1 Random Forest . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
4.1.1 Challenges Faced . . . . . . . . . . . . . . . . . . . . . . . . . 5
4.2 Neural Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
4.2.1 Challenges Faced . . . . . . . . . . . . . . . . . . . . . . . . . 8

5 Experiments 8
5.1 Fixed Thresholding . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
5.2 Adaptive Thresholding . . . . . . . . . . . . . . . . . . . . . . . . . . 10
5.3 Canny Edge Detection and Morphology . . . . . . . . . . . . . . . . . 12
5.4 Median Filtering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

1
5.5 Random Forest Regression . . . . . . . . . . . . . . . . . . . . . . . . 16
5.6 Artificial Neural Network . . . . . . . . . . . . . . . . . . . . . . . . . 17

6 Conclusion 18

7 Future Work 19

List of Figures
1 Using Random Forest a) . . . . . . . . . . . . . . . . . . . . . . . . . 4
2 Using Random Forest b) . . . . . . . . . . . . . . . . . . . . . . . . . 5
3 Artificial Neural Network [4] . . . . . . . . . . . . . . . . . . . . . . . 6
4 Neural Network a) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
5 Neural Network b) . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
6 Fixed Thresholding on Real World Image a) . . . . . . . . . . . . . . 9
7 Fixed Thresholding on Real World Image b) . . . . . . . . . . . . . . 9
8 Fixed Thresholding on Real World Image c) . . . . . . . . . . . . . . 10
9 Adaptive Thresholding on Real World Image a) . . . . . . . . . . . . 10
10 Adaptive Thresholding on Real World Image b) . . . . . . . . . . . . 11
11 Adaptive Thresholding on Real World Image c) . . . . . . . . . . . . 11
12 Canny Edge Detection and Morphology on Real World Image a) . . . 12
13 Canny Edge Detection and Morphology on Real World Image b) . . . 13
14 Canny Edge Detection and Morphology on Real World Image c) . . . 14
15 Median Filtering on Real World Image a) . . . . . . . . . . . . . . . . 15
16 Median Filtering on Real World Image b) . . . . . . . . . . . . . . . 15
17 Median Filtering on Real World Image c) . . . . . . . . . . . . . . . . 16
18 Artificial Neural Network on Real World Image a) . . . . . . . . . . . 17
19 Artificial Neural Network on Real World Image b) . . . . . . . . . . . 17
20 Artificial Neural Network on Real World Image c) . . . . . . . . . . . 18

List of Tables
1 Table with methods and their RMSE scores . . . . . . . . . . . . . . 19

2
1 Problem Statement
Given a dataset of images of scanned text (synthetic images) that are “noisy” with
stains and wrinkles, we propose to clean up the noise and help with the digitization
process.

2 Motivation
Optical Character Recognition (OCR) is the process of getting typed or handwritten
documents into a digitized format. The motivation of converting to a digitized format
is to ensure security, accessibility, edit-ability and ease of searching and sharing. Also,
digital documents don’t get dirty and cannot be ruined by coffee stains. [2]

Unfortunately, a lot of documents eager for digitization are being held back. Cof-
fee stains, faded sun spots, dog-eared pages, and lot of wrinkles are keeping some
printed documents offline and in the past. We were interested in speeding up this
process and hence chose this topic.

3 Dataset
Kaggle provided a data-set which consists of two sets of images - train and test.
These images contain various styles of text, to which synthetic noise has been added
to simulate real-world, messy documents. The dirty images contain stains as well
as creased paper. The training set also includes the cleaned up images of those
found in the test file (train_cleaned) [2]. By clean, we mean black letters on a white
background.
Additionally, a set of real images were procured, which contained stains and
creases. We tested on these images to check if the algorithms developed using simu-
lated data can be applied on the "real-world" messy documents.
Kaggle calculates the score based on the root-mean-squared-error (RMSE) value
between each pixels of the generated output and the actual cleaned image.

4 Methods
In the midterm progress report [5], we tried out some image processing techniques,
some of which worked well in removing the noises while others were not so efficient.

3
Here, we propose methods that involve Machine Learning and Neural Networks as
theorized in [1] and [3].

4.1 Random Forest

Here we propose a purely machine learning technique without any pre-processing
whatsoever. The basic idea is to use a random forest regressor model to predict the
pixel intensity based on neighbouring pixels.

Algorithm:

• Pad out each image by an extra 2 pixels (i.e.) N xN becomes (N + 2)x(N + 2).

• Run a 3×3 sliding window on the image. Please note that every pixel of the
original image will at least become the center of the sliding window once.

• Use all 9 pixels within the sliding window as predictors for the pixel in the
centre of the sliding window (i.e) All the pixels in the sliding window of the
dirty image acts as a feature to predict the centre pixel of the window for the
cleaned pixel.

• Use a Random Forest regressor model to predict the pixel brightness.

(a) Original Image (b) Cleaned Image

(c) Original Image (d) Cleaned Image

Figure 1: Using Random Forest a)

4
(a) Original Image (b) Cleaned Image

Figure 2: Using Random Forest b)

While this method succeeds in removing the stains [2], it does not work very well
with dog-ears and creases [1], in fact random forest just makes it worse. It looks as
if random forest takes the stain and sprinkle it across the entire image so that the
stains are not concentrated in one particular spot but more milder but widespread.
This, as one can see from the cleaned image, is not conducive for reading and thus
will not help us in our goal of converting to a digitized format for future use.

The RMSE score in Kaggle is 0.32492.

4.1.1 Challenges Faced

Fitting the training data to the model was gigantic task. We initially tried partial
fitting but the results obtained were just random noises. The entire data-set had to
be loaded simultaneously to get at least a proper output. Also training the model
took around half an hour as we were unsure how to use GPU for this computation. To
facilitate easier understanding we opted to go with IPython which is a very powerful
interactive python shell. This helped us in saving the trained models and tracking
variables without re-doing the entire thing.

4.2 Neural Network

We create a simple feed-forward neural network that de-noises one pixel at a time.
This neural network has one hidden layer. Each layer contains a weight matrix W
and a bias vector b and computes the function:

act(input ∗ W + b)
where act is typically some sort of sigmoid function.

The activation function of the input layer is the tanh function, while the activa-
tion function for the hidden layer is the clip function of theano which clips the value

5
based on the given minimum and maximum value (i.e)

1
d e f c l i p ( x , minx , maxx ) :
3 i f ( x < min ) :
r e t u r n minx
5 e l i f ( x > max) :
r e t u r n maxx
7 return x

Figure 3: Artificial Neural Network [4]

The hidden layer contains 10 neurons, the no. of neurons for the input is 29
(which is the no. of feature vectors) and output layers has one neuron which is the
pixel brightness.

Before passing the images to the neural network, we first calculate the features of
the image. We consider neighbouring pixels of the center pixel using a 5x5 window

6
as boundary as features. So for each pixel we have a feature vector containing 25
feature points. Also we do some initial image processing on these image and take the
output as features for the neural network. We use median blur with kernel size 5 and
kernel size 25. Using the Sobel operative we calculate the first and second derivative
of the images. For each pixel of the image, we have 4 image processing outputs, the
median blur with kernel size 5,the median blur with kernel size 25,first sobel derivtive
and second derivative. These are then added to the already existing 25 feature points
making the total to 29 feature points for each pixel. The feature vectors are combines
together to create a feature matrix for the image and given to the neural network.

Central Idea:

• Take a pixel from an image

• Calculate feature vector as mentioned above. It contains a total of 29 feature

points.

• This is the input to Neural Network Model.

• Output is the de-noised pixel (i.e) the intensity of the cleaned pixel.

We train the neural network using a naive gradient descent learning algorithm
with the entire data-set.

(a) Original Image (b) Cleaned Image

(c) Original Image (d) Cleaned Image

Figure 4: Neural Network a)

7
(a) Original Image (b) Cleaned Image

Figure 5: Neural Network b)

As you can see from [4] and [5] the creases are pretty much invisible to the eye
while the stains are faded to the point that only faint patches are visible.
The RMSE score in Kaggle is 0.03363.

4.2.1 Challenges Faced

We could not use the entire training data as our RAM was too small for it. We used
only half the training data for this method. Ideally we should have trained this for at
least 100 iterations(epochs) but due to low computational power we trained it only
for 10 iterations(epochs) which took around 20 minutes in a GPU.

5 Experiments
We experimented these methods and methods mentioned in [5] with "real-world" data
(i.e.) actual images of text paper with stains. The results were pretty varied as you
can see below

8
5.1 Fixed Thresholding

(a) Original Image (b) Cleaned Image

Figure 6: Fixed Thresholding on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 7: Fixed Thresholding on Real World Image b)

9
(a) Original Image (b) Cleaned Image

Figure 8: Fixed Thresholding on Real World Image c)

Fixed Thresholding does not really help us in cleaning stains. As seen in the above fig-
ures, the shadows affect the image and it binarises the image when fixed thresholding
is applied. As for the stains, it completely darkens them making it worse.

5.2 Adaptive Thresholding

(a) Original Image (b) Cleaned Image

Figure 9: Adaptive Thresholding on Real World Image a)

10
(a) Original Image (b) Cleaned Image

Figure 10: Adaptive Thresholding on Real World Image b)

(a) Original Image (b) Cleaned Image

Figure 11: Adaptive Thresholding on Real World Image c)

Adaptive thresholding seems to generate a uniformly noisy image. It neither cleans

the image nor does it improve the readability of the images.

11
5.3 Canny Edge Detection and Morphology

(a) Original Image (b) Cleaned Image using Dilation

(c) Cleaned Image using Erosion

Figure 12: Canny Edge Detection and Morphology on Real World Image a)

12
(a) Original Image (b) Cleaned Image using Dilation

(c) Cleaned Image using Erotion

Figure 13: Canny Edge Detection and Morphology on Real World Image b)

13
(a) Original Image (b) Cleaned Image using Dilation

(c) Cleaned Image using Erosion

Figure 14: Canny Edge Detection and Morphology on Real World Image c)

Canny Edge with morphological operation seems to remove some of the stains but
it either thickens the text or thins them to the point of illegibility. The goal is to
remove the stains and keep the texts as it is.

14
5.4 Median Filtering

(a) Original Image (b) Cleaned Image

Figure 15: Median Filtering on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 16: Median Filtering on Real World Image b)

15
(a) Original Image (b) Cleaned Image

Figure 17: Median Filtering on Real World Image c)

As seen in the figures above, Median Filtering somewhat removes the coffee stains and
rest of the background noise from the document, leaving little noise here and there.
The contrast of the image is also degraded. This may be due to the subtraction of
the background from the original image.

5.5 Random Forest Regression

Unfortunately the regressor model outputs all zeroes when given real world image as
input

16
5.6 Artificial Neural Network

(a) Original Image (b) Cleaned Image

Figure 18: Artificial Neural Network on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 19: Artificial Neural Network on Real World Image b)

17
(a) Original Image (b) Cleaned Image

Figure 20: Artificial Neural Network on Real World Image c)

ANN is able to remove coffee stains easily. While there are some small stains in the
image, they do not affect the readability of the paper. In the original image where
stains cover the text, ANN is successful in removing only the stains in most cases. In
others, the stains along with the text is removed but these are far and few in between.

6 Conclusion
Comparing the results of all the methods listed we find that ANN works the best. It
removes the stains & crevices and it is readable!! While the other methods remove
stains, the text is quite hard to decipher as it is blurred or the ink is too thin. The
RMSE values of the test data using our methods and the original image are listed in
the table below:

18
Methods/Score RMSE (%)
Fixed Thresholding 35.173%
Adaptive Thresholding 42.228%
Canny Edge (Dilation) 51.638%
Canny Edge (Erotion) 36.547%
Median Blur 55.096%
Random Forest Regressor 32.492%
Artificial Neural Network 3.363%

Table 1: Table with methods and their RMSE scores

7 Future Work
The images that we were able to clean are images of English texts. We plan on
expanding this to cover texts in other languages, figures, combination of both facts
and figures.
We have a tentative plan to create an android application that can remove stains and
creases using the above mentioned methods.

References
[1] Colin blog. https://colinpriest.com/2015/09/07/
denoising-dirty-documents-part-6/. Accessed: 2017-05-14.

[2] Kaggle - denoising dirty documents. http://tinyurl.com/z4ukatx. Accessed:

2017-04-30.

[3] Kaggle kernel. https://www.kaggle.com/rdokov/nn-starter-kit. Accessed:

2017-05-14.

[4] Glosser.ca. Artificial neural network. https://commons.wikimedia.org/w/

index.php?curid=24913461. Accessed: 2017-05-10.

[5] Dokania S. Reddy V. Manoj R., Vats U. Denoising dirty documents. http:
//tinyurl.com/mbl4p66, 2017.

Specfem3d Manual
No ratings yet
Specfem3d Manual
90 pages
Learning Multiple Layers of Features from Tiny Images. Alex Krizhevsky
No ratings yet
Learning Multiple Layers of Features from Tiny Images. Alex Krizhevsky
60 pages
FULLTEXT01
No ratings yet
FULLTEXT01
52 pages
As Win Sivam Ravi Kumar
No ratings yet
As Win Sivam Ravi Kumar
23 pages
Classification of Textures Using Convolutional
No ratings yet
Classification of Textures Using Convolutional
30 pages
Photo OCR For Nutrition Labels
No ratings yet
Photo OCR For Nutrition Labels
49 pages
Introduction To Numpy Exercise
No ratings yet
Introduction To Numpy Exercise
24 pages
FULLTEXT01
No ratings yet
FULLTEXT01
88 pages
Michael Dorkenwald Eml2018 Report PDF
No ratings yet
Michael Dorkenwald Eml2018 Report PDF
11 pages
Mandar Sat Vil Kar
No ratings yet
Mandar Sat Vil Kar
23 pages
Thesis
No ratings yet
Thesis
47 pages
Oishee Dey NNFL Project Report
No ratings yet
Oishee Dey NNFL Project Report
20 pages
Improving The Accuracy of 2D On - Road Object Detection Based On Deep Learning Techniques
No ratings yet
Improving The Accuracy of 2D On - Road Object Detection Based On Deep Learning Techniques
69 pages
Java DSP Printer
No ratings yet
Java DSP Printer
32 pages
Ray Tracing Maya Hair and Fur: Examensarbete
No ratings yet
Ray Tracing Maya Hair and Fur: Examensarbete
38 pages
FULLTEXT01
No ratings yet
FULLTEXT01
25 pages
Accelerating XD GRASP MRImage Reconstruction
No ratings yet
Accelerating XD GRASP MRImage Reconstruction
77 pages
Internship Report: Meta-Learning Algorithms For Few-Shot Computer Vision
No ratings yet
Internship Report: Meta-Learning Algorithms For Few-Shot Computer Vision
35 pages
Determining Room Occupancy With Machine Learning Techniques: Daniel Myhrman
No ratings yet
Determining Room Occupancy With Machine Learning Techniques: Daniel Myhrman
54 pages
6140
No ratings yet
6140
53 pages
Demosaising Convolutional Neural Networks Memoire
No ratings yet
Demosaising Convolutional Neural Networks Memoire
63 pages
Enabling Real Time Search of Medical Ima
No ratings yet
Enabling Real Time Search of Medical Ima
137 pages
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
No ratings yet
Solar Power Forecasting With Machine Learning Techniques: Emil Isaksson Mikael Karpe Conde
64 pages
Programmatic Cad 3d Mesh Deep Learning
No ratings yet
Programmatic Cad 3d Mesh Deep Learning
53 pages
Main
No ratings yet
Main
86 pages
CondensedSummaries
No ratings yet
CondensedSummaries
419 pages
Ganimeexplained: Ganime Girl Random Sample
No ratings yet
Ganimeexplained: Ganime Girl Random Sample
5 pages
Mining of Massive Datasets: Jure Leskovec Anand Rajaraman Jeffrey D. Ullman
0% (1)
Mining of Massive Datasets: Jure Leskovec Anand Rajaraman Jeffrey D. Ullman
17 pages
Szakdolgozat Bence MSC
No ratings yet
Szakdolgozat Bence MSC
37 pages
Dimensionality reduction methods review
No ratings yet
Dimensionality reduction methods review
69 pages
Full Text 02
No ratings yet
Full Text 02
62 pages
Just For Fun
No ratings yet
Just For Fun
24 pages
FULLTEXT01
No ratings yet
FULLTEXT01
32 pages
Deep Residual Learning
No ratings yet
Deep Residual Learning
80 pages
A Modelisation Scheme of Uncertainty and Its Application in Motion Detection
No ratings yet
A Modelisation Scheme of Uncertainty and Its Application in Motion Detection
25 pages
Asp Book 1.0.5
No ratings yet
Asp Book 1.0.5
108 pages
Fardapaper-Image-generation-through-feature-extraction-and-learning-Using-a-deep-learning-approach
No ratings yet
Fardapaper-Image-generation-through-feature-extraction-and-learning-Using-a-deep-learning-approach
113 pages
Thesis Philippe Saade
No ratings yet
Thesis Philippe Saade
69 pages
CV2019
No ratings yet
CV2019
152 pages
stationary_objects_detection
No ratings yet
stationary_objects_detection
40 pages
Chemo Spec
No ratings yet
Chemo Spec
36 pages
Leordeanu PHD Thesis
No ratings yet
Leordeanu PHD Thesis
236 pages
Master Inspera
No ratings yet
Master Inspera
45 pages
1905.13750 Sketch2code Generating A Website From A Paper
No ratings yet
1905.13750 Sketch2code Generating A Website From A Paper
64 pages
IOE_Thapathali_Campus_Minor_and_Major_Project_Report_Template__5_-7
No ratings yet
IOE_Thapathali_Campus_Minor_and_Major_Project_Report_Template__5_-7
19 pages
Interactive, Tree-Based Graph Visualization: Andy Pavlo March 17, 2006
No ratings yet
Interactive, Tree-Based Graph Visualization: Andy Pavlo March 17, 2006
72 pages
EE_BSc_Thesis_UWB_SAR_Imaging_Algorithm_Max_Cancrinus
No ratings yet
EE_BSc_Thesis_UWB_SAR_Imaging_Algorithm_Max_Cancrinus
41 pages
BTP Report
No ratings yet
BTP Report
27 pages
Deep Learning for Remote Sensing Images with Open Source Software (Rémi Cresson) (Z-Library)
No ratings yet
Deep Learning for Remote Sensing Images with Open Source Software (Rémi Cresson) (Z-Library)
165 pages
Bryn Lansdown
No ratings yet
Bryn Lansdown
48 pages
Bayesian Variational Recurrent Neural Networks For Prognostics and Health Management of Complex Systems
No ratings yet
Bayesian Variational Recurrent Neural Networks For Prognostics and Health Management of Complex Systems
99 pages
Nguyen Duy
No ratings yet
Nguyen Duy
66 pages
(D. Sundararajan (Auth.) ) Digital Image Processing (B Ok - Xyz) PDF
No ratings yet
(D. Sundararajan (Auth.) ) Digital Image Processing (B Ok - Xyz) PDF
475 pages
Machine Learning The Basics
No ratings yet
Machine Learning The Basics
158 pages
Geometric Deep Learning
No ratings yet
Geometric Deep Learning
50 pages
Course Project Report: Indian Institute of Technology, Kanpur
No ratings yet
Course Project Report: Indian Institute of Technology, Kanpur
15 pages
Klein Berg Book
No ratings yet
Klein Berg Book
459 pages
Open Data Structures: An Introduction
From Everand
Open Data Structures: An Introduction
Pat Morin
4/5 (4)
Handbook of Time Series Analysis: Recent Theoretical Developments and Applications
From Everand
Handbook of Time Series Analysis: Recent Theoretical Developments and Applications
Björn Schelter
No ratings yet
The Satisfiability Problem: Algorithms and Analyses
From Everand
The Satisfiability Problem: Algorithms and Analyses
Uwe Schöning
No ratings yet
Automated Visual Inspection For Bottle Caps Using Fuzzy Logic
No ratings yet
Automated Visual Inspection For Bottle Caps Using Fuzzy Logic
7 pages
Proceedings of The Global Ai Congress 2019 2020
0% (1)
Proceedings of The Global Ai Congress 2019 2020
712 pages
CS401 Computer Graphics PDF
No ratings yet
CS401 Computer Graphics PDF
3 pages
Journal of Computer Science and Informat
No ratings yet
Journal of Computer Science and Informat
192 pages
Object Detection Report
No ratings yet
Object Detection Report
48 pages
Unit3 CV
No ratings yet
Unit3 CV
27 pages
Digital Image Processing - 2 Marks-Questions and Answers
No ratings yet
Digital Image Processing - 2 Marks-Questions and Answers
19 pages
Satish Kumar Kushwaha Dr. Neelesh Jain Shekhar Nigam
No ratings yet
Satish Kumar Kushwaha Dr. Neelesh Jain Shekhar Nigam
8 pages
Digital Image Processing
No ratings yet
Digital Image Processing
5 pages
An Overview of Autonomous Crop Row Navigation Strategies For Unmanned Ground Vehicles
No ratings yet
An Overview of Autonomous Crop Row Navigation Strategies For Unmanned Ground Vehicles
8 pages
RS-Lecture 11-DigitalImageProcessing
No ratings yet
RS-Lecture 11-DigitalImageProcessing
34 pages
Detection of Surface Defects On Ceramic Tiles Base
No ratings yet
Detection of Surface Defects On Ceramic Tiles Base
10 pages
Matlab Report
No ratings yet
Matlab Report
14 pages
basic of computer vision UNIT II
No ratings yet
basic of computer vision UNIT II
29 pages
Amharic Ocr
No ratings yet
Amharic Ocr
62 pages
The Technology of Image Processing Used in Automatic Target-Scoring System
No ratings yet
The Technology of Image Processing Used in Automatic Target-Scoring System
4 pages
IPCV Unit 03
No ratings yet
IPCV Unit 03
9 pages
Drawing Architecture Using Manga Techniques
No ratings yet
Drawing Architecture Using Manga Techniques
10 pages
Study and Comparison of Various Image Ed
No ratings yet
Study and Comparison of Various Image Ed
12 pages
Exam Mid Medical Image Processing 22-23 Solved-1
No ratings yet
Exam Mid Medical Image Processing 22-23 Solved-1
3 pages
A Study of Frei-Chen Approach For Edge Detection: January 2017
No ratings yet
A Study of Frei-Chen Approach For Edge Detection: January 2017
5 pages
Ieee
No ratings yet
Ieee
4 pages
Chapter 9
No ratings yet
Chapter 9
73 pages
Edge Detection
No ratings yet
Edge Detection
25 pages
Facial Features Monitoring For Real Time Drowsiness Detection
No ratings yet
Facial Features Monitoring For Real Time Drowsiness Detection
4 pages
Digital Image Processing Lab
No ratings yet
Digital Image Processing Lab
30 pages
Iris Recognition System Using Statistical Features For Biometric
No ratings yet
Iris Recognition System Using Statistical Features For Biometric
10 pages
Image Processing Project
No ratings yet
Image Processing Project
12 pages
Module 5 Notes
No ratings yet
Module 5 Notes
28 pages
Iris Recognition: Detecting The Pupil
No ratings yet
Iris Recognition: Detecting The Pupil
8 pages

MP Final Report

Uploaded by

MP Final Report

Uploaded by

Denoising Dirty Documents

Project Final Report

Sriveda Reddy Udbhav Vats Simran Dokania

{SrivedaReddy.Chevuru, Udbhav.Vats, Simran.Dokania, Rishabh.Manoj

4.1 Random Forest

• Use a Random Forest regressor model to predict the pixel brightness.

(a) Original Image (b) Cleaned Image

(c) Original Image (d) Cleaned Image

Figure 1: Using Random Forest a)

Figure 2: Using Random Forest b)

The RMSE score in Kaggle is 0.32492.

4.1.1 Challenges Faced

4.2 Neural Network

Figure 3: Artificial Neural Network [4]

• Take a pixel from an image

• Calculate feature vector as mentioned above. It contains a total of 29 feature

• This is the input to Neural Network Model.

(a) Original Image (b) Cleaned Image

(c) Original Image (d) Cleaned Image

Figure 4: Neural Network a)

Figure 5: Neural Network b)

4.2.1 Challenges Faced

(a) Original Image (b) Cleaned Image

Figure 6: Fixed Thresholding on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 7: Fixed Thresholding on Real World Image b)

Figure 8: Fixed Thresholding on Real World Image c)

5.2 Adaptive Thresholding

(a) Original Image (b) Cleaned Image

Figure 9: Adaptive Thresholding on Real World Image a)

Figure 10: Adaptive Thresholding on Real World Image b)

(a) Original Image (b) Cleaned Image

Figure 11: Adaptive Thresholding on Real World Image c)

Adaptive thresholding seems to generate a uniformly noisy image. It neither cleans

(a) Original Image (b) Cleaned Image using Dilation

(c) Cleaned Image using Erosion

(c) Cleaned Image using Erotion

(c) Cleaned Image using Erosion

(a) Original Image (b) Cleaned Image

Figure 15: Median Filtering on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 16: Median Filtering on Real World Image b)

Figure 17: Median Filtering on Real World Image c)

5.5 Random Forest Regression

(a) Original Image (b) Cleaned Image

Figure 18: Artificial Neural Network on Real World Image a)

(a) Original Image (b) Cleaned Image

Figure 19: Artificial Neural Network on Real World Image b)

Figure 20: Artificial Neural Network on Real World Image c)

Table 1: Table with methods and their RMSE scores

[2] Kaggle - denoising dirty documents. http://tinyurl.com/z4ukatx. Accessed:

[3] Kaggle kernel. https://www.kaggle.com/rdokov/nn-starter-kit. Accessed:

[4] Glosser.ca. Artificial neural network. https://commons.wikimedia.org/w/

You might also like