Convolutional Neural Networks in Computer Vision: Jochen Lang
Convolutional Neural Networks in Computer Vision: Jochen Lang
Vision
Jochen Lang
Analyze
errors
Data
Analyze
errors
Adapted from A. Géron, Hands-On ML
Jochen Lang, EECS
[email protected]
Classification of Machine Learning
System
• Mitchell’s definition is based on a task T, an experience
E, and a performance measure P. This can help to
classify machine learning approaches.
• Classify by task T in broad terms
– Classification
• Group the input into a set of different categories,
e.g., image classification: decide if the main
image object is a cat, car, house, person etc.
• We try to learn a function
– Our model maps the input into k categories
with the input vector and a
probably distribution over categories
• FlyingThings3D
• https://lmb.informatik.uni-freiburg.de/resources/
datasets/SceneFlowDatasets.en.html
• Useful for various 3D Vision tasks: optical flow, stereo
matching, segmentation
– CG Images of various scenes
– Synthetic dataset with optical flow, stereo and
segmentation ground truth
– ~39000 stereo frames @ 960x540 px
Mayer et al, A Large Dataset to Train Convolutional Networks for Disparity, Optical
Flow, and Scene Flow Estimation [2016]
• Setup:
– Python with jupyter, matplotlib, numpy, pandas,
scipy, scikit-learn, tensorflow
• scikit-learn, according to http://scikit-learn.org/stable/
– Simple and efficient tools for data mining and data
analysis
– Accessible to everybody, and reusable in various
contexts
– Built on NumPy, SciPy, and matplotlib
– Open source, commercially usable - BSD license
• Precision
– Precision is high if the predictions are correct for
positives but if positives (e.g. detections or
classification of 7 in our example) are missed, it does
not influence the precision.
• Recall (sensitivity)
– Recall is high if true predictions are not missed but if
negatives (e.g. detections or classification of not 7 in
our example) are incorrect, it does not influence the
recall.
better
– false positive rate
Selection by chance
• Multiclass Classification
– Different possible approaches
– Can use multiple binary classifier
• One vs. all (OvA), or one vs the rest, e.g., 10
binary classifiers which make a binary decision if
the given image is of a specific number. The
binary classifier with the highest score or
probability wins.
• One vs. one (OvO), i.e., all possible pairs are
( )
formed, e.g., . The class which
wins most binary classifications is chosen.
matrix, e.g.,