Lecture 1
Lecture 1
* Though it would be more efficient, since the above is LaTeX code automatically
generated character by character with Recurrent Neural Networks (RNNs)
[35] Andrej Karpathy. The Unreasonable Effectiveness of Recurrent Neural Networks." (2015).
Course 6.S094: Lex Fridman: Website: January
References: [35] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Guest Speakers
Open Question:
Is driving closer to chess or to everyday conversation?
Internal
1. Visible-light camera
2. Infrared camera
3. Audio
Scene Understanding:
Where is everyone else?
Movement Planning:
How do I get from A to B?
Driver State:
Whats the driver up to?
Uber nuTonomy
Turing Test:
Can a computer be mistaken for a
human more than 30% of the time?
4. Go to the next example in the training set and repeat steps 2-4 until the Perceptron
makes no more mistakes
In addition, specifying a reward function for self-driving cars raises ethical questions
Course 6.S094: Lex Fridman: Website: January
References: [63, 64] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Deep Learning Breakthroughs: What Changed?
Compute
CPUs, GPUs, ASICs
Organized large(-ish) datasets
Imagenet
Algorithms and research:
Backprop, CNN, LSTM
Software and Infrastructure
Git, ROS, PR2, AWS, Amazon Mechanical Turk, TensorFlow,
Financial backing of large companies
Google, Facebook, Amazon,
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep
convolutional neural networks." Advances in neural information processing systems. 2012.
Course 6.S094: Lex Fridman: Website: January
References: [4] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Pause: Progress on ImageNet
Source: Long et al. Fully Convolutional Networks for Semantic Segmentation. CVPR 2015.
Course 6.S094: Lex Fridman: Website: January
Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Pause: Object Detection
Zhang, Richard, Phillip Isola, and Alexei A. Efros. "Colorful Image Colorization." (2016).
Course 6.S094: Lex Fridman: Website: January
References: [25, 26] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Applications: Automatic Translation of Text in Images
Google Translate
Course 6.S094: Lex Fridman: Website: January
References: [30] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
(Pause) Flavors of Neural Networks
Input:
Output:
Venugopalan et al.
"Sequence to sequence-video to text." 2015.
Code: https://vsubhashini.github.io/s2vt.html
Encoded in the large, highly evolved sensory and motor portions of the human brain is a
billion years of experience about the nature of the world and how to survive in it.
Abstract thought, though, is a new trick, perhaps less than 100 thousand years old. We have
not yet mastered it. It is not all that intrinsically difficult; it just seems so when we do it.
- Hans Moravec, Mind Children (1988)
Challenges:
Snow
Heavy rain
Big open parking lots
Parking garages
Any pedestrian behaving irresponsibly or just unpredictably
Reflections, dynamics blinding ones
Merging into a high-speed stream of oncoming traffic
Course 6.S094: Lex Fridman: Website: January
References: [44] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Google Self-Driving Car: Driver Disengagements
Nguyen et al. "Deep neural networks are easily fooled: High confidence predictions for unrecognizable images." 2015.
Course 6.S094: Lex Fridman: Website: January
References: [67] Deep Learning for Self-Driving Cars [email protected] cars.mit.edu 2017
Robustness:
Fooled by a Little Distortion
The library that most frameworks use for doing the actual
computation
Implements primitive neural network functions in CUDA on
the GPU
Keras.js
GPU Support
Full sized networks
Can use trained Keras models
ConvNetJS
Built by a Andrej Karpathy
Good for explaining neural network concepts
Fun to play around with
Very few requirements
Full CNN, RNN, Deep Q Learning
https://goo.gl/9Xhp2t