AIM301: Deep Learning with TensorFlow, PyTorch, and MXNet on AWS
© 2020, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
Machine learning and deep learning
Summary
Resources
Q&A
Deep learning: ML with deep neural networks
Recommendations, forecasting, image recognition, …
Machine learning: deep neural networks
Results:
• Recommendations
• Forecasts
• Predictions
• Trends and patterns
Challenges with deep learning
Many model architectures – difficult to get started
VGG, ResNet, ResNeXt, DenseNet, SqueezeNet, R-CNN, Faster R-CNN,
SSD, YOLO, Seq2Seq, Transformers, and custom model architecture
Frameworks
Compute, networking, storage
Deep learning on AWS
Amazon SageMaker + deep learning frameworks + infrastructure services =
record-setting performance at low cost
Low cost
SageMaker SDK
Training scripts
…
Two ways to scale deep learning with Amazon SageMaker
1 Bring your own training script (script mode)
2 Bring your own Docker container (BYOC)
Training scripts
Code files
1 Bring your own training script
Code files
AWS Deep Learning Containers
Amazon ECR (container registry)
Amazon SageMaker SDK
Amazon S3
Fully managed
SageMaker cluster
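A minimal sketch of what a script-mode training script might look like. It assumes that hyperparameters arrive as command-line arguments and that SageMaker exposes paths through the SM_MODEL_DIR and SM_CHANNEL_TRAINING environment variables (the second one presumes a channel named "training"). The hyperparameter names, the local fallback paths, and the placeholder training loop are illustrative only, not part of the original session.

```python
import argparse
import json
import os


def parse_args(argv=None):
    """Read hyperparameters from the command line and paths from SM_* variables."""
    parser = argparse.ArgumentParser()
    # Hypothetical hyperparameters, chosen only for illustration.
    parser.add_argument("--epochs", type=int, default=1)
    parser.add_argument("--learning-rate", type=float, default=0.01)
    # Paths provided inside the training container; the "./..." fallbacks
    # are assumptions that let the script run off-cluster.
    parser.add_argument("--model-dir",
                        default=os.environ.get("SM_MODEL_DIR", "./model"))
    parser.add_argument("--train",
                        default=os.environ.get("SM_CHANNEL_TRAINING", "./data"))
    # parse_known_args tolerates extra arguments the script does not declare.
    args, _unknown = parser.parse_known_args(argv)
    return args


def train(args):
    """Placeholder loop standing in for TensorFlow, PyTorch, or MXNet code."""
    os.makedirs(args.model_dir, exist_ok=True)
    history = [{"epoch": e + 1, "lr": args.learning_rate}
               for e in range(args.epochs)]
    # Files written to the model directory are what gets saved as the
    # training job's model artifact.
    out_path = os.path.join(args.model_dir, "history.json")
    with open(out_path, "w") as f:
        json.dump(history, f)
    return out_path
```

On the cluster, the script would end by calling train(parse_args()) under the usual main guard. Locally, the same call can be tried with an explicit list such as parse_args(["--epochs", "3"]).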
2 Bring your own Docker container
Custom container
Amazon S3
Fully managed
SageMaker cluster
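A minimal sketch of the program a custom container might run when SageMaker starts it for training. It assumes the standard /opt/ml directory layout (hyperparameters in input/config/hyperparameters.json, data under input/data/<channel>, model output in model, error details in output/failure). The channel name "training", the "epochs" hyperparameter, and the placeholder training step are hypothetical.

```python
import json
import traceback
from pathlib import Path


def run_training(prefix="/opt/ml"):
    """Run one training job against the SageMaker container directory layout.

    Returns 0 on success and 1 on failure, suitable for use as an exit code.
    """
    root = Path(prefix)
    hp_file = root / "input" / "config" / "hyperparameters.json"
    data_dir = root / "input" / "data" / "training"  # assumed channel name
    model_dir = root / "model"
    failure_file = root / "output" / "failure"
    try:
        # Hyperparameter values arrive as strings, so cast them explicitly.
        hps = json.loads(hp_file.read_text()) if hp_file.exists() else {}
        epochs = int(hps.get("epochs", "1"))  # hypothetical hyperparameter
        files = sorted(p.name for p in data_dir.glob("*")) if data_dir.exists() else []
        model_dir.mkdir(parents=True, exist_ok=True)
        # Placeholder for real training: record what would have been used.
        summary = {"epochs": epochs, "files": files}
        (model_dir / "model.json").write_text(json.dumps(summary))
        return 0
    except Exception:
        # Writing the traceback to output/failure surfaces the error
        # in the training job's failure reason.
        failure_file.parent.mkdir(parents=True, exist_ok=True)
        failure_file.write_text(traceback.format_exc())
        return 1
```

The prefix parameter exists only so the function can be exercised against a temporary directory. Inside the container, the entry point would call run_training() and exit with its return value.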
Large training datasets: What are my options?
TensorFlow, PyTorch, and MXNet
Fully managed and optimized Amazon SageMaker cluster
1 Amazon S3: moderate and large datasets
• File mode: Copy entire dataset to local volume
• Pipe mode: Stream dataset from Amazon S3
2 Amazon EFS: scalable shared file system
• No downloading or streaming
• Share file system with other services
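The options above differ mainly in how each input channel of a training job is described. The sketch below builds channel definitions as plain dictionaries, with field names that follow the InputDataConfig shape of the CreateTrainingJob API as understood here; check them against the API reference before use. The helper names, bucket, file system ID, and paths are hypothetical.

```python
def s3_channel(name, s3_uri, input_mode="File"):
    """Describe an Amazon S3 channel read in File mode or Pipe mode."""
    if input_mode not in ("File", "Pipe"):
        raise ValueError("input_mode must be 'File' or 'Pipe'")
    return {
        "ChannelName": name,
        # "File" copies the dataset to the local volume before training;
        # "Pipe" streams it from Amazon S3 while training runs.
        "InputMode": input_mode,
        "DataSource": {
            "S3DataSource": {
                "S3DataType": "S3Prefix",
                "S3Uri": s3_uri,
                "S3DataDistributionType": "FullyReplicated",
            }
        },
    }


def efs_channel(name, file_system_id, directory_path):
    """Describe an Amazon EFS channel mounted read-only, with no download step."""
    return {
        "ChannelName": name,
        "DataSource": {
            "FileSystemDataSource": {
                "FileSystemId": file_system_id,
                "FileSystemType": "EFS",
                "FileSystemAccessMode": "ro",
                "DirectoryPath": directory_path,
            }
        },
    }


# Hypothetical resources, for illustration only.
channels = [
    s3_channel("training", "s3://example-bucket/train/", input_mode="Pipe"),
    efs_channel("validation", "fs-0123456789abcdef0", "/datasets/val"),
]
```

A list like channels could be passed as the InputDataConfig of a training job request. Nothing here contacts AWS, so the sketch can be inspected locally.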
aws.amazon.com/blogs/machine-learning/deploying-pytorch-models-for-inference-at-scale-using-torchserve/
Apache MXNet on AWS
https://aws.amazon.com/mxnet/
AWS-optimized Apache MXNet
• AWS Deep Learning Containers for training and inference
• AWS Deep Learning AMIs (DLAMI)
GluonCV, GluonTS, GluonNLP
https://aws.amazon.com/blogs/opensource/machine-learning-with-autogluon-an-open-source-automl-library/
Recap: Challenges and solutions
Many model architectures
• TensorFlow, PyTorch, and MXNet offer pretrained models
• Gluon and Keras make it easy to develop custom networks
• Gluon libraries include over 200 pretrained models in CV and NLP
linkedin.com/in/shashankprasanna
medium.com/@shashankprasanna