Deepdive On Amazon Sagemaker and Aws Reinvent New Features
Deepdive On Amazon Sagemaker and Aws Reinvent New Features
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Our Mission at AWS
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The AWS AI/ML Stack
Broadest and most complete set of AI/ML capabilities
AI SERVICES HEALTH AI INDUSTRIAL AI ANOMALY DETECTION CODE AND DEVOPS
VISION SPEECH TEXT SEARCH CHATBOTS PERSONALIZATION FORECASTING FRAUD CONTACT CENTERS
Contact Lens
Amazon Amazon Amazon Amazon Amazon Amazon Amazon Amazon Amazon Amazon Amazon
Rekognition Polly Transcribe Comprehend Translate Textract Kendra Lex Personalize Forecast Fraud Detector Voice ID
+Medical +Medical For Amazon Connect
ML SERVICES
SAGEMAKER STUDIO IDE
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
State of AI / ML
By 2023, spending on AI By the end of 2024, 75% of Driven by advancements in GPUs and ML is now part of mainstream
systems will reach $97.9B, enterprises will shift from compute, availability of data, new DevOps process, not a set of
up 2.5x from $37.5B in 2019 piloting to operationalizing AI algorithms and the cloud specialized, isolated projects
— IDC — Gartner — Gartner
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine learning development can be complex and costly
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker: Built to make ML more accessible
SageMaker Studio IDE
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Overview
Amazon SageMaker
PREPARE BUILD TRAIN & TUNE DEPLOY & MANAGE
SageMaker Ground Truth SageMaker Studio Notebooks Managed Training Managed Deployment
Label training data for machine learning Jupyter notebooks with elastic compute Distributed infrastructure Fully managed, ultra low latency,
and sharing management high throughput
SageMaker Data Wrangler NEW
Aggregate and prepare data for Built-in and Bring SageMaker Experiments Kubernetes & Kubeflow
machine learning your-own Algorithms Capture, organize, and compare Integration
Dozens of optimized algorithms or bring every step Simplify Kubernetes-based
SageMaker Processing your own machine learning
Built-in Python, BYO R/Spark Automatic
Local Mode Model Tuning Multi-Model Endpoints
SageMaker Feature Store NEW Test and prototype on your local machine Hyperparameter optimization Reduce cost by hosting multiple models
Store, update, retrieve, and share features per instance
SageMaker Autopilot Distributed Training
SageMaker Clarify NEW Automatically create machine learning Libraries NEW SageMaker Model Monitor
Detect bias and understand models with full visibility Training for large datasets Maintain accuracy of deployed models
model predictions and models
SageMaker JumpStart NEW SageMaker Edge Manager NEW
Pre-built solutions for common use cases SageMaker Debugger NEW Manage and monitor models on
Debug and profile training runs edge devices
SageMaker Studio
Integrated development environment (IDE) for ML
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Studio
Fully Integrated Development Environment (IDE)
for machine learning
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Studio
Fully Integrated Development Environment (IDE) for machine learning
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker JumpStart
Easily and quickly bring machine learning
applications to market
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
15+ pre-built solutions for common ML use cases
Solutions can be used out-of-the-box or can be customized for a specific
business problem
SageMaker
JumpStart Accelerate time to deploy over 150 open
source models
Use one-click deployable ML models and algorithms from popular
Easily and quickly bring model zoos
machine learning
applications to market Get started with just a few clicks
Easily bring ML applications to market using pre-built solutions, ML models,
and algorithms from popular model zoos, and getting started content
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker
Data Wrangler
Fastest way to prepare data
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
80% of time spent on data prep
Other
Refining algorithms
19% 60%
Building training sets
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
SageMaker Data Wrangler is the fastest way to prepare data
Amazon SageMaker Select & Query Cleanse & Enrich Visualize Understand Operationalize
Data Wrangler Select and query Cleanse and explore data, Graphically understand Use a sample dataset Export data preparation
A faster, visual way data from a variety of perform feature data and detect outliers to quickly estimate workflow to a notebook or
to aggregate and prepare data sources such as engineering with built-in with pre-configured model performance, code script to bring the
data for machine learning Amazon Athena, Amazon data transforms, and visualization templates accuracy, and diagnose workflow into production
Redshift, detect statistical bias potential issues
AWS Lake Formation, and with SageMaker Clarify
Amazon S3
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker
Feature Store
Store, discover, and share features
for machine learning
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Feature drift
Challenges
of separate Feature duplication
feature stores
Slow model
development/deployment
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Feature Store
Store, discover, and share features for machine learning
Real time
inference
Online
Streaming feature
store
Batch
inference
Offline
Batch feature
Amazon SageMaker store
Feature Store Model
training
Store, discover, and share
features for machine
Raw data Feature processing learning Ingest data Store Serve
Data in its original Transform raw data into Move streaming features Online and offline feature Features for real-time and
form that has not meaningful features for or batch features to a central stores maintaining batch applications, and for
been processed better models repository consistency and accuracy model training
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine Learning at Scale
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine Learning at Scale
Balance between ML agility and IT governance can be achieved with SageMaker
ML Builders Cloud IT
and Business Stakeholders and DevOps
Innovate with the speed and agility of AWS Govern and enable with central controls
Self-service access Security
Experiment fast Compliance
Respond quickly to change Operations
Spend management
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ML is an Iterative and Collaborative Process
Automation of
ML production model deployment
“recipe” via CI/CD
=
data flow
pipeline
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Security
Security features to help you meet strict security
SageMaker
Compliance
PCI, HIPAA, SOC 1/2/3, FedRAMP, and ISO
9001/27001/27017/27018
is DevOps ready
ML workflows
Create automated workflows in minutes to support
thousands of models
Scalability
Train complex models with massive datasets
Orchestration
Automatic scheduling and execution of jobs with managed infrastructure
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Pipelines
Managed machine learning CI/CD service
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker Pipelines
Managed machine learning CI/CD service
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
See pipeline execution details and metrics in real-time
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Approve models for production
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Why customers choose Amazon SageMaker
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank You!
Belton Kwong
Solutions Architect
AWS
© 2021, Amazon Web Services, Inc. or its Affiliates. All rights reserved.