Developing and Deploying A Machine Learning Scenario For SAP HANA
Developing and Deploying A Machine Learning Scenario For SAP HANA
Las Vegas
September 24–27, 2019
Christoph Morgen
Frank Gottfried
Barcelona
October 8-10, 2019
Christoph Morgen
Frank Gottfried
Bangalore
November 13-15, 2019
Sathish Hariharan
Suriyanarayanan Balamurugan
The information in this presentation is confidential and proprietary to SAP and may not be disclosed without the permission of SAP.
Except for your obligation to protect confidential information, this presentation is not subject to your license agreement or any other service
or subscription agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or any related
document, or to develop or release any functionality mentioned therein.
This presentation, or any related document and SAP's strategy and possible future developments, products and or platforms directions and
functionality are all subject to change and may be changed by SAP at any time for any reason without notice. The information in this
presentation is not a commitment, promise or legal obligation to deliver any material, code or functionality. This presentation is provided
without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. This presentation is for informational purposes and may not be incorporated into a contract. SAP
assumes no responsibility for errors or omissions in this presentation, except if such damages were caused by SAP’s intentional or gross
negligence.
All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from
expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates,
and they should not be relied upon in making purchasing decisions.
Predicting customer behavior like churn, fraud or Forecasting future sales, demand, cost, etc.
buying behavior (classification) based on historic time related data
(time series forecasting)
Predicting car prices, based on model Analyzing shopping baskets to suggest product
characteristics and market trends (regression) placements or additional purchases to a customer
(association analysis)
Enabling marketers to develop targeted marketing Detecting anomalies in financial transactions for
programs by grouping customers (clustering) fraud analysis, or in machine sensor data for
predictive maintenance (outlier detection)
Provide personalized product recommendations In a given social network, you seek to infer which
by analyzing product associations, individual new interactions among its members are likely to
purchase history and external factors occur in the near future (link analysis / prediction)
(recommender system)
Addresses all key scenarios like Classification, Segmented ML model development and prediction
Regression or Time Series Forecasting (and more) – Supported with all PAL algorithms and scenarios
– All major machine learning scenario on structure data – Like segmented time series forecasting (forecast
can be addressed, within the databases segmented by store, product, etc.)
– Algorithms fast and optimized for in-database execution
Automated cross validation, hyper parameter
Over 90+ classic and trending algorithms selection for key algorithm
– Random decision trees and gradient boosting decision – Model development support and automation, higher
trees outperform in most classification and regression productivity and faster results with best possible and
use cases stable models
High-performance parallel mass prediction, Easy to develop and simple to embed within
real-time transactional speed prediction applications
– Multi-node fastest big data predictions as well as – Supports both expert data scientists and developer
real-time transactional <50ms speed prediction personas
– Simple SQL interface and Python and R client APIs
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 11
SAP HANA machine learning – Automated Predictive Library (APL)
Native in-database automated predictive analytics
SAP HANA embeds the Automated Predictive Library* SAP HANA Platform
– Addresses key scenarios like automated Classification,
Regression or Time Series Forecasting (and more) Automated Predictive Library (APL)
* https://blogs.sap.com/2019/04/23/automate-machine-learning-with-apl-now-part-of-sap-hana-sps04/
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 12
SAP HANA client APIs and external machine learning integration
Leverage open source machine learning with SAP HANA
SAP HANA DataFrame Data Scientist using Python Data Scientist using R
• Stores only references to data in
SAP HANA
• No data transferred to the SAP HANA Core ML API
Python Program R Program • Python / R wrapper for
Python/R process except when
explicitly requested DataFrame ML API DataFrame ML API PAL / APL functions
• Hides SQL statements • Typical Python / R ML
hdbcli for Python RODBC (HDBODBC)
• Remote, in-database like interface
computation in SAP HANA • Processes data based
• Useful for data analysis and on dataframes
HANA HANA
exploration
• Used as implicit input structure to
the Core ML API
Key capabilities
• Allows scripting in Python while instructing remote processing of
data in SAP HANA, leveraging HANA In-database embedded ML
capabilities
• The HANA dataframe object as virtual data reference in Python
for data preprocessing, transformation and analysis
• Exploratory data analysis (EDA) visualization capabilities
• Large set of Predictive Analyis Library (PAL) functions for the Python API for HANA ML
expert Data Scientist, allowing simple conversion of Python native hdbcli (HANA client for Python)
SAP HANA
Predictive
Model
© 2019 SAP SE or an SAP affiliate company. All rights reserved. ǀ PUBLIC 20
Exercise scenario
Scenario - train a PAL classification model to predict customer churn
Select Chapter by clicking at it Select Exercise by clicking at it Navigate to next instruction step by clicking at arrow
Access replays Continue the conversation Check out the latest blogs
Keynotes Read and reply to blog posts See all SAP TechEd blog posts
Live interviews Ask questions Learn from peers and experts
Select lecture sessions Join discussions
http://sapteched.com/online sap.com/community SAP TechEd blog posts
www.sap.com/contactsap