VMware NVIDIA Presentation
#VMworld #SER3052BU
Disclaimer
• This presentation may contain product features that are currently under development.
• This overview of new technology represents no commitment from VMware to deliver these features in any generally available product.
• Features are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.
• Technical feasibility and market demand will affect final delivery.
• Pricing and packaging for any new technologies or features discussed or presented have not been determined.
VMworld 2017 Content: Not for publication or distribution
CONFIDENTIAL
Aligning To Your Strategic Priorities
Strategic priorities: Modernize Data Centers, Integrate Public Clouds, Empower Digital Workspaces, Transform Security
[Diagram labels, top to bottom]
VMware Workspace ONE™: Desktop, Mobile, Identity
Any Application: Traditional Apps, Cloud-Native Apps, SaaS Apps
VMware Cross-Cloud Architecture™
Any Cloud: Private Cloud, Hybrid Cloud, Public Cloud
VMware Cross-Cloud Services™
VMware vRealize® Cloud Management
VMware Cloud Foundation™
Software-Defined Data Center, VMware Cloud Provider Partners
Universal App Platform
vSphere Integrated Containers
Virtualizing HPC, Big Data and ML Workloads with GPUs
Application Compatibility
Near Native Performance
Efficiency
Agility
Resiliency
Security
GPU Compute on vSphere with DirectPath IO – Benefits
VM level QoS
Workload Isolation
Reproducibility
[Diagram labels: VM, GPU, vSphere]
[Diagram labels: CUDA Developer, Data Scientist, VM, GPU, vSphere]
GPU Compute on vSphere with DirectPath IO – HPC and Big Data
[Diagram labels: Researchers, VM, Worker node, GPU, vSphere]
GPU Compute on vSphere with DirectPath IO – Benefits
VM level QoS
Workload Isolation
Reproducibility
What’s Missing?
GPU-accelerated VM Resiliency
GPU QoS
GPU Resource Scheduling
[Diagram labels: VM, GPU, vSphere]
WHY IS SUPPORT FOR COMPUTE WORKLOADS IMPORTANT
GPU ACCELERATED APPLICATIONS
BROAD RANGE OF INDUSTRIES TRANSFORMED
Computational Structural Mechanics
Medical Imaging
Weather and Climate
Visual Computing
Electronic Design Automation
Computational Fluid Dynamics
Computational Finance
Numerical Analytics
Computational Chemistry
Data Science
Defense
Machine Learning
WHAT ARE THE KEY REQUIREMENTS TO CONSIDER
VIRTUALIZING MIXED WORKLOADS
Requirements
Performance
Guaranteed QoS
Insight
Fully Accelerate every Application
[Diagram labels: three Virtual Machines, each with Guest OS, Apps, NVIDIA Driver and a vGPU; Hypervisor with NVIDIA vGPU manager; Server with CPUs and NVIDIA GPU]
GRID AUGUST 2017 RELEASE
NVIDIA VIRTUAL GPU SW - AUGUST 2017 RELEASE
PASCAL HW SUPPORT
END TO END MANAGEMENT
GRID PERFORMANCE OPTIMIZED
                       TESLA M60             TESLA P40                          Gain
GPUs                   Dual GM204            Single GP102
Max Concurrent Users   32 (0.5GB FB)         24 (1GB FB)
Profile Options        0Q, 1Q, 2Q, 4Q, 8Q    1Q, 2Q, 3Q, 4Q, 6Q, 8Q, 12Q, 24Q
                       0B, 1B                1B
                       0A, 1A, 2A, 4A, 8A    1A, 2A, 3A, 4A, 6A, 8A, 12A, 24A
SPECviewperf 12        62                    110*
SGEMM TFLOPS           3.8                   10.6                               2x
Memory Bandwidth       160 GB/s              347 GB/s                           2x
* estimate
~2x for Virtual Data Center Workstations
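The Max Concurrent Users figures are consistent with dividing a board's total frame buffer by the per-user profile size. A minimal sketch of that arithmetic follows. The board capacities (16 GB on the Tesla M60 as 2 x 8 GB, 24 GB on the Tesla P40) are assumptions that do not appear on the slide, and `max_concurrent_users` is a hypothetical helper name.

```python
# Assumed total frame buffer per board in GB (not stated on the slide):
# Tesla M60 = 2 GPUs x 8 GB, Tesla P40 = 1 GPU x 24 GB.
BOARD_FRAME_BUFFER_GB = {"Tesla M60": 16.0, "Tesla P40": 24.0}


def max_concurrent_users(board: str, profile_fb_gb: float) -> int:
    """Count how many vGPUs of one profile size fit in the board's frame buffer."""
    return int(BOARD_FRAME_BUFFER_GB[board] // profile_fb_gb)


m60_users = max_concurrent_users("Tesla M60", 0.5)  # smallest M60 profile: 0.5 GB
p40_users = max_concurrent_users("Tesla P40", 1.0)  # smallest P40 profile: 1 GB
print(m60_users, p40_users)
```

Under these assumptions the results match the 32 and 24 users listed in the table.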
STANDARD SCHEDULER
BEST EFFORT SCHEDULING
Timesliced Round Robin Scheduler
Tasks generally execute within a timeslice
Best Effort Scheduling
[Diagram labels: VM 1, VM 2 and VM 3 task queues; Round Robin Scheduler; GPU Engine; pie chart SHARE OF GPU CYCLES with VM1, VM2, VM3]
SCHEDULING LONG RUNNING TASKS
ROOT-CAUSE FOR QoS ISSUES
Compute Tasks can be long running
Round Robin Scheduler fails when a single task does not complete within a reasonable time
Starves other VMs
Injects the “noisy neighbor” symptom
[Diagram labels: VM 1, VM 2 and VM 3 task queues; Round Robin Scheduler; GPU Engine; pie chart SHARE OF GPU CYCLES showing VM1]
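The starvation effect can be reproduced with a toy model. This is a sketch under stated assumptions, not NVIDIA's scheduler: tasks are plain durations measured in timeslices, the scheduler visits VMs in turn, and a task that has started always runs to completion. The function name `round_robin_no_preempt` and the sample queues are hypothetical.

```python
from collections import deque


def round_robin_no_preempt(queues, horizon):
    """Visit VMs in turn; each started task runs to completion (no preemption).

    queues maps a VM name to a list of task durations in timeslices.
    Returns the GPU time each VM received within the first `horizon` timeslices.
    """
    pending = {vm: deque(tasks) for vm, tasks in queues.items()}
    used = {vm: 0 for vm in queues}
    clock = 0
    while clock < horizon and any(pending.values()):
        for vm, tasks in pending.items():
            if not tasks or clock >= horizon:
                continue
            # The task occupies the GPU until it finishes or the window ends.
            run = min(tasks.popleft(), horizon - clock)
            used[vm] += run
            clock += run
    return used


# VM1 submits one long compute task; VM2 and VM3 submit short tasks.
queues = {"VM1": [20], "VM2": [1, 1, 1, 1], "VM3": [1, 1, 1]}
usage = round_robin_no_preempt(queues, horizon=20)
print(usage)
```

In this model VM1's single long task consumes the whole 20-timeslice window, so VM2 and VM3 receive no GPU time, which is the "noisy neighbor" symptom the slide describes.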
INTRODUCING: EQUAL SHARE SCHEDULER
GUARANTEE DETERMINISTIC QoS
New Advanced Scheduling mode: Equal Share (Available on Pascal HW)
Long Running Tasks are pre-empted and context saved to be resumed when rescheduled
Deterministic share of GPU cycles per VM
All running vGPU enabled VMs get equal share of GPU cycles
[Diagram labels: VM 1, VM 2 and VM 3 task queues; Equal Share Scheduler; Round Robin Scheduler; GPU Engine; pie chart SHARE OF GPU with VM1, VM2, VM3]
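A toy model of the equal-share idea is sketched below. It assumes tasks are plain durations in timeslices and that every VM with pending work gets exactly one timeslice per round, with unfinished tasks preempted and resumed on the VM's next turn. The function name `equal_share`, the `timeslice` parameter and the sample queues are hypothetical, and the real scheduler's internals are not described in the source.

```python
from collections import deque


def equal_share(queues, horizon, timeslice=1):
    """Give each VM with pending work one timeslice per round.

    A task that is still unfinished at the end of its slice is preempted;
    its remaining work stays at the head of the VM's queue (context saved)
    and resumes on that VM's next turn.
    """
    pending = {vm: deque(tasks) for vm, tasks in queues.items()}
    used = {vm: 0 for vm in queues}
    clock = 0
    while clock < horizon and any(pending.values()):
        for vm, tasks in pending.items():
            if not tasks or clock >= horizon:
                continue
            run = min(tasks[0], timeslice, horizon - clock)
            tasks[0] -= run  # remaining work is kept for the next turn
            if tasks[0] == 0:
                tasks.popleft()
            used[vm] += run
            clock += run
    return used


# VM1 submits one long compute task; VM2 and VM3 submit many short tasks.
queues = {"VM1": [20], "VM2": [1] * 10, "VM3": [1] * 10}
usage = equal_share(queues, horizon=18)
print(usage)
```

With three busy VMs and an 18-timeslice window, each VM receives 6 timeslices in this model, regardless of how long VM1's task is.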
END-TO-END MANAGEMENT
Taking GPU visibility to a new level with application monitoring
End user experience
Performance troubleshooting
Accurate sizing
Host monitoring
Guest monitoring
New: App monitoring
CUDA enabled app
- CUDA 9.0
- OCL 2.0
Quadro Virtual Data Center Workstation
- Quadro Value-Add
- Vulkan 1.0
- Shader Model 5.0
GRID Virtual PC
- OGL 4.5
- DX 9, 10, 11, 12
NVIDIA GRID GPU VIRTUALIZATION PLATFORM
Industry standard virtualization platform
Quadro virtual Data Center Workstation
vPC
Rendering
CUDA Compute
HPC
AI
Support (Workstation Apps, Rendering, HPC, DL, AI)
vGPU Monitoring, Insight and Management
Data Center and/or Cloud Accessible
Hypervisor
NVIDIA Virtualization Software
WRAP UP
- GPU Accelerated Apps are transforming a Broad Range of Industries
- Virtualizing Mixed Workloads in a multi-tenant environment requires
- Performance
- Deterministic QoS
- Insight
- Full Acceleration for every Application
[Diagram labels: vGPU; Suspend & Resume, Snapshots, vMotion, DRS; vSphere; Shared Resources: CPU, Mem, GPU]
See @booths: vSphere Cloud Platform - New Workloads; NVIDIA GRID™ EUC 3D Experience
The information in this presentation is intended to outline our general product direction and should not be relied on in making a purchasing decision. It is for informational purposes only and may not be incorporated into any contract.
Considered Milestones for VMware vSphere with NVIDIA GRID
Virtual PC, Virtual Workstation, Machine Learning, High Performance Computing
Suspend & Resume, Snapshots
vSphere vMotion, vSphere DRS
Roadmap, Tech Preview
See @booths
Roadmap
See @booths: VMW EUC 3D Experience, NVIDIA GRID
Remediate
[Diagram labels: CUDA Developer and Data Scientist VMs with vGPUs; VMware vSphere with NVIDIA GRID; GPUs]
Tech Preview
Develop/VDI/Inference by day
ML Training by night
See @Booths: VMW Cloud Platform - New Workloads, NVIDIA
[Diagram labels: CUDA Developer and Data Scientist VMs with vGPUs; VMware vSphere with NVIDIA GRID]
Roadmap
[Diagram labels: GPU; Suspend & Resume, Snapshots, vMotion, DRS; vSphere DRS; HA; Placement; vSphere; Shared Resources: CPU, Mem, GPU]
Overview - Considered Milestones for all NVIDIA GPU Enablement
See @booths
Introducing vSphere Scale-Out for Big Data and HPC Workloads
New package that provides all the core features required for scale-out workloads at an attractive price point
Features
• Hypervisor, vMotion, vShield Endpoint, Storage vMotion, Storage APIs, Distributed Switch, I/O Controls & SR-IOV, Host Profiles / Auto Deploy and more
Packaging
• Sold in Packs of 8 CPU at a cost-effective price point
Value of vSphere Scale-Out for Big Data and HPC
• NVIDIA GRID and VMware vSphere provide the operational benefits of virtualization with near native performance (95%) for GPU accelerated HPC, Big Data and Machine Learning
• VMware's vision is seamless integration of NVIDIA GPU technologies as native resources of VM infrastructure
• New vSphere Scale-Out SKU; new package with attractive price point for Big Data/HPC/ML dedicated infrastructure virtualization. http://blogs.vmware.com/vsphere/2017/09/vsphere-scale-now-available.html
Contact us!
We’d like to learn about your use cases and challenges.
Raj Rao, NVIDIA GRID Product Management – [email protected]
Ziv Kalmanovich, vSphere Product Management – [email protected]
Recommended Additional Resources at VMWorld