Module 2: Data Center Environment 1
Module 2: Data Center Environment 1
DATA CENTER
ENVIRONMENT
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 1
Module 2: Data Center Environment
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 2
Module 2: Data Center Environment
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 3
Application
• A software program that provides logic for computing operations
• Commonly deployed applications in a data center
Business applications – email, enterprise resource planning (ERP),
decision support system (DSS)
Management applications – resource management, performance
tuning, virtualization
Data protection applications – backup, replication
Security applications – authentication, antivirus
• Key I/O characteristics of an application
Read intensive vs. write intensive
Sequential vs. random
I/O size
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 4
Application Virtualization
Application Virtualization
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 5
Database Management System (DBMS)
• Database is a structured way to store data in logically organized
tables that are interrelated
Helps to optimize the storage and retrieval of data
• DBMS controls the creation, maintenance, and use of databases
Processes an application’s request for data
Instructs the OS to retrieve the appropriate data from storage
• Popular DBMS examples are MySQL, Oracle RDBMS, SQL Server,
etc.
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 6
Host (Compute)
• Resource that runs applications with the
help of underlying computing
components
Example: Servers, mainframes, laptop,
desktops, tablets, server clusters, etc.
• Consists of hardware and software
components
• Hardware components
Include CPU, memory, and input/output
IP
(I/O) devices
• Software components
Include OS, device driver, file system,
volume manager, and so on
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 7
Operating Systems and Device Driver
• In a traditional environment OS resides between the applications
and the hardware
Responsible for controlling the environment
• In a virtualized environment virtualization layer works between
OS and hardware
Virtualization layer controls the environment
OS works as a guest and only controls the application environment
In some implementation OS is modified to communicate with
virtualization layer
• Device driver is a software that enables the OS to recognize the
specific device
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 8
Memory Virtualization
• An OS feature that presents larger memory to Operating System
the application than physically available
Additional memory space comes from disk
storage
Space used on the disk for virtual memory is
called ‘swap space/swap file or page file’ Memory
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 9
Logical Volume Manager (LVM)
• Responsible for creating and
controlling host level logical storage Logical Volume
Physical view of storage is converted
Logical Volume Logical Disk
to a logical view Block
volume group
Volume Group
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 10
LVM Example: Partitioning and Concatenation
Hosts
Logical Volume
Physical Volume
Partitioning Concatenation
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 12
File System
File System Blocks
Users
Files File System
1 2 3
Disk Physical
Disk Sectors Extents LVM Logical Extents
6 5 4
Mapped to Mapped to Mapped to
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 13
Compute Virtualization
Compute Virtualization
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 15
Need for Compute Virtualization
CPU NIC Card Memory Hard Disk CPU NIC Card Memory Hard Disk
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 16
Desktop Virtualization
Desktop Virtualization
thin clients
Improved data security
Desktop VMs
Simplified data backup and PC maintenance
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 17
Module 2: Data Center Environment
Lesson 2: Connectivity
During this lesson the following topics are covered:
• Physical components of connectivity
• Storage connectivity protocols
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 18
Connectivity
• Interconnection between hosts or between a host and
peripheral devices, such as storage
• Physical Components of Connectivity are:
Host interface card, port, and cable
• Protocol = a defined format for communication between sending
and receiving devices
Popular storage interface protocols: IDE/ATA and SCSI
Host
Adapter Cable
Disk
Port
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 19
IDE/ATA and Serial ATA
• Integrated Device Electronics (IDE)/Advanced Technology
Attachment (ATA)
Popular interface used to connect hard disks or CD-ROM drives
Available with varity of standards and names
• Serial Advanced Technology Attachment (SATA)
Serial version of the IDE/ATA specification that has replaced the
parallel ATA
Inexpensive storage interconnect, typically used for internal
connectivity
Provides data transfer rate up to 6 Gb/s (standard 3.0)
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 20
SCSI and SAS
• Parallel Small computer system interface (SCSI)
Popular standard for connecting host and peripheral devices
Commonly used for storage connectivity in servers
Higher cost than IDE/ATA, therefore not popular in PC
environments
Available in wide variety of related technologies and standards
Support up to 16 devices on a single bus
Ultra-640 version provides data transfer speed up to 640 MB/s
• Serial Attached SCSI (SAS)
Point-to-point serial protocol replacing parallel SCSI
Supports data transfer rate up to 6 Gb/s (SAS 2.0)
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 21
Fibre Channel and IP
• Fibre Channel (FC)
Widely used protocol for high speed communication to the storage
device
Provides a serial data transmission that operates over copper wire
and/or optical fiber
Latest version of the FC interface ‘16FC’ allows transmission of
data up to 16 Gb/s
• Internet Protocol (IP)
Traditionally used to transfer host-to-host traffic
Provide opportunity to leverage existing IP based network for
storage communication
Examples: iSCSI and FCIP protocols
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 22
Module 2: Data Center Environment
Lesson 3: Storage
During this lesson the following topics are covered:
• Various storage options
• Disk drive components, addressing, and performance
• Enterprise Flash drives
• Host access to storage and direct-attached storage
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 23
Storage Options
• Magnetic Tape
Low cost solution for long term data storage
Preferred option for backup destination in the past
Limitations
Sequential data access
Single application access at a time
Physical wear and tear
Storage/retrieval overheads
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 24
Storage Options (contd.)
• Optical discs
Popularly used as distribution medium in small, single-user
computing environments
Limited in capacity and speed
Write once and read many (WORM): CD-ROM, DVD-ROM
Other variations: CD-RW, Blu-ray discs
• Disk drive
Most popular storage medium
Large storage capacity
Random read/write access
• Flash drives
Uses semiconductor media
Provide high performance and low power consumption
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 25
Disk Drive Components
Controller
Board
HDA
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 26
Physical Disk Structure
Spindle Sector
Sector
Track
ylinder
Track
Platter
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 28
Logical Block Addressing
Sector 8
Block 0
Head 0 (Upper Surface)
Cylinder 1 Block 32
(Lower Surface)
Block 64
Block 128
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 29
Disk Drive Performance
• Electromechanical device
Impacts the overall performance of the storage system
• Disk service time
Time taken by a disk to complete an I/O request, depends on:
Seek time
Rotational latency
Data transfer rate
Disk service time = seek time + rotational latency + data transfer time
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 30
Seek Time
• Time taken to position the read/write head
• The lower the seek time, the faster the I/O
operation
• Seek time specifications
include
Full stroke
Average Radial
Track-to-track Movement
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 31
Rotational Latency
• The time taken by the platter
to rotate and position the data
under the R/W head
• Depends on the rotation speed
of the spindle
• Average rotational latency
One-half of the time taken for
a full rotation
For ‘X’ rpm, drive latency is
calculated in milliseconds as:
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environnent 32
Data Transfer Rate
• Average amount of data per unit time that the drive can deliver
to the HBA
Internal transfer rate : Speed at which data moves from a platter’s
surface to the internal buffer of the disk
External transfer rate: Rate at which data move through the
interface to the HBA
Head Disk
HBA Interface Buffer Assembly
Disk Drive
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 33
I/O Controller Utilization Vs. Response Time
• Based on fundamental laws of disk drive performance:
100%
0% Utilization 70%
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environnent 34
Storage Design Based on Application Requirements and
Disk Drive Performance
• Disks required to meet an application’s capacity need (DC):
• IOPS serviced by a disk (S) depends upon disk service time (TS):
TS is time taken for an I/O to complete, therefore IOPS serviced by
a disk (S) is equal to (1/TS)
For performance sensitive application (S)=
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environnent 35
Enterprise Flash Drives
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 37
Host Access to Storage
Storage Networking Options
Compute Compute Compute
Application
Application Application
Storage File-level
Network Request
Block-level Storage
Network
Block-level
Request Network Request
File System
Storage
System
Storage Storage Storage
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 38
Direct-Attached Storage (DAS)
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 39
Module 2: Data Center Environment
Concept in Practice
• VMware ESXi
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 40
VMware ESXi
• Industry’s leading hypervisor
Enable virtualization of x86 hardware platforms
• Physical machine that houses ESXi is called ESXi host
ESXi host abstracts physical compute resources to run multiple
VMs concurrently on same physical server
• Two Components
VMKernel
Work similar to OS – responsible for process creation, resource
scheduling, and so on
Virtual machine monitor
Performs binary translation for privileged OS instructions that can
not be virtualized
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 41
Module 2: Summary
Key points covered in this module:
• Key data center elements
• Application and compute virtualization
• Disk drive components and performance
• Enterprise flash drives
• Host access to storage
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 42
Check Your Knowledge – 1
• Which is a benefit of compute virtualization?
A. Enables compute memory swapping
B. Improves compute utilization
C. Isolates compute memory from the applications
D. Isolates compute OS from the applications
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 43
Check Your Knowledge – 2
• What is concatenation?
A. Grouping multiple physical drives into a logical drive
B. Dividing a physical drive into multiple logical drives
C. Process of writing disk metadata on a logical drive
D. Adding more capacity to a physical drive through de-fragmentation
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 44
Check Your Knowledge – 3
• Which is a challenge of DAS environment?
A. Low performance
B. Limited scalability
C. Deployment complexity
D. High initial investment
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 45
Exercise: Design Storage Solution for New Application
• Scenario
Characteristics of new application:
Require 1TB of storage capacity
Peak I/O workload 4900 IOPS
Typical I/O size is 4KB
Specifications of the available disk drives:
15K rpm drive with storage capacity = 100 GB
Average seek time = 5ms
Data transfer rate = 40 MB/sec
As it is business critical application, response time must be within
acceptable range
• Task
Calculate the number of disks required for the application
EMC Proven Professional. Copyright © 2012 EMC Corporation. All Rights Reserved. Module 2: Data Center Environment 46