IBM - IBM Spectrum Scale Data Sheet
IBM - IBM Spectrum Scale Data Sheet
Data Sheet
Part of the IBM Spectrum Storage family of solutions, IBM Spectrum Scale is an enterprise-
grade parallel file system that provides superior resiliency, scalability and control. Based on IBM
General Parallel File System (GPFS), IBM Spectrum Scale delivers scalable capacity and
performance to handle demanding data analytics, content repositories and technical computing
workloads. Storage administrators can combine flash, disk, cloud, and tape storage into a
unified system with higher performance and lower cost than traditional approaches. With
thousands of customers and nearly 20 years of demanding production deployments, IBM
Spectrum Scale is a file system that can adapt to both application performance and capacity
needs across the enterprise. By including IBM Spectrum Scale in their software-defined
infrastructure, organizations can streamline data workflows, help improve service, reduce costs,
manage risk and deliver business results today while positioning the enterprise for future
growth.
IBM Spectrum Scale enables the unification of virtualization, analytics, file and object use cases
into a single scale-out storage solution. It can provide a single namespace for all of this data,
offering a single point of management with an intuitive graphical user interface (GUI). Using
storage policies transparent to end users, data can be compressed or tiered to tape or cloud to
help cut costs; data can also be tiered to high-performance media, including server cache, to
lower latency and improve performance. Intelligent caching of data at remote sites ensures that
data is available with local read/write performance across geographically distributed sites using
Active File Management (AFM).
IBM Spectrum scale is a parallel file system, where the intelligence is in the client and the client
spreads the load across all storage nodes in a cluster, even for individual files. In traditional
scale-out network-attached storage (NAS), one file can only be accessed through one node at a
time by an individual client, limiting performance and scalability. By contrast, the IBM Spectrum
Scale architecture allows it to seamlessly handle tens of thousands of clients, billions of files
and yottabytes of data.
IBM Spectrum Scale allows different applications or services to access the same data without
moving or altering it. Data can be written and retrieved as either files or objects. Rather than use
a copy and change gateway, IBM Spectrum Scale supports both protocols natively for higher
performance and simplified administration. The common storage layer enables most IBM
Spectrum Scale features, including authentication, encryption and tiering, for both object and
file storage.
Systems Hardware
Data Sheet
IBM Spectrum Scale includes integrated management tools and an intuitive GUI to help manage
data at scale. The system can span multiple storage environments and data centers to eliminate
data silos and “filer sprawl.” It can automatically spread data across multiple storage
devices—optimizing available storage utilization, reducing administration and delivering high
performance where needed. IBM Spectrum Scale has multiple deployment options and
configurations to incorporate current NFS filers, block storage and storage-rich servers into a
global namespace with universal access. Its file system supports interfaces for file (POSIX, NFS,
CIFS), object (S3, SWIFT) and Hadoop Distributed File System (HDFS) for in-place analytics. IBM
Spectrum Scale is the caretaker of business-critical data with the ability to replicate, encrypt,
compress, and distribute data across different hardware platforms, systems and data centers.
Global collaboration
Systems Hardware
Data Sheet
IBM Spectrum Scale enables low-latency read and write access to data from anywhere in the
world using AFM distributed routing and advanced caching technology. AFM expands the IBM
Spectrum Scale global namespace across geographical distances, providing fast read and write
performance with automated namespace management. As data is written or modified at one
location, all other locations get the same data with minimal delays. AFM leverages the inherent
scalability of IBM Spectrum Scale, providing a high-performance, location-independent solution
that masks network failures and hides wide-area latencies and outages. These game-changing
capabilities accelerate project schedules and improve productivity for globally distributed
teams.
IBM Spectrum Scale can help improve performance, lower costs, add resiliency and simplify
collaboration with algorithmic and policy-driven data movement, copying and caching. IBM
Spectrum Scale catalogs data across multiple storage pools, including the cloud. It tracks usage
profiles, storage latency and a broad range of standard and custom metadata from which data
movement policies can be constructed.
Armed with the knowledge of the data usage and the underlying storage, IBM Spectrum Scale
curates data across multiple tiers of storage, including tape and cloud. The powerful, data-
aware intelligence engine can create optimized tiered storage pools by grouping devices—flash,
Systems Hardware
Data Sheet
solid-state drive (SSD), disk or tape—based on performance, locality or cost. Migration policies
transparently move data from one storage pool to another without changing the file’s location in
the directory structure. Automated analysis of data usage patterns can help administrators pull
data back up to higher performance tiers as needed. The information lifecycle management
toolset built into IBM Spectrum Scale helps simplify data management by enabling additional
control over data placement. The toolset includes storage pooling and a high-performance,
scalable, rule-based policy engine.
IBM Spectrum Scale supports Hadoop workloads and HDFS—without requiring any changes to
applications. With IBM Spectrum Scale HDFS Transparency connector, multiple IBM Spectrum
Scale clusters or another HDFS repository can be federated into a single HDFS instance. This
reduces the need to move data, simplifying the deployment and workflow of Hadoop, Apache
Spark and related packages.
IBM Spectrum Scale provides system scalability, very high availability and reliability with no
single point of failure in large-scale storage infrastructures. Administrators can configure the file
system so that it automatically remains available if a disk or server fails. The system is designed
to transparently fail over metadata operations and other services, which can be distributed
throughout the entire cluster. For additional reliability, IBM Spectrum Scale supports snapshots,
synchronous and asynchronous replication, and asynchronous error diagnosis while affected
input/output (I/O) operations continue. IBM Spectrum Scale offers protection of data at rest and
secure deletion with file-level encryption. It can encrypt data in flight or at rest with
independent key management that integrates with leading enterprise key management
systems. IBM Spectrum Scale can be part of the enterprise disaster-recovery plans with the
ability to quickly back up, copy and restore data as needed. With automatic fail-over and
intelligent fail-back, IBM Spectrum Scale keeps businesses and organizations up and running.
Using Transparent Cloud Tiering, public cloud or on-premises IBM Cloud Object Storage can be
added as a tier of storage. Ideal for adding active archive storage pools or taking advantage of
storage as a service, the design leaves end users unaffected by data movement to and from the
cloud. IBM Spectrum Scale manages the metadata, movement and caching to seamlessly tier to
and from any Amazon S3 or OpenStack Swift storage without the inconvenience, complexity and
performance hit of adding a separate cloud or object storage silo.
Innovation within the IBM Spectrum Scale platform continues at a brisk pace. IBM Spectrum
Scale includes these new capabilities and enhancements, among many others:
Significantly reduced inter-node software path latency to support the newest low-latency,
high-bandwidth hardware such as NVMe
Improved performance for many small and large block size workloads simultaneously from
new 4 MB default block size with variable sub-block size based on block size choice
Enhanced GUI features for many capabilities, including performance, capacity, network
monitoring, AFM (multi-cluster management), Transparent Cloud Tiering, and enhanced
maintenance and support, including interaction with IBM remote support
New file audit logging capability to track user accesses to filesystem and events supported
across all nodes and all protocols
Expanded Transparent Cloud Tiering support with better granularity of data and support for
multiple cloud accounts and multiple containers
Better backup/restore via new support of snapshots, better performance via improved load
balancing, and better production vs. recovery prioritization
Faster implementation and deployment in IBM Elastic Storage Server environments with
enhanced Call Home configuration and network pre-checking
Systems Hardware
Data Sheet
IBM Spectrum Scale is available in AWS marketplace and includes a supported reference
architectures. IBM Spectrum Scale on AWS is configured in a high-performance tier in AWS for
AI and other high-performance applications and can be configured with the ESS 5000 and ESS
3000 or any IBM Spectrum Scale configuration.
IBM Spectrum Scale has containerized the storage access inside a Kuberneties container.
Storage deployed inside containers, alongside applications running in containers, is an
important innovation that benefits both developers and administrators. By containerizing
storage services and managing them under a single management plane such as Kubernetes,
administrators have fewer housekeeping tasks to deal with, allowing them to focus on more
value-added tasks. Developers benefit by being able to provision application storage
themselves that’s both highly elastic and developer-friendly. With IBM Spectrum Scale
container native storage access there are no limits of localized disks and performance is not
effected by the application server but is optimized by external storage resources. IBM Spectrum
Scale provides most of the benefits of container native storage without the limits of using only
local disk and application server performance.
IBM Spectrum Scale has a simple message, Secure access. Anywhere. Enterprise data services.
Everywhere. Hybrid cloud. Anyone. IBM Spectrum Scale is the center of IBM Storage for Data
and AI information architecture. Its a global hybrid cloud file system with parallel access and is
the glue that creates a hybrid cloud storage solution for an AI information architecture for any
enterprise or organization.