Intel AVX Documentation
Intel AVX Documentation
December 2023
Revision 2.0
CHAPTER 1
CONVERGED VECTOR ISA: INTEL® ADVANCED VECTOR EXTENSIONS 10
1.1 BACKGROUND . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-1
1.2 INTRODUCTION TO INTEL® AVX10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-1
1.3 ENUMERATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2
1.4 PERFORMANCE BENEFITS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2
1.5 AVAILABILITY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-3
1.6 CONCLUSION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-3
PAGE
FIGURES
Figure 1-1. Intel® AVX-512 Feature Flags Across Intel® Xeon® Processor Generations vs. Intel® AVX10 . . . . . . . . . . . . . . . . . . . . . . 1-2
Figure 1-2. Intel® ISA Families and Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-3
Revision History
Revision
Description Date
Number
1.0 Initial release of the document. June 2023
®
Updated Section 1.2, “Introduction to Intel AVX10,” to remove the 32-bit
2.0 December 2023
mask register limitation.
Intel® Advanced Vector Extensions 10 (Intel® AVX10) introduces a modern vector Instruction Set Architecture
(ISA) that will be supported across future Intel® processors. This new ISA includes all the richness of the Intel®
Advanced Vector Extensions 512 (Intel® AVX-512) with additional features and capabilities enabling it to seam-
lessly run across Performance-cores and Efficient-cores, delivering performance and consistency across all plat-
forms. It also introduces a new enumeration approach based on version and supported vector lengths, reducing the
burden on the developer to check multiple feature bits for the platform. Intel AVX10 extends and enhances the
capabilities of Intel AVX-512 to benefit all Intel® products and will be the vector ISA of choice moving into the
future.
1.1 BACKGROUND
In 2016, Intel launched a major update to its vector instruction set with the launch of a high-performance vector
ISA named Intel Advanced Vector Extensions 512 (Intel AVX-512). The Intel AVX-512 ISA included several new
features and capabilities over the Intel® Advanced Vector Extensions 2 (Intel® AVX2) ISA including 512-bit vector
registers, a discrete feature enumeration methodology, 16 additional vector registers, 8 mask registers, 512-bit
vector length embedded rounding, and a large suite of new instructions. Over time, Intel AVX-512 evolved to
include support for shorter vector length versions of instructions (128 and 256 bits) along with many additional
instructions, each with its own CPUID feature flag, driving performance and capabilities for Performance-core (P-
core) targeted vector workloads.
The Intel® AVX family of instruction sets (Intel AVX, Intel AVX2, and Intel AVX-512) have successfully gained wide
industry adoption for a variety of applications including video processing, cryptography, HPC, AI, gaming, and
others. Building on this momentum, Intel is announcing the next generation Intel AVX10 as the standard for ISA,
supported by our future Efficient-cores (E-cores) and Performance-cores (P-cores). Intel AVX10 will enable the
ecosystem to seamlessly integrate solutions across products and platforms and innovate for future generations of
our products for years to come.
core based Xeon processors will continue to support all Intel AVX-512 instructions ensuring that legacy applications
continue to run without impact.
1.3 ENUMERATION
The developer community has provided feedback that the current Intel AVX-512 enumeration method has become
increasingly unwieldy over time. As new instructions were introduced, they were assigned a new CPUID feature flag
that would need to be checked to determine processor support. As of future Intel Xeon processors with P-cores,
codenamed Granite Rapids, there are expected to be more than 20 discrete Intel AVX-512 feature flags. To address
this, Intel AVX10 introduces a new versioning approach to enumeration: a Vector ISA feature bit specifying Intel
AVX10 support, an Intel AVX10 ISA Version Number, and three bits enumerating 128-, 256-, and 512-bit vector
length support in the product.
The Intel AVX10 ISA Version Number will be inclusive and monotonically increasing. A developer can expect that
Intel AVX10 Version N+1 will include all the features and capabilities included in Version N. With the stated goal of
minimizing developer impact, a new version of the Intel AVX10 ISA can be expected to include a significant suite of
new instructions and capabilities, delivering sufficient additional value to justify the associated software enable-
ment effort. In rare cases, a discrete CPUID feature flag may be allocated for a segment-specific feature or in the
case of an interim launch in between new Intel AVX10 versions.
The Intel AVX-512 ISA will be frozen as of the introduction of Intel AVX10 and all CPUID feature flags will continue
to be enabled on future P-core processors for legacy support. All new subsequent vector instructions will be
enumerated only as part of Intel AVX10. Apart from a few special cases, those instructions will be supported at all
vector lengths, with 128-bit and 256-bit vector lengths being supported across all processors, and 512-bit vector
lengths additionally supported on P-core processors.
Figure 1-1. Intel® AVX-512 Feature Flags Across Intel® Xeon® Processor Generations vs. Intel® AVX10
vector lengths, Intel AVX10/512 will be supported on Intel P-cores, continuing to deliver the best-in-class perfor-
mance for AI, scientific, and other high-performance codes. New Intel® AVX10 libraries, compilers, and tool
support will also be provided to help application developers realize the best achievable performance for all vector
lengths and processor targets.
1.5 AVAILABILITY
Intel AVX10 Version 1 will be introduced for early software enablement and supports the subset of all the Intel AVX-
512 instruction set available as of future Intel Xeon processors with P-cores, codenamed Granite Rapids, that is
forward compatible to Intel AVX10. This version will not include the new 256-bit vector instructions supporting
embedded rounding or any of the new instructions and will serve as the transition base version from Intel AVX-512
to Intel AVX10.
Intel AVX10 Version 2 will include the 256-bit instruction forms supporting embedded rounding as well as a suite of
new Intel AVX10 instructions covering new AI data types and conversions, data movement optimizations, and
standards support. All new instructions will be supported at 128-, 256-, and 512-bit vector lengths with limited
variances. All Intel AVX10 versions will implement the new versioning enumeration scheme.
1.6 CONCLUSION
Intel AVX10 represents a major shift to supporting a high-performance vector ISA across future Intel processors.
It allows the developer to maintain a single code-path that achieves high performance across all Intel platforms
with the minimum of overhead checking for feature support. Future development of the Intel AVX10 ISA will
continue to provide a rich, flexible, and consistent environment that optimally supports both Server and Client
products.