Jing Xie’s Post

🌊Spot Instance Surfer | 🤖GPU Optimizer

7mo

These features help HPC users save 50-80% on EC2 compute costs...here's how they work -- Optimize is a feature we developed for stateful jobs running on EC2 On-Demand and migrates them to run on EC2 Spot instances: 1. This compute optimization feature uses our memory checkpointing technology to ensure the job doesn't lose state This is key because for long-running scientific computing jobs it is difficult to restart what could be hours, days or even weeks of runtime just to save $ 2. During extremely busy periods, EC2 Spot interruption rates can be as high as 50-80% and responding to each rebalance notification is very disruptive If you're running on a Spot instance that actually get reclaimed and our software can't easily find another Spot instance, we perform a stateful migration to on-demand instances and Optimize helps look for a Spot instance 10, 20, 30 mins later to move back to -- How many times have you run out of memory? Out of memory protection is a feature we developed for HPC jobs that are hard to deterministically predict memory requirements pre-run: 1. Being able to run on smaller EC2 instances and migrating to larger instances only when needed without losing state results in material cost savings 2. Our software monitors resource utilization on all the worker nodes and when we detect memory utilization approaching 100% and/or swap is hit, we perform a full application checkpoint, migrate you to the larger instance, and resume running -- Some of these capabilities that we have developed to optimize Cloud costs are also quite useful for on-prem HPC application. Drop a comment below if you have questions or shoot me a DM if you want to chat on how implement memory checkpointing in your HPC architecture. #AWS #HPC #SuperComputing #Slurm #PEARC24

2 Comments

Jing Xie

🌊Spot Instance Surfer | 🤖GPU Optimizer

7mo

This is an example where the job starts on Spot. Then there was a Spot preemption storm and we used memory checkpointing to move a customer job to on-demand (see 2nd EC2 instance stats). Then used Optimize to go back to using Spot instances and SpotSurfer to survive over 20 Spot interruptions and continually find other Spot instances to migrate to. It also bumped the job to a larger memory instance with OOM protection, saving the customer time and completing the job at a fraction of the on-demand cost.

Divya Atre

Building brand & demand through content marketing, social media marketing and campaigns

7mo

This is a great feature for HPC users to save on EC2 compute costs. The memory checkpointing technology is key for long-running jobs and the out of memory protection feature is helpful for hard-to-predict memory requirements. It's also interesting to know that these capabilities can be useful for on-prem HPC applications. Thanks for sharing!

See more comments

To view or add a comment, sign in

More Relevant Posts

Clarify.ai

19 followers
4mo
Report this post
DAY 5 Amazon Web Services (AWS) EC2 Instance Types Overview There are several EC2 Instance Types, each designed to serve different needs. Choosing the right instance depends on whether your application requires more compute power, memory, or storage capacity. 1. General Purpose Instances Use Case: Best for balanced computing, memory, and networking resources. Examples: Application servers Gaming servers Backend servers for businesses Small and medium-sized databases Recommendation: Choose when there's an equal need for computing, memory, and networking resources. 2. Compute Optimized Instances Use Case: Best for high-performance, compute-intensive tasks. Examples: High-performance web applications Compute-heavy gaming servers Application servers Recommendation: Choose for workloads that need more processing power over memory or storage. 3. Memory Optimized Instances Use Case: Ideal for applications requiring large amounts of data to be preloaded and processed quickly. Examples: Large dataset applications In-memory databases Recommendation: Use for workloads where fast data processing and large memory are needed. 4. Accelerated Computing Instances Use Case: Utilizes hardware accelerators for faster data processing, especially for compute-heavy tasks. Examples: Graphics rendering Streaming Recommendation: Best for workloads like machine learning, AI, or tasks requiring significant GPU or FPGA resources. 5. Storage Optimized Instances Use Case: Optimized for workloads requiring high input/output operations. Examples: Data warehouses Large-scale databases Online transaction processing systems Recommendation: Choose when fast local storage access is essential for large data sets. #AWSEC2 #CloudComputing #GeneralPurposeInstance #ComputeOptimized #MemoryOptimized #AcceleratedComputing #StorageOptimized #AWSCloud #CloudSolutions #AWSArchitecture #CloudInfrastructure #CloudMigration
Like Comment
To view or add a comment, sign in
Akil Saji

Entrepreneur | AWS Cloud | Startups | XAUUSD
4mo
Report this post
DAY 5 Amazon Web Services (AWS) EC2 Instance Types Overview #AWSEC2 #CloudComputing #GeneralPurposeInstance #ComputeOptimized #MemoryOptimized
Clarify.ai

19 followers
4mo

DAY 5 Amazon Web Services (AWS) EC2 Instance Types Overview There are several EC2 Instance Types, each designed to serve different needs. Choosing the right instance depends on whether your application requires more compute power, memory, or storage capacity. 1. General Purpose Instances Use Case: Best for balanced computing, memory, and networking resources. Examples: Application servers Gaming servers Backend servers for businesses Small and medium-sized databases Recommendation: Choose when there's an equal need for computing, memory, and networking resources. 2. Compute Optimized Instances Use Case: Best for high-performance, compute-intensive tasks. Examples: High-performance web applications Compute-heavy gaming servers Application servers Recommendation: Choose for workloads that need more processing power over memory or storage. 3. Memory Optimized Instances Use Case: Ideal for applications requiring large amounts of data to be preloaded and processed quickly. Examples: Large dataset applications In-memory databases Recommendation: Use for workloads where fast data processing and large memory are needed. 4. Accelerated Computing Instances Use Case: Utilizes hardware accelerators for faster data processing, especially for compute-heavy tasks. Examples: Graphics rendering Streaming Recommendation: Best for workloads like machine learning, AI, or tasks requiring significant GPU or FPGA resources. 5. Storage Optimized Instances Use Case: Optimized for workloads requiring high input/output operations. Examples: Data warehouses Large-scale databases Online transaction processing systems Recommendation: Choose when fast local storage access is essential for large data sets. #AWSEC2 #CloudComputing #GeneralPurposeInstance #ComputeOptimized #MemoryOptimized #AcceleratedComputing #StorageOptimized #AWSCloud #CloudSolutions #AWSArchitecture #CloudInfrastructure #CloudMigration
Like Comment
To view or add a comment, sign in
Nir Peleg

FinOps Engineer Team Lead ✨ Helping companies saving money on AWS ✨ FinOps Enthusiast ✨ Enjoy Cracking strategic business challenges
3mo
Report this post
🎉 Please welcome a new member to the EC2 family: I7ie instances 🎉 Designed for large storage I/O intensive workloads, I7ie instances are powered by 5th generation Intel Xeon Scalable processors with an all-core turbo frequency of 3.2 GHz, offering up to 40% better compute performance and 20% better price performance over existing I3en instances. I7ie instances have the highest local NVMe storage density in the cloud for storage optimized instances and offer up to twice as many vCPUs and memory compared to prior generation instances. Powered by 3rd generation AWS Nitro SSDs, I7ie instances deliver up to 65% better real-time storage performance, up 50% lower storage I/O latency, and 65% lower storage I/O latency variability compared to I3en instances. I7ie are high density storage optimized instances, ideal for workloads requiring fast local storage with high random read/write performance at very low latency consistency to access large data sets. I7ie instances also deliver 40% better compute performance to run more complex queries without increasing the storage density per vCPU. Additionally, the 16KB torn write prevention feature, enables customers to eliminate performance bottlenecks. I7ie instances deliver up to 100Gbps of network bandwidth and 60Gbps of bandwidth for Amazon Elastic Block Store (EBS). https://lnkd.in/dTsnxUTa #finops #ec2 #costoptimization #aws

Introducing Amazon EC2 next generation high density Storage Optimized I7ie instances - AWS

aws.amazon.com
Like Comment
To view or add a comment, sign in
Manal Fadhil

Professor in Computer Science
3mo Edited
Report this post
Amazon EC2 instance type specifications: Amazon EC2 provides a wide selection of instance types optimized to fit different use cases. Instance types comprise varying combinations of CPU, memory, storage, and networking capacity and give you the flexibility to choose the appropriate mix of resources for your applications. Each instance type includes one or more instance sizes, allowing you to scale your resources to the requirements of your target workload. We group EC2 instance into the following categories: • General purpose – Provide a balance of compute, memory, and networking resources. These instances are ideal for applications that use these resources in equal proportions, such as web servers and code repositories. Burstable performance – The T instance family is also referred to as burstable performance instances. These instances provide a baseline CPU performance with the ability to burst above the baseline at any time. For more information, see Burstable performance instances in the Amazon EC2 User Guide. • Compute optimized – Designed for compute intensive applications that benefit from high performance processors. These instances are ideal for batch processing workloads, media transcoding, high performance web servers, high performance computing (HPC), scientific modeling, dedicated gaming servers, ad server engines, and machine learning inference. • Memory optimized – Designed to deliver fast performance for workloads that process large data sets in memory. • Storage optimized – Designed for workloads that require high, sequential read and write access to very large data sets on local storage. They are optimized to deliver tens of thousands of low latency, random I/O operations per second (IOPS) to applications. • Accelerated computing – Use hardware accelerators, or co-processors, to perform functions, such as floating point number calculations, graphics processing, or data pattern matching, more efficiently than is possible in software running on CPUs. • High-performance computing – Purpose built to offer the best price performance for running HPC workloads at scale on AWS. These instances are ideal for applications that benefit from high-performance processors, such as large, complex simulations and deep learning workloads. • Previous generation – AWS offers previous generation instance types for users who have optimized their applications around them and have yet to upgrade. We encourage you to use 6 Amazon EC2 Instance Types current generation instance types to get the best performance, but we continue to support previous generation instance types. #aws #IUTP
Like Comment
To view or add a comment, sign in
Pradyut Kumar Ghosh

AWS Certified Solutions Architect Associate | Aspiring DevOps & Cloud Engineer | Experienced in AWS, Serverless Solutions, and Infrastructure Automation, open to new opportunities
1mo
Report this post
Post 1/2: AWS EC2 – Mastering Compute Power in the Cloud! 🚀 Whether you're launching your first instance or optimizing a complex architecture, Amazon EC2 (Elastic Compute Cloud) is at the heart of AWS. Let’s break it down: 1️⃣ What Makes EC2 Powerful? ✅ Scalable virtual servers for various workloads ✅ Wide range of instance types to optimize performance ✅ Flexible storage & networking options for efficiency 2️⃣ Choosing the Right EC2 Instance Type 📌 General Purpose – Balanced compute, memory & networking (T, M series) 📌 Compute Optimized – High CPU performance (C series) 📌 Memory Optimized – Large RAM for in-memory workloads (R, X, Z series) 📌 Storage Optimized – High disk I/O for databases & analytics (I, D, H series) 📌 Accelerated Computing – GPU & FPGA for AI/ML, graphics (P, G, F series) ⚡ Tip: Use Spot Instances for cost savings up to 90% for non-critical workloads! 3️⃣ Storage Options in EC2 💾 EBS (Elastic Block Store) – Persistent block storage for EC2 📂 Instance Store – Ephemeral storage tied to instance lifecycle 📦 EFS (Elastic File System) – Scalable file storage for shared access ☁️ S3 & Glacier – Object storage for backups, archival, and static data 🚀 Pro Tip: Use EBS Snapshots for fast disaster recovery & backups! 4️⃣ EC2 Billing & Cost Optimization 💰 📌 On-Demand – Pay per second (flexible but costly) 📌 Reserved Instances – Up to 72% savings for long-term commitments 📌 Spot Instances – Unused capacity at a massive discount 📌 Savings Plans – Flexible cost savings over 1-3 years 📌 Auto Scaling – Scale up/down dynamically & pay only for what you use! 🔍 Pro Tip: Use AWS Compute Optimizer to analyze instance usage and optimize costs! 🔽 But there’s more! 🔽 In the next post (2/2), we’ll cover IAM roles, high availability, security best practices, and disaster recovery! Stay tuned! 🔥 💬 Have you worked with EC2? Share your favorite optimization tips in the comments! 👇 #AWS #CloudComputing #DevOps #AWSEC2 #CostOptimization #Scalability #CloudStorage
Like Comment
To view or add a comment, sign in
Aishwarya Rajendran

AWS DevOps Engineer
2mo
Report this post
⚡ Types of Amazon EC2 Instances & When to Use Them ⚡ 🔍 1. General Purpose (T, M Series): Use Case: Balanced CPU, memory, and network performance. When to Prefer: Suitable for web servers, development environments, and applications with moderate resource requirements. Examples: t3, m5. 🔍 2. Compute Optimized (C Series): Use Case: High CPU power for compute-intensive tasks. When to Prefer: Best for batch processing, media encoding, high-performance web servers, and gaming servers. Examples: c5, c6g. 🔍 3. Memory Optimized (R, X, Z Series): Use Case: Workloads that require significant memory. When to Prefer: Ideal for in-memory databases (like Redis), SAP workloads, and real-time big data processing. Examples: r5, x1, z1d. 🔍 4. Storage Optimized (I, D, H Series): Use Case: High-speed and large storage. When to Prefer: Best for big data workloads, high-frequency analytics, and transactional workloads. Examples: i3, d2. 🔍 5. Accelerated Computing (P, G, F Series): Use Case: GPU-based workloads or specialized hardware. When to Prefer: Perfect for machine learning (ML), AI, video transcoding, and scientific simulations. Examples: p3, g4dn, f1. 🛠 How to Decide? Evaluate your workload requirements: CPU, memory, storage, and GPU needs. Cost-efficiency: Choose an instance that balances cost and performance. Scalability: Use auto-scaling to adapt to changing demands. Amazon EC2 (Elastic Compute Cloud) offers a variety of instance types, tailored for different workloads and performance needs. Choosing the right type ensures cost-efficiency and performance for your applications. #AWS #Compute #EC2
Like Comment
To view or add a comment, sign in
Claret Ibeawuchi

Software Engineer | Machine Learning Engineer | Building Intelligent, Secure and Resilient Systems
5mo
Report this post
Choosing the Right EC2 Instance Type for Your Workload Amazon EC2 offers a wide range of instance types, each optimized for different use cases. Whether you’re running a simple web server, a memory-intensive database, or a high-performance computing application, choosing the right instance type can make a significant difference in performance and cost-efficiency. 🖥 General Purpose Instances: Best for applications with a balanced mix of compute, memory, and networking needs. Ideal for web servers, app servers, and dev environments. ⚙️ Compute Optimized Instances: Designed for compute-bound workloads that require high-performance processing, like batch processing and media transcoding. 💾 Memory Optimized Instances: Perfect for memory-intensive applications such as in-memory databases, big data analytics, and high-performance databases. 📦 Storage Optimized Instances: These are your go-to for applications needing high, sequential read/write access to large datasets, like NoSQL databases and data warehousing. 🚀 Accelerated Computing Instances: For workloads that require hardware acceleration, like machine learning, gaming, or scientific simulations, these GPU and FPGA-based instances are your best bet. 🔍 High Memory & HPC Instances: Tailored for extremely large databases and high-performance computing tasks, ensuring low latency and high throughput. Tips for Choosing the Right Instance: 1. Understand Your Workload: Analyze whether your application is compute, memory, or storage-intensive. 2. Evaluate Costs: Consider both performance and cost to find the right balance. 3. Plan for Scalability: Choose an instance type that can grow with your needs. 4. Check Availability: Make sure the instance type is available in your region. Remember, the right instance type can optimize your performance and lower your costs. What EC2 instance type do you rely on for your workloads? Share your experiences! #AWS #CloudComputing #EC2 #Devops #CloudOptimization
Like Comment
To view or add a comment, sign in
Akash Kamble

Data Engineering Enthusiast | Expertise in DWH, SQL Server, Teradata, SSRS, Python, Distributed Computing, Cloud Basics, ADF
4mo
Report this post
🌟 Day 34 - AWS EC2 Instance Types Explained 🌟 When choosing an EC2 instance, selecting the right instance family and type is key to balancing performance, scalability, and cost. Here’s a detailed breakdown: EC2 Instance Families: General Instances: Suitable for balanced CPU and memory workloads. Memory Instances: Optimized for memory-intensive applications. CPU Instances: Ideal for compute-bound applications requiring high processing power. Storage Instances: Tailored for data-heavy applications with large storage needs. GPU Instances: Perfect for graphic-intensive workloads, such as machine learning and 3D modeling. Instance Types (CPU + Memory): Each instance type is defined by its CPU and memory configuration: t2.nano = 0.5GB RAM, 1 vCPU t2.micro = 1GB RAM, 1 vCPU t2.small = 2GB RAM, 2 vCPU t2.medium = 4GB RAM t2.large = 8GB RAM t2.xlarge = 16GB RAM Scalability: Scale Up and Scale Down AWS enables scaling by changing instance types—easy and data-safe! Scale Up Anytime 🔼 — No Data Lost Scale Down Anytime 🔽 — No Data Lost To change the instance type: Stop the EC2 instance (downtime required). Change the type, without losing data on the instance. Burstable Performance Instances Certain instances, like t2 instances, offer burstable performance with CPU Credits. These credits allow an instance to temporarily burst and handle increased workloads. 🔸 How CPU Credits Work: The instance uses credits to enter burst mode, providing high performance for a limited time. Credits vary by instance type (e.g., t2.small has 2 vCPU + 6 vCPU in burst). 💡 Key Takeaways: Pick your instance family based on workload needs (CPU, memory, storage, or GPU). Use scaling to adjust resources without data loss. Leverage burstable instances for workloads with variable demand, thanks to CPU credits. #AWS #Day34 #CloudJourney #EC2Instances #Scalability #BurstableInstances #CPUCredits #CloudOptimization
Like Comment
To view or add a comment, sign in
Nir Peleg

FinOps Engineer Team Lead ✨ Helping companies saving money on AWS ✨ FinOps Enthusiast ✨ Enjoy Cracking strategic business challenges
3mo
Report this post
🎉 Please welcome a new member to EC2 family: I8g instances 🎉 I8g instances offer the best performance in Amazon EC2 for storage-intensive workloads. I8g instances are powered by AWS Graviton4 processors that deliver up to 60% better compute performance compared to previous generation I4g instances. I8g instances use the latest third generation AWS Nitro SSDs, local NVMe storage that deliver up to 65% better real-time storage performance per TB while offering up to 50% lower storage I/O latency and up to 60% lower storage I/O latency variability. These instances are built on the AWS Nitro System, which oﬄoads CPU virtualization, storage, and networking functions to dedicated hardware and software enhancing the performance and security for your workloads. I8g instances are ideal for real-time applications like relational databases, non-relational databases, streaming databases, search queries and data analytic. https://lnkd.in/dAgsXQSJ #aws #finops #ec2 #graviton #costoptimization

Announcing Amazon EC2 I8g instances - AWS

aws.amazon.com
Like Comment
To view or add a comment, sign in
Mohamed Ghaly

HCIE| Cloud Infrastructure Specialist | Huawei Certified Trainer | Expert in Huawei cloud and Storage | DevOps| Datacenter |virtualization |cybersecurity
2mo
Report this post
Understanding AWS EC2 Instance Types Amazon Web Services (AWS) Elastic Compute Cloud (EC2) offers a diverse range of instance types tailored to meet various computing needs. Each instance type is designed for specific workloads, providing different combinations of CPU, memory, storage, and networking capacity. This article explores the main categories of EC2 instance types and their use cases. 1. General Purpose Instances General purpose instances provide a balanced mix of compute, memory, and networking resources. They are suitable for a variety of applications, including web servers and small databases. Examples: t4g: Cost-effective, burstable performance. t3: General-purpose, with a balance of resources. m5: Offers a balance of compute, memory, and networking. 2. Compute Optimized Instances Compute optimized instances are designed for compute-intensive applications. They offer high-performance processors and are ideal for tasks such as batch processing, gaming, and high-performance web servers. Examples: c5: Optimized for compute-intensive workloads. c6g: Based on AWS Graviton2 processors for better price-performance. 3. Memory Optimized Instances Memory optimized instances are tailored for memory-intensive applications. They provide high memory-to-CPU ratios, making them suitable for databases and in-memory caching. Examples: r5: Designed for memory-intensive applications. x1e: Offers the highest memory capacity for large-scale in-memory databases. 4. Storage Optimized Instances Storage optimized instances are designed for workloads that require high, sequential read and write access to very large data sets on local storage. Examples: i3: Optimized for high I/O performance and storage. d2: Designed for dense storage workloads. 5. Accelerated Computing Instances These instances use hardware accelerators, or co-processors, to perform functions such as floating-point number calculations, graphics processing, and data pattern matching more efficiently than software running on a general-purpose CPU. Examples: p4: Optimized for machine learning and high-performance computing. g4ad: Designed for graphics-intensive applications. #AWS #EC2 #CloudComputing #InstanceTypes #TechTrends #CloudInfrastructure
Like Comment
To view or add a comment, sign in

8,673 followers

521 Posts

View Profile Connect

Jing Xie’s Post

More Relevant Posts

Explore topics