Aws s3 Guide
Aws s3 Guide
s3.png
General
Amazon S3 is object storage built to store and retrieve any amount of data from
anywhere on the Internet.
It’s a simple storage service that offers an extremely durable, highly available,
and infinitely scalable data
storage infrastructure at very low costs.
s3-capabilities.jpg
Storage Classes
S3 Standard (durable, immediately available, frequently accessed).
S3 Intelligent-Tiering (automatically moves data to the most cost-effective tier).
S3 Standard-IA (durable, immediately available, infrequently accessed).S3 One Zone-
IA (lower cost for infrequently accessed data with less resilience).
S3 Glacier (archived data, retrieval times in minutes or hours).
S3 Glacier Deep Archive (lowest cost storage class for long term retention)
Access
By default, a bucket, its objects, and related sub-resources are all private.
IAM policies.
Bucket policies.
Access Control Lists (ACLs).
Query string authentication (URL to an Amazon S3 object which is only valid for a
limited time).
Resource-based policies:
ACLs:
You grant permission to another AWS account using the email address or the
canonical user ID.
However, if you provide an email address in your grant request, Amazon S3 finds the
canonical user ID
for that account and adds it to the ACL.
Grantee accounts can then delegate the access provided by other accounts to their
individual users.
Transfer Acceleration
Amazon S3 Transfer Acceleration enables fast, easy, and secure transfers of files
over long distances between
your client and your Amazon S3 bucket.
Encryption
SSE-S3: handled/managed by AWS, AES 256
SSE-KMS: managed with AWS KMS keys
SSE-C: client manages the keys, S3 manages encryption
Replication
SRR - same region replication
CRR - cross-region replication
only works on versioned buckets
this is an async process
no chaining allowed, can't set up replication region A -> region B - > region C
Exam tips
Combine S3 and EC2 in the same AWS Region to optimize performance, reduce latency
and data transfer cost.
Use Byte-Range Fetches for partial data retrieval and parallel download.
Use Transfer Acceleration to minimize latency caused by distance (GB/TB of data
across large geographical
distances or access to a centralized bucket)