Amazon Redshift Overview and Guide

The document provides an overview of Amazon Redshift, a fully managed cloud data warehousing service that enables users to analyze and visualize data from various sources. It covers key features such as serverless options, automatic scaling, and data sharing capabilities, as well as the architecture and instance types available. Additionally, it highlights the benefits of using Redshift Spectrum for querying external data stored in Amazon S3.

Amazon Redshift 101

© 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
1. AWS Services Overview
2. Redshift Overview
3. Redshift Getting Started
4. Lab
AWS Services Overview

What is the cloud?

Cloud computing lets you stop thinking of infrastructure as hardware, and instead think of it (and use it) as software.

• Programmable resources
• Dynamic abilities
• Pay as you go
How does it work?
AWS owns and maintains the network-connected hardware
You provision and use what you need

Service categories include Storage, Database, Business applications, Compute, Networking & content delivery, and Internet of Things.
Shared responsibility model
Customer responsibility (security in the cloud):
• Customer data
• Platform, applications, and identity and access management (IAM)
• Operating system, network, and firewall configuration
• Client-side data encryption and data integrity authentication
• Server-side encryption (file system/data)
• Network traffic protection (encryption, integrity, identity)

AWS responsibility (security of the cloud):
• AWS foundation services: Compute, Storage, Databases, Networking
• AWS global infrastructure: Regions, Availability Zones, edge locations
AWS global infrastructure
• A data center typically houses thousands of servers
• An Availability Zone (AZ) consists of one or more data centers and is designed for fault isolation (e.g. eu-west-1a, eu-west-1b, eu-west-1c)
• Each AWS Region is made up of two or more AZs (e.g. eu-west-1, Ireland)
• AWS has 32 Regions worldwide
AWS Global Infrastructure: Regions & AZs
(The number after each Region is its Availability Zone count; announced Regions are listed without counts; * marks the partner-operated China Regions.)

• N America: Canada Central 3, Oregon 4, GovCloud US-East 3, GovCloud US-West 3, Northern California 3, Northern Virginia 6, Ohio 3; announced: Canada West
• S America: São Paulo 3
• Europe: Frankfurt 3, Stockholm 3, Ireland 3, Zurich 3, London 3, Milan 3, Paris 3, Spain 3
• Middle East: Bahrain 3, Tel Aviv 3, UAE 3
• Africa: Cape Town 3
• Asia Pacific: *Beijing 3, Osaka 3, *Ningxia 3, Seoul 4, Hong Kong 3, Singapore 3, Hyderabad 3, Tokyo 4, Jakarta 3, Mumbai 3; announced: Malaysia, Thailand
• Australia & New Zealand: Melbourne 3, Sydney 3; announced: Auckland
AWS categories of services

Analytics; Application Integration; AR and VR; Blockchain; Business Applications; Compute; Cost Management; Customer Engagement; Database; Developer Tools; End User Computing; Game Tech; Internet of Things; Machine Learning; Management and Governance; Media Services; Migration and Transfer; Mobile; Storage; Robotics; Satellite; Networking and Content Delivery; Security, Identity, and Compliance
Core service areas
• Compute (e.g. Amazon EC2)
• Storage (e.g. Amazon S3, Amazon EBS)
• Databases (e.g. Amazon DynamoDB)
• Networking (e.g. Amazon VPC, Amazon Route 53)
• Security

A typical application combines these: users reach your application through Amazon Route 53 and an Amazon VPC, compute runs on Amazon EC2 with Amazon EBS volumes, and data lives in Amazon S3 and Amazon DynamoDB.
Amazon Elastic Compute Cloud (Amazon EC2)

• Complete control of your computing resources
• Resizable compute capacity
• Reduced time required to obtain and boot new server instances
• Over 750 types of compute instances
AWS storage options

• Amazon Simple Storage Service (Amazon S3): scalable, highly durable object storage in the cloud
• Amazon Elastic File System (Amazon EFS): scalable network file storage for Amazon EC2 instances
• Amazon S3 Glacier: low-cost, highly durable archive storage in the cloud
• Amazon Elastic Block Store (Amazon EBS): network-attached volumes that provide durable block-level storage for Amazon EC2 instances
Amazon S3

• Object-level storage
• Designed for 99.999999999% (eleven nines) durability
• Event triggers

Use cases:
• Content storage and distribution
• Backup and archiving
• Big data analytics
• Disaster recovery
• Static website hosting
Amazon EBS
• Persistent network-attached block storage for EC2 instances
• Different drive types
• Scalable
• Pay only for what you provision
• Snapshot functionality (e.g. daily snapshots) for backup and recovery
• Volumes can be detached and reattached to other EC2 instances
• Encryption available
Redshift Overview

Amazon Redshift
FULLY MANAGED, AI-POWERED CLOUD DATA WAREHOUSING

Data Insights
Analyze and
Transactional data
visualize data

Clickstream Amazon Redshift


Deliver real-time &
Unify data across databases, data lakes and data predictive analytics
warehouses with a zero-ETL approach
IoT telemetry

Build data-driven
Best-in-class security, applications
Application logs governance, and compliance

© 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved.
16
Amazon Redshift and Amazon Redshift Serverless
• Best price performance cloud data warehouse
• Interfaces and integrations: BI tools, Data API, Query Editor, Amazon Redshift Integration for Apache Spark, Redshift ML, and AWS Data Exchange (including third-party data exchanges)
• Automatic compute management with pay-for-use pricing (Serverless)
• Automatic scaling for consistent performance
• Scale and pay for compute and storage independently
• Workload isolation and chargeability with data sharing
• Near real-time data ingestion: streaming ingestion and zero-ETL
• Native data lake querying of Amazon S3 data (Parquet, ORC, JSON)
• Redshift Managed Storage backed by Amazon S3
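The streaming ingestion path above can be sketched in Redshift SQL. The stream name and IAM role below are hypothetical, and the exact stream columns depend on the payload:

```sql
-- Register a schema over a Kinesis Data Streams source (role is hypothetical).
CREATE EXTERNAL SCHEMA kds
FROM KINESIS
IAM_ROLE 'arn:aws:iam::123456789012:role/MyStreamingRole';

-- A materialized view over the stream; refreshing it pulls new records
-- with low latency. JSON_PARSE stores each record as SUPER.
CREATE MATERIALIZED VIEW mv_clickstream AUTO REFRESH YES AS
SELECT approximate_arrival_timestamp,
       JSON_PARSE(kinesis_data) AS payload
FROM kds."my_click_stream";
```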
Customers – sample list
Tens of thousands of customers process exabytes of data with Amazon Redshift daily

• NTT DOCOMO: moved >10 PB of data from on premises to the cloud
• Warner Bros.: performance, scale, and cost-efficiency
• Yelp: enabling a data-driven organization with concurrency scaling
• Jack in the Box: improved operations by moving off of an on-premises DW
• Pfizer: provides scientists with near real-time analysis
Redshift cluster architecture
SQL clients and BI tools connect to the leader node over JDBC/ODBC.

Leader node
• SQL endpoint
• Stores metadata
• Coordinates parallel SQL processing and ML optimizations
• The leader node is no-charge for clusters with 2+ nodes

Compute nodes
• Split into "slices"
• Local SSDs for caching
• Execute queries in parallel
• Load, unload, backup, and restore from S3

Redshift Managed Storage
• Resides in S3 (exabyte-scale object storage)
• Available across the entire Region
• Pay for space used (not provisioned)
• Scales independently of compute
Redshift instance types
• RA3 (current generation): solid-state disks + Amazon S3, using Amazon Redshift Managed Storage (RMS). A Redshift cluster can have up to 128 ra3.16xlarge nodes (16 PB of managed storage) and can support exabytes of data with its Redshift data lake support.
• Dense compute (DC2): solid-state disks

Additional documentation
• Working with clusters
Amazon Redshift Serverless
• Get started with analytics in seconds
• Experience better price-performance
• Save costs and stay on budget; pay for what you use

You focus on insights; Amazon Redshift Serverless takes care of the rest:
• Automatic provisioning
• Automatic scaling
• Automatic failover
• Automated patching
• Advanced monitoring
• Backup and recovery
• Routine maintenance
• Security and industry compliance
Amazon Redshift Serverless
• Access via JDBC/ODBC, the Data API, and Query Editor
• Intelligent and dynamic compute management
• Automatic workload management, automatic scaling, automatic tuning, and automatic maintenance
• ML-based workload monitoring
• Performance at scale; pay for use
• Redshift managed storage on Amazon S3 (Apache Parquet, ORC)
• Integrates with data sharing across clusters, streams, operational databases, Amazon SageMaker, Amazon S3, and AWS Lambda
Redshift Serverless or Provisioned: highlights

Provisioned
• Cluster of compute nodes
• Greater control of configuration and workload management
• Predictable cost
• Discounts with Reserved Instances

Serverless
• A workgroup is a collection of compute resources
• Workgroup resources are measured in Redshift Processing Units (RPUs)
• Simplified management
• Pay for use
Redshift Spectrum overview

Redshift Spectrum is a feature of Redshift that allows SQL queries on external data stored in Amazon S3. Queries run directly against data in S3 using thousands of Spectrum nodes.

Benefits
• Enables the modern data architecture pattern to query exabytes of data in an S3 data lake
• Data is queried in place; no loading of data
• Keeps your data warehouse lean by ingesting warm data locally while keeping other data in the data lake within reach
• Write query results from Redshift directly to S3 external tables
• Create materialized views on S3 data using Redshift Spectrum queries
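As a sketch of the Spectrum workflow described above (the schema, table, Glue database, S3 path, and IAM role are all hypothetical names):

```sql
-- Register an external schema backed by the AWS Glue Data Catalog.
CREATE EXTERNAL SCHEMA spectrum_schema
FROM DATA CATALOG
DATABASE 'my_glue_db'
IAM_ROLE 'arn:aws:iam::123456789012:role/MySpectrumRole'
CREATE EXTERNAL DATABASE IF NOT EXISTS;

-- Define an external table over Parquet files in S3; no data is loaded.
CREATE EXTERNAL TABLE spectrum_schema.clicks (
  user_id BIGINT,
  url     VARCHAR(2048),
  ts      TIMESTAMP
)
STORED AS PARQUET
LOCATION 's3://my-bucket/clicks/';

-- Query S3 data in place, joining with a (hypothetical) local Redshift table.
SELECT u.name, COUNT(*)
FROM spectrum_schema.clicks c
JOIN users u ON u.user_id = c.user_id
GROUP BY u.name;
```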
Life of a query

Example query: SELECT COUNT(*) FROM S3.EXT_TABLE GROUP BY …

1. Query is submitted via JDBC/ODBC
2. Query is optimized and compiled at the leader node, which determines what gets run locally and what goes to Amazon Redshift Spectrum
3. Query plan is sent to all compute nodes
4. Compute nodes dynamically prune partitions
5. Each compute node issues multiple requests to the Amazon Redshift Spectrum layer
6. Amazon Redshift Spectrum nodes scan S3 data, using the Glue Data Catalog, a Hive metastore, or Lake Formation for table metadata
7. Amazon Redshift Spectrum projects, filters, joins, and aggregates
8. Final aggregations and joins with local Amazon Redshift tables are done in-cluster
9. Result is sent back to the client
Data storage in Redshift
• Data loaded into Redshift is stored in Redshift Managed Storage (RMS); storage is columnar
• Structured and semi-structured data can be loaded
• Amazon Redshift is ANSI SQL and ACID compliant
• Does not require indexes or DB hints; it leverages sort keys, distribution keys, and compression instead to achieve fast performance through parallelism and efficient data storage
• Data is organized as: namespace > database > schema > objects
  • One namespace per endpoint; a namespace can contain many databases, each database many schemas, and each schema holds the code and data objects
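The namespace > database > schema > objects hierarchy can be illustrated with hypothetical names (the three-part database.schema.object reference assumes cross-database query support, available on RA3 and Serverless):

```sql
CREATE DATABASE salesdb;
-- After connecting to salesdb:
CREATE SCHEMA reporting;
CREATE TABLE reporting.daily_totals (  -- an object within the schema
  sale_dt DATE,
  total   DECIMAL(12,2)
);
-- Objects can be referenced with qualified names, including from
-- another database in the same namespace:
SELECT * FROM salesdb.reporting.daily_totals;
```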
Data sharing with Amazon Redshift
• Instant, secure, and live data sharing across Redshift data warehouses
• Within and across AWS accounts, and across AWS Regions
• Live and transactionally consistent
• Flexible multi-cluster and data mesh architectures, spanning Redshift warehouses and the Amazon S3 data lake
• Serves BI and analytics apps, machine learning, and data processing and advanced analytics workloads
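A minimal producer/consumer sketch of data sharing. The share name, schema, account ID, and namespace GUID are hypothetical, and cross-account shares additionally require authorization by an administrator:

```sql
-- On the producer warehouse:
CREATE DATASHARE sales_share;
ALTER DATASHARE sales_share ADD SCHEMA reporting;
ALTER DATASHARE sales_share ADD TABLE reporting.daily_totals;
GRANT USAGE ON DATASHARE sales_share TO ACCOUNT '123456789012';

-- On the consumer warehouse: surface the share as a local database.
CREATE DATABASE sales_from_share FROM DATASHARE sales_share
  OF ACCOUNT '123456789012' NAMESPACE 'producer-namespace-guid';

-- Queries against the share are live and transactionally consistent.
SELECT * FROM sales_from_share.reporting.daily_totals;
```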
Lab Setup

Redshift Getting Started

Redshift: use popular data models

Redshift can be used with a number of data models. The STAR schema (highly denormalized) is the most common; the snowflake schema is less common.

A commonly used data model with Amazon Redshift is the STAR schema, which separates data into large fact and dimension (dim) tables:
• Facts refer to specific events (e.g. "order submitted"), and fact tables hold summary detail for those events, e.g. the high-level attributes of an order submitted, such as order_id, order_dt, product_id, and total_cost. Fact tables use foreign keys to link to dim tables.
• The dimensions that make up a fact often have attributes themselves that are more efficiently stored in separate dim tables. For example, a fact might contain a product_id, but the actual product details would be contained in a separate products dim table (e.g. product_price, height_cm, width_cm, and product_id are columns that might be found in a products dim table).

Best practice: avoid highly normalized models. Models such as 3NF resemble the STAR schema but have much more table normalization and are typically more appropriate for OLTP systems.
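The order example above can be sketched as a minimal STAR schema. The tables are hypothetical, and the distribution and sort keys are illustrative choices:

```sql
-- Small dimension table, replicated to every node for cheap joins.
CREATE TABLE dim_products (
  product_id    INT PRIMARY KEY,   -- informational constraint
  product_price DECIMAL(10,2),
  height_cm     SMALLINT,
  width_cm      SMALLINT
)
DISTSTYLE ALL;

-- Large fact table; rows hold event-level summary detail.
CREATE TABLE fact_orders (
  order_id   BIGINT,
  order_dt   DATE SORTKEY,                              -- common filter column
  product_id INT REFERENCES dim_products (product_id),  -- FK to the dim table
  total_cost DECIMAL(12,2)
)
DISTKEY (product_id);  -- collocate join rows on the same slice
```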
Redshift datatypes

Scalar datatypes:
• Numeric types: integer types (SMALLINT, INT, BIGINT), DECIMAL/NUMERIC, and floating point types (REAL, DOUBLE PRECISION)
• Character types: CHAR, VARCHAR, NCHAR, TEXT, BPCHAR
• Datetime types: DATE, TIME, TIMETZ, TIMESTAMP, TIMESTAMPTZ
• Other scalar types: BOOLEAN, HLLSKETCH, GEOMETRY, VARBYTE

Vector datatype:
• SUPER
Semi-structured data – SUPER datatype

• Easy, efficient, and powerful JSON processing
• Fast row-oriented data ingestion
• Fast column-oriented analytics with materialized views over SUPER/JSON
• Access to schema-less nested data with easy-to-use SQL extensions powered by the PartiQL query language

Example customers table with columns id (INTEGER), name (SUPER), and phones (SUPER):

id | name                                                 | phones
 1 | {"given":"Jane", "family":"Doe"}                     | [{"type":"work", "num":"9255550100"}, {"type":"cell", "num":"6505550101"}]
 2 | {"given":"Richard", "family":"Roe", "middle":"John"} | [{"type":"work", "num":"5105550102"}]

SELECT c.name.given AS firstname, c.name.middle AS middlename, ph.num
FROM customers c, c.phones ph
WHERE ph.type = 'work';

firstname | middlename | num
----------+------------+-----------
"Jane"    | null       | 9255550100
"Richard" | "John"     | 5105550102
Row-store vs column-store
• Row storage (e.g. MySQL): all row fields are stored together on disk (typically in a sequential file)
  • Accessing a column (example: scanning the SSN of all residents) with row storage:
    • Scans every column in every row of the table
    • Incurs unnecessary I/O and caching overhead
• Column storage (e.g. Amazon Redshift): each table column is stored separately on disk (typically in a separate file or set of files)
  • Accessing a column (example: scanning the SSN of all residents) with columnar storage:
    • Only scans blocks for the relevant column(s)
    • Significantly less I/O
Row-store read vs column-store read
Given the following table definition and data for the deep_dive table, how will a simple SQL query behave in a row-based data store, and then in a column-based store?

CREATE TABLE deep_dive (
  aid INT        --airport_id
  ,loc CHAR(3)   --location
  ,dt DATE       --date
);

SELECT min(dt) FROM deep_dive;

Row-based storage behavior:
• Needs to read everything
• Unnecessary I/O

Column-based storage behavior:
• Only scans blocks for the relevant column
• Significantly less I/O
Materialized views
• Improve performance of complex, SLA-sensitive, predictable, and repeated queries using materialized views
• A materialized view persists the result set of the associated SQL
• Materialized views can be refreshed automatically or manually
• Redshift automatically determines the best way to update data in the materialized view (incremental or full refresh)
• Automatic query rewrite leverages relevant materialized views and can improve query performance by order(s) of magnitude
• Automated materialized views: Redshift continuously monitors the workload to identify queries that will benefit from having an MV, and automatically creates and manages MVs for them

Redshift materialized views are created using the CREATE statement and can be included (the default) or excluded from Redshift backups. Materialized views can also have table attributes such as distribution style and sort keys, and can be refreshed at any time:

CREATE MATERIALIZED VIEW mv_name
[ BACKUP { YES | NO } ]
[ table_attributes ]
AS query

REFRESH MATERIALIZED VIEW mv_name;
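A hypothetical concrete instance of the syntax above, with table attributes and automatic refresh (the fact_orders table and its columns are illustrative names):

```sql
CREATE MATERIALIZED VIEW mv_daily_revenue
BACKUP YES                 -- include in Redshift backups (the default)
DISTKEY (product_id)
SORTKEY (order_dt)
AUTO REFRESH YES           -- let Redshift refresh it automatically
AS
SELECT order_dt, product_id, SUM(total_cost) AS revenue
FROM fact_orders
GROUP BY order_dt, product_id;

-- Manual refresh remains available at any time:
REFRESH MATERIALIZED VIEW mv_daily_revenue;
```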
Redshift Best Practices

Table design best practices
• Redshift performance is about efficient I/O:
  • Make columns only as wide as they need to be
  • Define primary key and foreign key constraints
  • Let COPY choose compression encodings
  • Choose the best distribution style
  • Choose the best sort key (AUTO vs timestamp vs filtering vs frequent joins)
• Use appropriate data types:
  • Use date/time data types for date columns
  • Multibyte characters: use the VARCHAR data type for UTF-8 multibyte character support (up to a maximum of four bytes per character)
  • Spatial data can be natively stored, retrieved, and processed using the GEOMETRY data type and spatial functions

Additional documentation
• Best Practices for Designing Tables
• Querying Spatial Data in Redshift
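The guidance above, applied to a hypothetical table (the table, columns, and the referenced users table are illustrative names):

```sql
CREATE TABLE page_views (
  view_id   BIGINT IDENTITY(1,1),
  user_id   INT NOT NULL,
  page_url  VARCHAR(2048),         -- only as wide as needed; VARCHAR for UTF-8
  viewed_at TIMESTAMP NOT NULL,    -- date/time type, not a string
  PRIMARY KEY (view_id),           -- informational; used by the planner
  FOREIGN KEY (user_id) REFERENCES users (user_id)
)
DISTKEY (user_id)                  -- distribute on a frequent join column
SORTKEY (viewed_at);               -- sort on the common filter column
-- Compression: load with COPY and let it choose encodings,
-- or rely on ENCODE AUTO (the default for new tables).
```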
Data loading best practices
• Use the COPY command to load data whenever possible
  • Use a single COPY command per table
  • Writes are serial per table; commits are serial per cluster
• Use multi-row inserts if COPY is not possible
• Bulk insert operations (INSERT INTO...SELECT and CREATE TABLE AS) provide high-performance data insertion
• Enforce primary, unique, or foreign key constraints outside of Redshift
• Wrap workflow/statements in an explicit transaction
• Consider using TRUNCATE instead of DELETE
• Use ALTER TABLE APPEND to move rows faster from a source table to a target table
• Staging tables:
  • Use a temporary table, or a permanent table with the "BACKUP NO" option
  • Use CREATE TABLE LIKE to mirror compression settings
  • Define the same key column as DISTSTYLE KEY between the staging and production tables

ETL best practices
• Staging tables are more performant when created using CREATE TABLE LIKE instead of SELECT INTO #my_temp_table
• Merge operations should be performed via INSERT/UPDATE from the staging table to the target table, with deduplication

Additional documentation
• Data Loading Best Practices
• Loading Data from S3
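A sketch of the staging-table load pattern above. The bucket, IAM role, and table names are hypothetical:

```sql
BEGIN;  -- explicit transaction around the workflow

-- Staging table mirrors the target, including compression and DISTKEY.
CREATE TEMP TABLE stage_orders (LIKE fact_orders);

-- Single COPY per table; Redshift parallelizes across the input files.
COPY stage_orders
FROM 's3://my-bucket/orders/2023-12-26/'
IAM_ROLE 'arn:aws:iam::123456789012:role/MyCopyRole'
FORMAT AS PARQUET;

-- Deduplicating merge: delete matching rows, then insert the new batch.
DELETE FROM fact_orders
USING stage_orders
WHERE fact_orders.order_id = stage_orders.order_id;

INSERT INTO fact_orders SELECT * FROM stage_orders;

COMMIT;
```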
Unloading data: the UNLOAD command

• UNLOAD is the reverse of COPY: it outputs data from Amazon Redshift to S3
• Runs from a SELECT statement; an ORDER BY clause is respected by UNLOAD if PARALLEL OFF
• Encryption and compression are handled automatically
• Runs in parallel on all compute nodes

UNLOAD output
• CSV, JSON, or Parquet (data lake export) file formats
• Generates one or more files per slice across all compute nodes
• The maximum file size written to S3 can be controlled (internal maximum 6.2 GB)
• Generates a manifest for all unloaded files (useful for COPY into another cluster)
• Controls whether files can overwrite existing locations or not

Syntax:
UNLOAD ('select-statement')
TO 's3://object-path/name-prefix'
IAM_ROLE 'arn' [ option [ ... ] ]
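A hypothetical example of the command: export filtered query results to S3 as Parquet, capping the file size and writing a manifest (bucket, role, and table names are illustrative):

```sql
UNLOAD ('SELECT order_dt, product_id, total_cost
         FROM fact_orders
         WHERE order_dt >= ''2023-01-01''')
TO 's3://my-bucket/exports/orders_'
IAM_ROLE 'arn:aws:iam::123456789012:role/MyUnloadRole'
FORMAT AS PARQUET
MAXFILESIZE 256 MB     -- cap the size of each file written to S3
MANIFEST;              -- write a manifest listing all output files
```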
Query SQL best practices
• Avoid using SELECT *; include only the columns you specifically need, to reduce I/O
• Use a CASE expression to perform complex aggregations instead of selecting from the same table multiple times
• If you use both GROUP BY and ORDER BY clauses, make sure that you put the columns in the same order in both
• Use subqueries in cases where one table in the query is used only for predicate conditions and the subquery returns a small number of rows (less than about 200). The following example uses a subquery to avoid joining the LISTING table.

Use
select sum(sales.qtysold) from sales
where salesid in (
  select listid from listing where listtime > '2023-12-26'
);

Instead of
select sum(sales.qtysold) from sales
join listing on sales.salesid = listing.listid
where listing.listtime > '2023-12-26';

Additional documentation
• Best Practices for Designing Queries
• Redshift SQL Reference
Query SQL best practices
Joins:
• Don't use cross-joins unless absolutely necessary
• Use distribution keys as join columns

Vacuum and analyze:
• Redshift automatically performs VACUUM and ANALYZE in the background during periods of low workload
• Redshift users are still empowered to explicitly invoke VACUUM and then ANALYZE as part of their workloads
• Explicitly invoking VACUUM and then ANALYZE ensures that a table is sorted, defragmented, and analyzed immediately and with priority, for the benefit of the next steps in a workflow
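For example, to maintain a table explicitly after a large load (the table name is hypothetical):

```sql
VACUUM FULL fact_orders;   -- re-sort rows and reclaim space from deleted rows
ANALYZE fact_orders;       -- refresh planner statistics for the table
```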
Query SQL best practices
Query predicates:
• Use predicates to restrict the dataset as much as possible, and use sort keys in the predicates
• In the predicate, use the least expensive operators that you can:
  • Comparison condition operators are preferable to LIKE operators
  • LIKE operators are still preferable to SIMILAR TO or POSIX operators
• Avoid using functions in query predicates
• Add predicates to filter tables that participate in joins, even if the predicates apply the same filters

Use
select listing.sellerid, sum(sales.qtysold)
from sales, listing
where sales.salesid = listing.listid
and listing.listtime > '2008-12-01'
and sales.saletime > '2008-12-01'
group by 1 order by 1;

Instead of
select listing.sellerid, sum(sales.qtysold)
from sales, listing
where sales.salesid = listing.listid
and listing.listtime > '2008-12-01'
group by 1 order by 1;
Query Editor v2 best practices
• For large SQL statements (>30K characters), use notebooks
• Notebooks run SQL statements one at a time; the editor can run SQL statements in parallel
• Minimize the number of open Query Editor windows
• Close sessions once complete; don't leave connections open
• Queries continue to run even after closing windows

Additional documentation
• Using Amazon Redshift Query Editor v2
Query Editor v2 Demo

Lab

Thank you!

