Course 2 - Innovating With Data With Google Cloud
Course 2 - Innovating With Data With Google Cloud
not
dist
rib
ut
e
Modules
Student Slides
01 The Value of Data
Innovating with Data
with Google Cloud 02 Data Consolidation and Analytics
Topics covered
Types of data
Important considerations
for using data in the cloud
Do
not
dist
rib
ut
e
As organizations have digitized their operations, various forms of business data have become
available. This includes financial information, logistics data, production output, quality reports,
etc. Businesses also have access to new kinds of data about their customers.
Do
not
dist
rib
ut
e
Large enterprises with traditional IT infrastructures face several limitations when it comes to
leveraging the value of data. These limitations include: processing volumes and varieties of new
data, finding cost effective solutions, scaling resource capacity up or down, accessing historical
data, and deriving insights from historical and new data.
Do
not
dist
rib
ut
e
Economies of scale
Automation
Google Cloud offers
Rapid elasticity
Data access
Do
not
dist
rib
ut
e
Businesses can now consume, store and process terabytes of data in real-time,
and run queries—that is, requests to retrieve and use data, instantly.
Do
not
dist
rib
ut
e
Resources are now distributed across a global network. Multiple data centers create resilience
against data loss or service disruption, without any extra overhead for businesses.
Do
not
dist
rib
ut
e
User data
Structured
Structured data is highly organized. Examples include customer records consisting of names,
addresses, credit card numbers, and other quantitative data. Structured data can be easily
stored and managed in databases.
Do
not
dist
rib
ut
e
BLOB
Do
not
dist
rib
ut
e
Organizations rely on both structured and unstructured data to gain insight and make
intelligent decisions. However, unstructured data has historically been very difficult to analyze.
Do
not
dist
rib
ut
e
API
Consider the source of the data, how it is being collected and where it’s stored.
Do
not
dist
rib
ut
e
AI
ML
Do
not
dist
rib
ut
e
Topics covered
When you store your data on-premises, you are responsible for the IT infrastructure
that supports the collection, security, and processing of that data. You’re also
responsible for maintaining and expanding the capacity of your IT infrastructure.
Do
not
dist
rib
ut
e
With cloud, you can ‘rent’ space from public cloud providers like Google Cloud.
This means that their data storage and compute power is elastic.
Do
not
dist
rib
ut
e
Another way that migrating data to the cloud provides a better return on investment is the
speed at which you can ingest and use data. Businesses can now ingest data in real-time.
Do
not
dist
rib
ut
e
What is a database?
Scale
Do
not
dist
rib
ut
e
It’s another fully managed database service, and it’s designed for global scale. With Cloud Spanner,
data is automatically and instantly copied across regions. This replication means that if one region
goes offline, the organization’s data can still be retrieved from another region.
Do
not
dist
rib
ut
e
With Google Cloud databases, businesses can build and deploy faster, deliver
transformative applications, and maintain portability and control of their data.
Do
not
dist
rib
ut
e
Databases are built and optimized to enable ingesting large amounts of data from many
different sources efficiently. However, data warehouses are built to enable rapid analysis of
large and multi-dimensional datasets.
Do
not
dist
rib
ut
e
Most data warehouse providers link storage and compute together, so customers
are charged for compute capacity whether they are running a query or not.
Do
not
dist
rib
ut
e
What is BigQuery?
What is BigQuery?
They often hold 'back-up' data, which helps businesses build resilience against unexpected
harm affecting their data. Businesses are protected against data loss. They also hold data
that is historic and not relevant to day-to-day business operations.
Do
not
dist
rib
ut
e
One way to classify an organization’s requirements for storage is by how often they need to
access the data.
Do
not
dist
rib
ut
e
Multi-regional
storage
Cloud Storage offers multi-regional storage. It’s ideal for serving content to users worldwide.
Do
not
dist
rib
ut
e
Other solutions let everyone in the business perform their own data analysis. But they can only
perform their analysis with portions of the available data. This means that only a few people, or
possibly no one, has a full view of the company’s business data.
Do
not
dist
rib
ut
e
What is Looker?
What is Looker?
Topics covered
Module 3: Student Slides
The definition of ML and AI
Innovation with
Data quality considerations
Machine Learning
Real world ML use cases
Do
not
dist
rib
ut
e
The dashboard and the report are examples of backward-looking data. They look at what
happened in the past. To create value in your business, you need to use that data to make
decisions for future business.
Do
not
dist
rib
ut
e
What is AI?
What is ML?
Definition used in this course: ML is a way to use standard algorithms or standard models to
analyze data in order to derive predictive insights and make repeated decisions at scale.
Do
not
dist
rib
ut
e
Algorithm Data
Bugs in ML are often caused by bugs in the data. In traditional software development, a bug is a
mistake in the code that causes unexpected or undesired behavior. In ML, even though there can
be bugs in the implementation of an algorithm, bugs in data are far more common.
DDoo
n
noo
tt d
ist
rib
buut
tee
Data is the tunnel through which the model views the world.
Do
not
dist
rib
ut
e
Improve coverage?
Improve cleanliness?
Google Cloud AI Platform is a unified, simply managed platform that makes machine learning easy to
adopt by analysts and developers. It provides modern ML services, with the ability to generate tailored
models and use pre-trained models.
Do
not
dist
rib
ut
e
APIs
APIs are simple methods and tools to connect various applications. They can be deployed in a virtual
private cloud, on-premises, or in Google’s public cloud. They allow developers to quickly and easily
train custom models regardless of their level of ML experience.
Do
not
dist
rib
ut
e