Data Pipeline
Data Pipeline
PIPELINE
SIMPLIFIED
BY NISCHAY THAPA
Imagine a retail company that
wants to analyse its sales data
to understand customer
behaviours.
The company collects data and
stores it from different systems
POS System
Website
Social Media
CRM System
How do I move
the data? Destination
Source
Data Pipeline
A data pipeline is a series of
automated processes that
move data from one system or
stage to another.
Destination
Source
The processes in a data pipeline
can include
Extraction
Validation
Transformation
Loading
Quality Checks
Monitoring
Without a data pipeline,
Be cost-effective
ELT Pipeline:
Extracts data from various
sources
Loads it into a destination
system
Transforms it
Data pipelines are an efficient means for
managing and processing data enabling
automation, improved governance, and
providing accurate insights to inform
decision-making.
Quality
Check Load
Validate
Transform
Destination
Monitor
Source Extract
RESOURCES