CLOUD ia2
CLOUD ia2
• Data ingestion:
• means transferring data from the source to your storage, data lake, or data warehouse.
• This would involve something such as Azure Synapse Analytics using data integration to
transfer data from various sources such as on-premises databases and SaaS products to a
data lake.
• Data storage :
• Once data has been ingested from various data sources, all the data is stored in a data lake.
• The data residing within the lake will still be in a raw format and includes both structured
and unstructured data formats.
• Data sharing :
• Azure Data Share allows you to securely manage and share your big data to other parties
and organizations.
• Data preparation:
• Once data is ingested, the next step is data preparation.
• This is a phase where the data from di erent data sources is pre-processed for data analytics
purposes.
ff
ff
ff
11. DISCUSS THE ARCHITECTURE OF AZURE SYNAPSE ANALYTICS SERVICE
WITH A SUITABLE DIAGRAM.
->
• Azure Synapse is an enterprise analytics service that accelerates time to insight across data
warehouses and big data systems.
• Azure Synapse Analytics is a fully managed, integrated data analytics service that blends data
warehousing, data integration, and big data processing with accelerating time to insight into a
single service.
• Synapse SQL is a distributed query system for T-SQL that enables data warehousing and data
virtualization scenarios and extends T-SQL to address streaming and machine learning scenarios.
• Apache Spark for Azure Synapse deeply and seamlessly integrates Apache Spark--the most
popular open source big data engine used for data preparation, data engineering, ETL, and
machine learning.
• Azure Synapse removes the traditional technology barriers between using SQL and Spark together.
You can seamlessly mix and match based on your needs and expertise.
• Azure Synapse contains the same Data Integration engine and experiences as Azure Data Factory,
allowing you to create rich at-scale ETL pipelines without leaving Azure Synapse Analytics.