Big Data Analytics Presentation
Big Data Analytics Presentation
B I G DATA A N A LY S I S A N D
PROCESSING
Introduction to big
data processing
Introduction
Big data processing refers to the methods and technologies used to
handle large volumes of data that traditional data processing applications
can't manage efficiently. This data typically comes from various sources
such as social media, sensors, machines, transactions, and more. The
three main characteristics of big data, often referred to as the three Vs,
are volume, velocity, and variety:
• Volume: Big data involves large amounts of data, often ranging from
terabytes to petabytes or even exabytes.
• Velocity: Data streams in at high speeds and needs to be processed
quickly to derive insights or take actions in real-time or near real-time.
• Variety: Data comes in various formats and types, including structured
data (like databases), semi-structured data (like XML files), and
unstructured data (like text, images, and videos).
components of Big data
• Storage processing
Systems: Big data storage solutions like Hadoop
Distributed File System (HDFS), Amazon S3, or Google Cloud
Storage are used to store massive amounts of data across
distributed systems.