0% found this document useful (0 votes)
15 views

BA ppt

Uploaded by

devil.ten10
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views

BA ppt

Uploaded by

devil.ten10
Copyright
© © All Rights Reserved
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

TITLE – CONCEPT OF BIG DATA

PRESENTED BY – DIVYANSHU UPADHYAY


NAMAN GUPTA
ADARSH PANDEY
PANKAJ CHAUDHARY
SHIVBRAT SINGH

SUBJECT – BUSINESS ANALYTICS


INTRODUCTION TO BIG DATA
• Big Data refers to extremely large and complex datasets
that traditional data processing software cannot handle
effectively. These datasets are characterized by their vast
size, fast generation, and variety of types. Big Data has
revolutionized how businesses, governments, and
organizations approach decision-making, enabling them to
gain insights from diverse and vast information sources.
IMPORTANCE OF BIG DATA IN
TODAY’S BUSINESS LANDSCAPE
• Big Data has become a cornerstone of modern business strategies, enabling
companies to harness vast amounts of information to drive innovation,
improve efficiency, and stay competitive. Here's why Big Data is so crucial in
today’s business environment:
• 1. Enhanced Decision-Making
• 2. Personalization and Customer Experience
• 3. Operational Efficiency
• 4. Market and Competitive Analysis
• 5. Innovation and Product Development
HOW BIG DATA IMPACTS DECISION-
MAKING
• Big Data has transformed the decision-making process in businesses by
providing valuable insights, enabling more informed, faster, and precise choices.
Here's how it impacts decision-making:
• 1. Data-Driven Decision Making
• 2. Faster Decision Making
• 3. Customer-Centric Decisions
• 4. Risk Management and Mitigation
• 5. Operational Efficiency and Cost Reduction
CHARACTERISTICS OF BIG DATA (4VS)
Volume: Refers to the vast amounts of data
generated every second. The sheer scale of data
from various sources like social media, IoT devices,
and digital platforms is immense, making
traditional data storage and processing methods
inadequate.

Velocity: The speed at which data is generated


and processed. Real-time or near-real-time
processing is critical in many industries to enable
timely decision-making.

Variety: Big Data comes in many forms—


structured, semi-structured, and unstructured. This
includes text, images, videos, audio, sensor data,
etc., requiring sophisticated methods to analyze.

Veracity: The accuracy and reliability of the data.


With such large volumes of data coming from
diverse sources, ensuring data quality is a
significant challenge.
TYPES OF BIG DATA
•Structured Data - This type of data is organized and formatted in a
predefined manner, usually stored in relational databases.
It is highly organized, making it easy to search, query, and analyze

•Unstructured Data - Unstructured data lacks a specific structure or


format,
making it harder to process and analyze using traditional methods.

•Semi-structured Data - Semi-structured data has some organizational


properties, like tags or markers,
but doesn’t fit neatly into the traditional relational databases.
SOURCES OF BIG DATA
• Social media
• Sensors/ Iot devices
• E-commerce platforms
• Financial transactions
• Enterprise systems (CRM, ERP)
BIG DATA VS. TRADITIONAL DATA
• 1. Data Volume

• Traditional Data: Typically involves smaller datasets that can be managed and
processed using standard tools like Excel, relational databases (e.g., SQL), or basic
statistical software.

• Big Data: Refers to massive datasets, often in terabytes or petabytes, that exceed the
capabilities of traditional data processing systems. These large volumes of data come
from sources like social media, IoT devices, and transaction logs.

• 2. Data Variety

• Traditional Data: Usually structured, meaning it fits into predefined models like tables
with rows and columns. Examples include financial records, customer databases, and
sales data.

• Big Data: Involves a wide variety of data types, including structured, semi-structured,
and unstructured data. This can include text (emails, documents), images, videos,
social media posts, sensor data, and more. The diversity of formats requires advanced
tools for processing.

• 3. Data Velocity
BIG DATA TECHNOLOGIES
• Data Storage and Management Technologies
• Hadoop: An open-source framework that allows distributed storage and
processing of large datasets across clusters of computers. Hadoop's Hadoop
Distributed File System (HDFS) enables scalable storage, and its MapReduce
programming model helps process vast amounts of data in parallel.
• NoSQL Databases: Unlike traditional relational databases, NoSQL databases
(e.g., MongoDB, Cassandra) are designed to handle unstructured or semi-
structured data. They are flexible, scalable, and efficient for managing large
amounts of data.
• Amazon S3 (Simple Storage Service): A scalable, cloud-based object storage
service by AWS, often used for storing large datasets in Big Data applications.
• Apache HBase: A distributed, scalable NoSQL database built on top of HDFS,
designed to handle real-time read/write access to Big Data.
• Data Processing and Analytics Technologies
• Apache Spark: A powerful, open-source data processing engine that allows fast data
analytics and supports in-memory processing. It is widely used for batch and real-time data
processing and can work with large datasets more efficiently than MapReduce.
• MapReduce: A programming model used by Hadoop to process large datasets in a
distributed manner. MapReduce divides a job into smaller tasks (map phase) and then
aggregates results (reduce phase).
• Apache Flink: A distributed processing engine designed for stream and batch processing.
Flink excels at handling real-time data processing with low latency.
• Apache Storm: A real-time computation system that processes unbounded streams of data.
It’s useful for tasks like real-time analytics and continuous computation.
• Apache Kafka: A distributed streaming platform used to build real-time data pipelines and
streaming applications. It is highly scalable and handles high-throughput data streams.
• ElasticSearch: A real-time search and analytics engine, commonly used for log and event
data analysis. It provides fast search capabilities and is often used with distributed systems.
BIG DATA ANALYTICS
• Big Data Analytics refers to the process of examining large and complex
datasets to uncover hidden patterns, correlations, trends, and other valuable
insights. By using advanced analytic techniques, businesses can gain a
competitive advantage, improve decision-making, and identify new
opportunities. Here's a deeper look into Big Data Analytics, its types,
benefits, and tools.
TYPES OF BIG DATA ANALYTICS
• Descriptive Analytics
• Predictive Analytics
• Prescriptive Analytics
• Real-time analytics
APPLICATIONS OF BIG DATA

•Healthcare
•Finance
•Retail
•Manufacturing
•Government
FUTURE TRENDS IN BIG DATA
• Growth of Artificial Intelligence (AI) and Machine
Learning (ML)
• Edge computing
• Cloud-based Big Data solutions
• Quantum computing
CONCLUSION
• Big Data has revolutionized the way businesses and industries operate in today’s data-
driven world. Its ability to process, analyze, and extract meaningful insights from vast
and complex datasets offers immense opportunities for organizations across sectors.
Big Data enables more informed decision-making, fosters innovation, improves
operational efficiency, and enhances customer experiences by uncovering trends and
patterns that were previously difficult or impossible to detect with traditional data
tools.
• As technology continues to advance, the role of Big Data will grow, with analytics,
machine learning, and AI driving even deeper insights and automation. Organizations
that embrace Big Data and invest in the right tools, infrastructure, and skills will be
better positioned to thrive in an increasingly competitive and dynamic marketplace.
Any question
Thank You

You might also like