0% found this document useful (0 votes)

53 views

Big Data

Big data analytics describes the process of analyzing large amounts of data from various sources to uncover patterns and insights. It involves collecting, processing, cleaning, and analyzing large datasets. This allows organizations to gain operational efficiencies, improve products based on customer needs, and track market trends. However, big data also presents challenges around data accessibility, quality, security, and choosing the right tools.

Uploaded by

nam trần

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views

Big Data

Uploaded by

nam trần

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Big Data Analytics: What It Is, How It

Works, Benefits, And Challenges

Each day, your customers generate an abundance of data. Every time they open your
email, use your mobile app, tag you on social media, walk into your store, make an
online purchase, talk to a customer service representative, or ask a virtual assistant
about you, those technologies collect and process that data for your organization. And
that’s just your customers. Each day, employees, supply chains, marketing efforts,
finance teams, and more generate an abundance of data, too. Big data is an extremely
large volume of data and datasets that come in diverse forms and from multiple
sources. Many organizations have recognized the advantages of collecting as much
data as possible. But it’s not enough just to collect and store big data—you also have
to put it to use. Thanks to rapidly growing technology, organizations can use big data
analytics to transform terabytes of data into actionable insights.

What is big data analytics?

Big data analytics describes the process of uncovering trends, patterns, and
correlations in large amounts of raw data to help make data-informed decisions. These
processes use familiar statistical analysis techniques—like clustering and regression—
and apply them to more extensive datasets with the help of newer tools. Big data has
been a buzz word since the early 2000s, when software and hardware capabilities
made it possible for organizations to handle large amounts of unstructured data. Since
then, new technologies—from Amazon to smartphones—have contributed even more
to the substantial amounts of data available to organizations. With the explosion of
data, early innovation projects like Hadoop, Spark, and NoSQL databases were
created for the storage and processing of big data. This field continues to evolve as
data engineers look for ways to integrate the vast amounts of complex information
created by sensors, networks, transactions, smart devices, web usage, and more. Even
now, big data analytics methods are being used with emerging technologies, like
machine learning, to discover and scale more complex insights.

How big data analytics works

Big data analytics refers to collecting, processing, cleaning, and analyzing large
datasets to help organizations operationalize their big data.

1. Collect Data

Data collection looks different for every organization. With today’s technology,
organizations can gather both structured and unstructured data from a variety of
sources — from cloud storage to mobile applications to in-store IoT sensors and
beyond. Some data will be stored in data warehouses where business intelligence
tools and solutions can access it easily. Raw or unstructured data that is too diverse or
complex for a warehouse may be assigned metadata and stored in a data lake.
2. Process Data

Once data is collected and stored, it must be organized properly to get accurate results
on analytical queries, especially when it’s large and unstructured. Available data is
growing exponentially, making data processing a challenge for organizations. One
processing option is batch processing, which looks at large data blocks over time.
Batch processing is useful when there is a longer turnaround time between collecting
and analyzing data. Stream processing looks at small batches of data at once,
shortening the delay time between collection and analysis for quicker decision-
making. Stream processing is more complex and often more expensive.

3. Clean Data

Data big or small requires scrubbing to improve data quality and get stronger results;
all data must be formatted correctly, and any duplicative or irrelevant data must be
eliminated or accounted for. Dirty data can obscure and mislead, creating flawed
insights.

4. Analyze Data

Getting big data into a usable state takes time. Once it’s ready, advanced analytics
processes can turn big data into big insights. Some of these big data analysis methods
include:

 Data mining sorts through large datasets to identify patterns and relationships by identifying
anomalies and creating data clusters.
 Predictive analytics uses an organization’s historical data to make predictions about the
future, identifying upcoming risks and opportunities.
 Deep learning imitates human learning patterns by using artificial intelligence and machine
learning to layer algorithms and find patterns in the most complex and abstract data.

Big data analytics tools and technology

Big data analytics cannot be narrowed down to a single tool or technology. Instead, several types of tools work
together to help you collect, process, cleanse, and analyze big data. Some of the major players in big data ecosystems
are listed below.

 Hadoop is an open-source framework that efficiently stores and processes big datasets on clusters of
commodity hardware. This framework is free and can handle large amounts of structured and unstructured
data, making it a valuable mainstay for any big data operation.
 NoSQL databases are non-relational data management systems that do not require a fixed scheme,
making them a great option for big, raw, unstructured data. NoSQL stands for “not only SQL,” and these
databases can handle a variety of data models.
 MapReduce is an essential component to the Hadoop framework serving two functions. The first is
mapping, which filters data to various nodes within the cluster. The second is reducing, which organizes
and reduces the results from each node to answer a query.
 YARN stands for “Yet Another Resource Negotiator.” It is another component of second-generation
Hadoop. The cluster management technology helps with job scheduling and resource management in the
cluster.
 Spark is an open source cluster computing framework that uses implicit data parallelism and fault
tolerance to provide an interface for programming entire clusters. Spark can handle both batch and stream
processing for fast computation.
 Tableau is an end-to-end data analytics platform that allows you to prep, analyze, collaborate, and share
your big data insights. Tableau excels in self-service visual analysis, allowing people to ask new questions of
governed big data and easily share those insights across the organization.

The big benefits of big data analytics

The ability to analyze more data at a faster rate can provide big benefits to an organization, allowing it to more
efficiently use data to answer important questions. Big data analytics is important because it lets organizations use
colossal amounts of data in multiple formats from multiple sources to identify opportunities and risks, helping
organizations move quickly and improve their bottom lines. Some benefits of big data analytics include:

 Cost savings. Helping organizations identify ways to do business more efficiently

 Product development. Providing a better understanding of customer needs
 Market insights. Tracking purchase behavior and market trends

Read more about how real organizations reap the benefits of big data.

The big challenges of big data

Big data brings big benefits, but it also brings big challenges such new privacy and security concerns, accessibility for
business users, and choosing the right solutions for your business needs. To capitalize on incoming data,
organizations will have to address the following:

 Making big data accessible. Collecting and processing data becomes more difficult as the amount of data
grows. Organizations must make data easy and convenient for data owners of all skill levels to use.
 Maintaining quality data. With so much data to maintain, organizations are spending more time than
ever before scrubbing for duplicates, errors, absences, conflicts, and inconsistencies.
 Keeping data secure. As the amount of data grows, so do privacy and security concerns. Organizations will
need to strive for compliance and put tight data processes in place before they take advantage of big data.
 Finding the right tools and platforms. New technologies for processing and analyzing big data are
developed all the time. Organizations must find the right technology to work within their established
ecosystems and address their particular needs. Often, the right solution is also a flexible solution that can
accommodate future infrastructure changes.

Get started with big data analytics

Big data comes in all shapes and sizes, and organizations use it and benefit from it in numerous ways. How can your
organization overcome the challenges of big data to improve efficiencies, grow your bottom line and empower new
business models? Start with these seven tips for succeeding with big data.

Dental Research Report
No ratings yet
Dental Research Report
3 pages
Shanin DOE - Six Sigma
100% (1)
Shanin DOE - Six Sigma
7 pages
Analysis of Variance (F-Test) : Name: Jonalyn M. Cerilo Subject: Statistics Professor: Dr. Maria Dela Vega Topic
No ratings yet
Analysis of Variance (F-Test) : Name: Jonalyn M. Cerilo Subject: Statistics Professor: Dr. Maria Dela Vega Topic
5 pages
Document
No ratings yet
Document
5 pages
Big Data Analytics
No ratings yet
Big Data Analytics
4 pages
Unit 1 Big Data
No ratings yet
Unit 1 Big Data
124 pages
File 1
No ratings yet
File 1
3 pages
Ccs 334
No ratings yet
Ccs 334
16 pages
UNIT I BIG DATA Extra Content
No ratings yet
UNIT I BIG DATA Extra Content
15 pages
Bda Unit1
No ratings yet
Bda Unit1
19 pages
Big Data: Concepts, Techniques, Storage and Challenges
No ratings yet
Big Data: Concepts, Techniques, Storage and Challenges
9 pages
Big Data Analytics: Free Guide: 5 Data Science Tools To Consider
No ratings yet
Big Data Analytics: Free Guide: 5 Data Science Tools To Consider
8 pages
Big Data Seminar
100% (2)
Big Data Seminar
27 pages
Harnessing The Value of Big Data Analytics
No ratings yet
Harnessing The Value of Big Data Analytics
13 pages
MODULE 1 - ST
No ratings yet
MODULE 1 - ST
13 pages
Mittal School of Business: Course Code: CAP348 Course Title: Introduction To Big Data
No ratings yet
Mittal School of Business: Course Code: CAP348 Course Title: Introduction To Big Data
6 pages
Data Mining
No ratings yet
Data Mining
11 pages
Data Science Unit-I
No ratings yet
Data Science Unit-I
13 pages
Big Data Ashish
No ratings yet
Big Data Ashish
7 pages
TP 4 2docuatrimestre
No ratings yet
TP 4 2docuatrimestre
10 pages
Unit 1
No ratings yet
Unit 1
19 pages
Various Big Data Tools
No ratings yet
Various Big Data Tools
33 pages
The Definition of Big Data
No ratings yet
The Definition of Big Data
7 pages
Data Mining
No ratings yet
Data Mining
89 pages
Introduction to Big Data
No ratings yet
Introduction to Big Data
4 pages
Module_1_Session_3 Analytic Processes and Tools _ Analysis vs Reporting _ Modern Data Analytic Tools
No ratings yet
Module_1_Session_3 Analytic Processes and Tools _ Analysis vs Reporting _ Modern Data Analytic Tools
5 pages
Big Data: Spot Business Trends, Prevent Diseases, C Ombat Crime and So On"
No ratings yet
Big Data: Spot Business Trends, Prevent Diseases, C Ombat Crime and So On"
8 pages
Data Mining1
No ratings yet
Data Mining1
37 pages
Data Mining Tutorial - Javatpoint
No ratings yet
Data Mining Tutorial - Javatpoint
12 pages
Big Data Manual - Edited
No ratings yet
Big Data Manual - Edited
69 pages
Three V of Big Data
No ratings yet
Three V of Big Data
4 pages
Unit 2 (DWDM)
No ratings yet
Unit 2 (DWDM)
40 pages
Bda Aiml Note Unit 1
No ratings yet
Bda Aiml Note Unit 1
14 pages
R Programming UNIT-1
No ratings yet
R Programming UNIT-1
48 pages
What is Big Data
No ratings yet
What is Big Data
4 pages
Server Technology 600
No ratings yet
Server Technology 600
42 pages
Reading Teks Kelompok 2
No ratings yet
Reading Teks Kelompok 2
12 pages
Enterprise integration Report
No ratings yet
Enterprise integration Report
7 pages
Big Data Analytics Notes
No ratings yet
Big Data Analytics Notes
117 pages
FDS- Unit-I - Notes
No ratings yet
FDS- Unit-I - Notes
24 pages
Getting Started With Hadoop Planning Guide
No ratings yet
Getting Started With Hadoop Planning Guide
24 pages
Bigdata Mod-1
No ratings yet
Bigdata Mod-1
33 pages
L_1 Data Mining
No ratings yet
L_1 Data Mining
17 pages
What Is Big Data
No ratings yet
What Is Big Data
8 pages
Advanced Analytics: What Is Big Data Analytics? Definition, Benefits, and More
No ratings yet
Advanced Analytics: What Is Big Data Analytics? Definition, Benefits, and More
13 pages
Big Data
No ratings yet
Big Data
16 pages
Big Data UNIT1
No ratings yet
Big Data UNIT1
23 pages
BigData_BCom
No ratings yet
BigData_BCom
57 pages
BigData_BCom-Unit-1
No ratings yet
BigData_BCom-Unit-1
9 pages
Data Analytics
No ratings yet
Data Analytics
5 pages
Data Mining Tutorial
No ratings yet
Data Mining Tutorial
30 pages
What is Big Data
No ratings yet
What is Big Data
5 pages
BIG DATA ANALYTICS
No ratings yet
BIG DATA ANALYTICS
23 pages
BIG DATA ANALYTICS NOTES
No ratings yet
BIG DATA ANALYTICS NOTES
115 pages
Big Data Notes UNIT-1
No ratings yet
Big Data Notes UNIT-1
14 pages
BDA Notes
No ratings yet
BDA Notes
35 pages
Bda CHP1
No ratings yet
Bda CHP1
83 pages
data mining and business analytics
No ratings yet
data mining and business analytics
7 pages
Big Data Analytics - CCS334 - Notes - Unit 1 - Understanding Big Data
No ratings yet
Big Data Analytics - CCS334 - Notes - Unit 1 - Understanding Big Data
40 pages
Emerging Big Data and Cloud Computing
No ratings yet
Emerging Big Data and Cloud Computing
15 pages
Unit 1
No ratings yet
Unit 1
14 pages
Demystifying Big Data RGc1.0
100% (1)
Demystifying Big Data RGc1.0
10 pages
Data Analytics with Python: Data Analytics in Python Using Pandas
From Everand
Data Analytics with Python: Data Analytics in Python Using Pandas
Frank Millstein
3/5 (1)
Data Analytics & Business Intelligence
No ratings yet
Data Analytics & Business Intelligence
15 pages
Sample Size Requirements Reliability Studies: K and Number
No ratings yet
Sample Size Requirements Reliability Studies: K and Number
8 pages
End Sem Exam Mo-20 Bba MT 102 Business Statistics
No ratings yet
End Sem Exam Mo-20 Bba MT 102 Business Statistics
1 page
Lampiran Hasil Pengolahan Data Uji Regresi Berganda (Hipotesis Penelitian)
No ratings yet
Lampiran Hasil Pengolahan Data Uji Regresi Berganda (Hipotesis Penelitian)
5 pages
Working Capital Management
No ratings yet
Working Capital Management
47 pages
CBA Profile Book Batch 6
100% (1)
CBA Profile Book Batch 6
62 pages
RFP DataLytics - Vol 1
100% (1)
RFP DataLytics - Vol 1
101 pages
4257 16666 6 PB
No ratings yet
4257 16666 6 PB
9 pages
Data Science Terminology Flashcards - Quizlet
100% (1)
Data Science Terminology Flashcards - Quizlet
15 pages
M720enbr BB
No ratings yet
M720enbr BB
9 pages
Data Mining: Concepts and Techniques: - Chapter 6
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 6
112 pages
Exercise 25
No ratings yet
Exercise 25
3 pages
CH 7 Clustering
No ratings yet
CH 7 Clustering
37 pages
Research: Levita Blorecia-Grana, DM, CE Sjit
No ratings yet
Research: Levita Blorecia-Grana, DM, CE Sjit
56 pages
Regression Results
No ratings yet
Regression Results
7 pages
Faculty of Busniess and Management Bachelor in Office System Manangment (BA232) MGT555 Assignment 1 Prepared by
100% (1)
Faculty of Busniess and Management Bachelor in Office System Manangment (BA232) MGT555 Assignment 1 Prepared by
13 pages
7 Libraries That Help in Time-Series problems-AI Data Science
No ratings yet
7 Libraries That Help in Time-Series problems-AI Data Science
20 pages
FBA Unit 2
No ratings yet
FBA Unit 2
37 pages
CCW331 Business Analytics Material Unit I Type2
No ratings yet
CCW331 Business Analytics Material Unit I Type2
43 pages
The Scientific Approach and Alternative Approaches To Investigation
No ratings yet
The Scientific Approach and Alternative Approaches To Investigation
51 pages
a output one way anova
No ratings yet
a output one way anova
4 pages
SPSS Tutorials-Data Entry, Data Screening, Validity, Reliability
No ratings yet
SPSS Tutorials-Data Entry, Data Screening, Validity, Reliability
23 pages
Project On Dps Financial Service
No ratings yet
Project On Dps Financial Service
58 pages
Analysis
No ratings yet
Analysis
2 pages
Test Question
No ratings yet
Test Question
80 pages
1
No ratings yet
1
10 pages
CHAPTER 12 - Non Parametrics Test
No ratings yet
CHAPTER 12 - Non Parametrics Test
38 pages

Big Data

Uploaded by

Big Data

Uploaded by

Big Data Analytics: What It Is, How It

Works, Benefits, And Challenges

What is big data analytics?

How big data analytics works

Big data analytics tools and technology

The big benefits of big data analytics

 Cost savings. Helping organizations identify ways to do business more efficiently

The big challenges of big data

Get started with big data analytics

You might also like