Big Data Analytics in The Cloud For Business Intelligence
K.Venkata Avinash
School of Computer Science And Engineering
The Research And Development Cell
Phagwara, India.
Abstract:
Cloud computing and big data analytics are, without a doubt, two of the most important technologies to enter the mainstream IT industry in recent years. Surprisingly, the two technologies are coming together to deliver powerful results and benefits for businesses. Cloud computing is already changing the way IT services are provided by so-called cloud companies and how businesses and users interact with IT resources. Big data is a data analysis methodology enabled by recent advances in information and communications technology. However, big data analysis requires a huge amount of computing resources, making the adoption cost of big data technology unaffordable for many small to medium enterprises. In this paper, we outline the benefits and challenges involved in deploying big data analytics through cloud computing. We argue that cloud computing can support the storage and computing requirements of big data analytics. We discuss how the consolidation of these two dominant technologies can enhance the process of big data mining, enabling businesses to improve decision-making processes. We also highlight the issues and risks that should be addressed when using a so-called CLaaS, a cloud-based service model.

Keywords: Cloud Computing, Big Data Analytics, Cloud Analytics, Security, Privacy, Business Intelligence, MapReduce, AaaS, CLaaS

1. Introduction

The term Business Intelligence (BI) refers to technologies, applications and practices for the collection, integration, analysis, and presentation of business information. The main purpose of Business Intelligence is to support better and faster business decision making. Organizations are being compelled to capture, understand and harness their data to support decision making in order to improve business operations.

[...] was the first company to properly use the characteristics of cloud computing and to provide its physical resources as virtual resources to customers [73], followed by others. Different cloud platforms, such as Google App Engine, Windows Azure, and Salesforce.com, are now available in the market, providing cloud services that enterprise developers can use to develop and migrate their applications and data and so benefit from cloud computing.

In an ever-changing business world, many companies now face growing pressure to develop and ramp up their business intelligence efforts quickly and at a low cost in order to remain competitive. The recent emergence of cloud computing is changing the way IT services are provided by companies and how businesses and users interact with IT resources. It represents a paradigm shift that introduces flexible service models to which companies can subscribe on a pay-as-you-use basis.

The data in the world is growing exponentially. Big data is an evolving term that describes any huge amount of structured, semi-structured and unstructured data that has the potential to be mined for useful information. Big data is data that exceeds the processing capacity of traditional databases; it is too big to be processed by a single machine. The evolving field of big data analytics examines large amounts of data to uncover hidden patterns, correlations and other insights. Big data technology has become possible with the latest developments in computer technology as well as algorithms and approaches developed to handle big data.

In this paper, our aim is to investigate the impacts of cloud computing and big data on businesses and analyse the benefits and challenges they bring to enterprises. First, we overview the concepts, issues and technology of cloud computing and big data separately. We then present a framework that combines these two technologies to form an ideal platform for e-commerce. We discuss the role of big data in enhancing the main functional areas of e-commerce such as customer management, marketing, payments, supply chain and management.

2. Related Work

The popularity of cloud computing has prompted several academic and industry initiatives to explore the capabilities and enhancements in cloud computing. The value proposition of cloud computing in comparison with on-premise investments is one of the key research areas. There are several initiatives to specifically address the security issues and challenges in cloud computing. There have also been several academic initiatives investigating e-business model aspects of cloud computing. Aydin discusses research on e-commerce based on cloud computing. Dan and Roger compared various cloud offerings such as Google App Engine, Amazon EC2, and Microsoft Azure to provide guidance on cost and application performance (and limitations) for different deployment scenarios. Agarwal et al. present various methods for handling the problems of big data analysis through the MapReduce framework over the Hadoop Distributed File System (HDFS). In that paper, MapReduce techniques have been implemented for big data analysis using HDFS. Yadav et al. present an overview of architecture and algorithms
used in large data sets. These algorithms define various structures and methods implemented to handle big data, and the paper lists various tools that were developed for analysing them. It also describes the various security issues, applications and trends associated with large data sets. Fan and Bifet present an overview of big data mining, outlining its current status, controversy, and forecast for the future. Their paper also covers various interesting and state-of-the-art topics in big data mining. Sharma and Navdeti discuss big data security at the environment level along with the probing of built-in protections. Their paper also presents some security issues that we are dealing with today and proposes security solutions and commercially accessible techniques to address them. The paper also covers security solutions for the Hadoop ecosystem. They also provide an overview of big data, its importance in our lives and some technologies to handle big data. Jassena and David discuss issues, challenges and solutions of big data mining. Padgavankar and Gupta provide a detailed analysis of the challenges involved in big data storage and propose some solutions to handle them. Jayasree provides an overview of big data technologies such as MapReduce and Hadoop and compares them with traditional data mining techniques. Zulkernine et al. present a conceptual architecture for cloud-based analytics as a service (CLaaS).

3. Cloud Computing

3.1 Cloud Computing

Many researchers have defined cloud computing differently. One widely accepted definition is given by the United States National Institute of Standards and Technology (NIST). Per the NIST definition, “Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications, and services) that can be rapidly provisioned and released with minimal management effort or service provider interaction. This cloud model is composed of five essential characteristics, three service models, and four deployment models.”

3.2 Cloud Computing Characteristics

Cloud computing has five essential characteristics: on-demand capabilities, broad network access, resource pooling, rapid elasticity and measured service. These are the characteristics that distinguish it from other computing paradigms.

On-demand capabilities: A consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with each service provider.

Broad network access: Capabilities are available over the network and accessed through standard mechanisms that promote use by heterogeneous thin or thick client platforms (e.g., mobile phones, tablets, laptops and workstations).

Resource pooling: The provider's computing resources are pooled to serve multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically assigned and reassigned per consumer demand.

Rapid elasticity: Capabilities can
be elastically provisioned and released, in some cases automatically, to scale rapidly outward and inward commensurate with demand.

Measured service: Cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth and active user accounts).

3.3 Cloud Deployment Models

Cloud deployment models are grouped broadly into four models: private cloud, public cloud, community cloud and hybrid cloud.

Private cloud is generally regarded as the most secure way to utilize cloud computing. The cloud infrastructure is provisioned for exclusive use by a single organization comprising multiple consumers (e.g., business units). It may be owned, managed, and operated by the organization, a third party, or some combination of them, and it may exist on or off premises.

Community cloud is provisioned for exclusive use by a specific community of consumers from organizations that have shared concerns. It may be owned, managed, and operated by one or more of the organizations in the community, a third party, or some combination of them, and it may exist on or off premises.

Public cloud is provisioned for open use by the public. It may be owned, managed, and operated by a business, academic, or government organization, or some combination of them. It exists on the premises of the cloud provider.

Hybrid cloud is a composition of two or more distinct cloud infrastructures (private, community, or public) that remain unique entities, but are bound together by standardized or proprietary technology that enables data and application portability.

3.4 Cloud Service Delivery Models

Cloud-based services are grouped broadly into four models: Data as a Service (DaaS), Software as a Service (SaaS), Platform as a Service (PaaS), and Infrastructure as a Service (IaaS).

Software as a Service (SaaS) is a model that provides the user with access to already developed applications that are running in the cloud. Access is achieved through cloud clients, and cloud users do not manage the infrastructure where the application resides, thereby eliminating the need to install and run the application on the cloud user’s own computers.

Platform as a Service (PaaS) is a model that delivers development environment services to the user, where the user can develop and run in-house built applications. The services might include an operating system, a programming language execution environment, databases and web servers.

Infrastructure as a Service (IaaS) is a model that provides the user with virtual infrastructure, for example servers and data storage space. Virtualization plays a major role in this model by allowing IaaS cloud providers to supply resources on demand, drawing them from the large pools installed in their data centres.

Data as a Service (DaaS) is a model in which data is readily accessible through a cloud-based platform. Simply put, DaaS is a new way of accessing business-critical data within an existing data centre.

Figure 1 illustrates the general cloud computing architecture.

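The division of responsibility among the service delivery models of Section 3.4 can be sketched in a few lines of Python. This is only an illustrative sketch: the layer names, the idea that the models stack on top of one another, and the function name split_responsibility are assumptions introduced here for illustration and are not part of the NIST definition or of any provider's API. DaaS is handled separately because it delivers data rather than a layer of the compute stack.

```python
# Illustrative sketch only: the layer names and their stacking order are
# assumptions made for this example, not definitions taken from a standard.
STACK = ["infrastructure", "platform", "application"]

# Highest layer of the stack that the provider delivers under each model,
# following the descriptions in Section 3.4.
PROVIDER_TOP_LAYER = {
    "IaaS": "infrastructure",  # virtual servers and data storage space
    "PaaS": "platform",        # OS, language runtime, databases, web servers
    "SaaS": "application",     # already developed applications
}


def split_responsibility(model: str) -> tuple[list[str], list[str]]:
    """Return (provider_layers, user_layers) for a service model."""
    if model == "DaaS":
        # DaaS delivers data rather than a layer of the compute stack,
        # so it is treated separately from the three stacked models.
        return ["data"], []
    top = STACK.index(PROVIDER_TOP_LAYER[model])
    return STACK[: top + 1], STACK[top + 1:]


if __name__ == "__main__":
    for model in ("IaaS", "PaaS", "SaaS", "DaaS"):
        provider, user = split_responsibility(model)
        print(f"{model}: provider supplies {provider}, user supplies {user}")
```

Under these assumptions, moving from IaaS to PaaS to SaaS shifts one more layer from the user to the provider at each step, which is why SaaS removes the need to install and run the application on the user's own computers.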
References:
Aydin, N. (2015). Cloud Computing for E-Commerce. Journal of Mobile Computing and Application.
Talia, D. (2013). Clouds for Scalable Big Data Analytics. IEEE Computer Society.
Fan, J., Han, F. and Liu, H. (2013). Challenges of Big Data Analysis. ResearchGate.
Yadav, C., Wang, S. and Kumar, M. (2013). Algorithm and Approaches to Handle Large Data - A Survey. IJCSN.
https://nessi.eu/Files/Private/NESSI_WhitePaper_BigData.pdf
http://www.edbt.org/Proceedings/2011-Uppsala/papers/edbt/a50-agrawal.pd
https://ijkie.org/IJKIE_December2014_IRINA%26HAO.pdf