Hora Paper 351 Visual Analytics Methodology
net/publication/361594614
All content following this page was uploaded by Suraya Yaacob on 04 July 2022.
Abstract— Big data usage has evolved from looking into the capacity of big data's descriptive and diagnostic perspectives to currently feeding the demand for predictive big data analytics. The need arises because organizations crave predictive analytics capabilities to reduce risk, make intelligent decisions, and generate different customer experiences. Similarly, visual analytics plays an essential role in understanding and fitting analytics prediction into business decisions. Hence, the combination of descriptive, diagnostic and predictive analytics within Visual Analytics emerges as a balanced field that provides understandable predictive insight. Due to organizational demand and the multi-disciplinary nature of the area, the approach to developing visual analytics within the Big Data Project Lifecycle is still uncertain from a methodological perspective. While there are a few potential methodological approaches that could be used for visual analytics, they are scattered across academic research and industrial practice. To date, there is no coherent review and analysis of the work that has been explored specifically for Visual Analytics methodology. To address this gap, this paper reports on a review of previous literature concerning how Visual Analytics has been executed in the big data lifecycle. The review is organized from three perspectives: i) general ICT-related methodologies (e.g. SDLC, Agile, DevOps), ii) Data Science-related methodologies (e.g. CRISP-DM, SEMMA, KDD) and iii) Visual Analytics-related methodologies, in which each method is benchmarked against the major parts of Visual Analytics (reality, computer and human) in terms of width, depth, and flow. Based on the review, this study found insufficient, non-specific and vague conditions in handling Visual Analytics when using current methodological approaches. The paper also highlights the Visual Analytics-related methodological review, which sheds some light on the approaches and ways of implementing analytics in the big data lifecycle and can benefit future studies in proposing a more comprehensive methodology for Visual Analytics in the big data lifecycle.

Keywords—process; methodology; visual analytics; big data analytics.

I. INTRODUCTION

Big data refers to massive, complex structured and unstructured data sets that are rapidly generated and transmitted from a wide variety of sources. Value is extracted from the data, and insights are analyzed to support better decisions and strategic business moves. Big data usage has evolved from descriptive and diagnostic capabilities to, more recently, predictive capabilities. As a result, Predictive Visual Analytics is currently in high demand among businesses and organizations [1]. This is because organizations require predictive capabilities to reduce risk, make intelligent decisions, and generate different customer experiences. This demand attracts many industrial players to implement predictive analytics in their business [2]. In parallel, visual analytics plays an essential role in understanding and fitting analytics prediction into business decisions. Hence, there is a need to embed prediction in visual analytics so that it becomes a balanced way to provide understandable predictive insights. When carefully executed, it can provide practical insights and predictions by analyzing current and historical data.

Identifying a clear and practical methodology is critical to moving the demand for prediction into valuable business practice. To a larger extent, it strengthens and clarifies the usage and implementation of prediction in the big data lifecycle. Due to organizational demand and the multi-disciplinary nature of the area, the development of Visual Analytics within the Big Data Project Lifecycle still lacks a methodological perspective. The methodology should encompass a multi-perspective approach that incorporates business understanding, data integration, statistics, assumptions, modelling, visualization and analytical reasoning within the big data lifecycle. A few potential methodological approaches could be used. However, findings from past studies are scattered across numerous conferences and industrial practices. Consequently, there seems to be a lack of coherent review and analysis of work that specifically explores Visual Analytics methodology. Thus, this paper reports on a review of previous literature concerning how Visual Analytics has been executed in the big data lifecycle.

II. VISUAL ANALYTICS

Visual Analytics combines automated analysis techniques with interactive visualization and is concerned with the science of analytical reasoning from raw data, which is often presented in dynamic and interactive visual interfaces (e.g. dashboard, graph or map) [5]. Under current demand, visual analytics is regarded as a platform that incorporates or enhances diagnostic and predictive analytics, providing predictive event patterns with interactive visual representation [3]. From the automated analysis perspective, it focuses on analyzing historical data to diagnose the situation or predict future events. Hence, it is concerned with identifying the root causes of a situation as a basis for predicting future probabilities and trends from observed events, encompassing a multi-perspective approach that includes integrated reasoning, pattern recognition, and predictive modelling associated with domain knowledge [4].

Visual analytics focuses more on the computational part and goes in depth into its technicalities and statistics. It involves extracting information from large data sets to identify the patterns and trends used to generate models and predict future outcomes and behaviors of interest [6]. It aims to anticipate unknown future actions through data mining, statistics, modeling, deep
The biggest shortcoming of ICT-related methodologies in handling Visual Analytics is that their processes are too shallow for Visual Analytics applications and need more focus on analytics specification. In the reality context, managing business requirements must focus on strategic business processes rather than operational business processes, since Visual Analytics is meant to support decision making rather than the operational business workload. In the computer part, more specific technical, modelling, and statistical elements need to be covered in depth. Finally, for the human part, the ICT-related methodologies focus on evaluation through User Acceptance Tests of the application's functionalities. This is at odds with Visual Analytics, which focuses on how the analytics outcomes should be understood by users and become knowledge that can be applied to facilitate the business in its context of use.

III. BDA RELATED METHODOLOGIES

BDA-related methodologies are the fields nearest to Visual Analytics. Most Visual Analytics projects use these methodologies to facilitate project development. Based on real usage and development, the Cross-Industry Standard Process

Furthermore, Saltz & Shamshurin [40] mentioned that CRISP-DM might not be suitable for a big data project because of the 5Vs characteristics of Big Data. Another shortcoming of CRISP-DM is the lack of Business and Data Understanding guidelines. The business understanding phase in CRISP-DM does not indicate a data acquisition phase [37], that is, the process of converting data from the real world so that it can be displayed, analyzed, and stored in the digital domain. A further limitation of CRISP-DM is that project management tasks are not addressed. CRISP-DM is not a suitable method for project management since it presumes that its user is a single person or a small-scale project [41], so the team coordination, communication and prioritization needed for larger projects are ignored [42]. Some project management practices, such as quality management or change management, are also not included in CRISP-DM [43]. Ponsard [44] found that CRISP-DM struggles to deliver a good management viewpoint on communications, knowledge and project aspects. As a result, CRISP-DM fails to underline the more important steps and milestones that can be enhanced progressively. Lastly, CRISP-DM suffers from the absence of recommended techniques (processes or procedures that need to be followed) and tools (devices or applications) [45], obstructing an effective process and