Data Visualisation and Analytics
Data Visualisation and Analytics
Data analytics is the process of inspecting, cleansing, transforming data with the goal of
discovering useful information. It involves identifying or discovering the trends and patterns
inherent in the data. Data is extracted and categorized to identify and analyse behavioural
data and patterns, and techniques vary according to organizational requirements.
Data analytics is also known as data analysis. Data analytics is primarily conducted in
business-to-consumer (B2C) applications. Global organizations collect and analyse data
associated with customers, business processes, market economics or practical experience.
Data is categorized, stored and analyzed to study purchasing trends and patterns.
At a high level, data analytics methodologies include exploratory data analysis (EDA), which
aims to find patterns and relationships in data, and confirmatory data analysis (CDA), which
applies statistical techniques to determine whether hypotheses about a data set are true or
false. EDA is often compared to detective work, while CDA is akin to the work of a judge or
jury during a court trial.
Data analytics can also be separated into quantitative data analysis and qualitative data
analysis. The former involves analysis of numerical data with quantifiable variables that can
be compared or measured statistically. The qualitative approach is more interpretive -- it
focuses on understanding the content of non-numerical data like text, images, audio and
video, including common phrases, themes and points of view.
Data analytics initiatives support a wide variety of business uses. For example, banks and
credit card companies analyse withdrawal and spending patterns to prevent fraud and identity
theft.
Data collection. Data scientists identify data that will be needed for a particular analytics
application. Data is then assembled for use with or without the help of data engineers. Data is
then transformed into a common format and loaded into analytics system like Hadoop cluster,
NoSQL database or data warehouse.
The next step is Data profiling and cleansing. This is where data is checked for errors and
duplicates. Data is prepared to manipulate and organize the data for the planned analytics use.
Necessary data governance policies are applied to ensure that the data is compatible with the
corporate standards.
It is after this that data analytics begins. Analytics models are used incorporating the use of
predictive modelling tools or programming languages like python, Scala, r and SQL. Data is
tested for accuracy until the data performs as intended.
The last step is communicating the results of data analytics to executives and other users for
decision-making. Data visualisation techniques helps in the process of communicating the
results using charts and other infographics to make it easier for findings to be understood.