0% found this document useful (1 vote)
120 views

Data Analytics PDF

The document discusses various topics related to data analytics and data science including machine learning algorithms, business use cases, and roles in different industries. It provides explanations of supervised learning algorithms like linear regression and logistic regression and sample use cases. Unsupervised learning algorithms like K-means clustering and Gaussian mixture models are described along with examples. Reinforcement learning and deep learning are also overviewed. The roles of data analytics in marketing, finance, human resources, and operations are summarized.

Uploaded by

Prazavi Jain
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (1 vote)
120 views

Data Analytics PDF

The document discusses various topics related to data analytics and data science including machine learning algorithms, business use cases, and roles in different industries. It provides explanations of supervised learning algorithms like linear regression and logistic regression and sample use cases. Unsupervised learning algorithms like K-means clustering and Gaussian mixture models are described along with examples. Reinforcement learning and deep learning are also overviewed. The roles of data analytics in marketing, finance, human resources, and operations are summarized.

Uploaded by

Prazavi Jain
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Q1 Create two original blog entries on contemporary topics in Analytics and Data

Science?

Ans 1- https://pushpraj5518.blogspot.com/2020/01/google-analytics-google-analytics-web.html
2- https://princymaheshwari.blogspot.com

SUMMARY OF DATA ANALYTICS AND DATA SCIENCE

Data analytics is the science of analyzing raw data in order to make conclusions
about that information. Many of the techniques and processes of data analytics
have been automated into mechanical processes and algorithm that work over raw
data for human consumption.

Data analytics techniques can reveal trends and metrics that would otherwise be
lost in the mass of information. This information can then be used to optimize
processes to increase the overall efficiency of a business or system. Data analytics
is a broad term that encompasses many diverse types of data analysis. Any type of
information can be subjected to data analytics techniques to get insight that can be
used to improve things.

For example, manufacturing companies often record the runtime, downtime, and
work queue for various machines and then analyze the data to better plan the
workloads so the machines operate closer to peak capacity.
Just like football, as football is big business and mistakes can cost clubs millions.
So when Liver pool FC’s Phillippe Coutinho scored a free kick against Brighton in
December 2017 by firing his shot underneath the jumping defensive wall, Jurgen
klopp credited his analysts for pointing out the opportunity. Data was also
instrumental when hearts ended Celtic’s 69 game unbeaten run last year. Football
has become increasingly data - driven and Hearts drew upon Instat’s vast database
of football statistics to help mastermind their victory.
The process involved in data analysis involves several different steps:

1. The first step is to determine the data requirements or how the data is
grouped. Data may be separated by age, demographic, income, or gender.
Data values may be numerical or be divided by category.
2. The second step in data analytics is the process of collecting it. This can be
done through a variety of sources such as computers, online sources,
cameras, environmental sources, or through personnel.
3. Once the data is collected, it must be organized so it can be analyzed.
Organization may take place on a spreadsheet or other form of software that
can take statistical data.
4. The data is then cleaned up before analysis. This means it is scrubbed and checked
to ensure there is no duplication or error, and that it is not incomplete. This step
Program to read datasets and filter data

ANACONDA, JUPYTER & PYTHON these are the programs that build chart
and work with database as it helps to correct any errors before it goes on to a data
analyst to be analyzed.

First of all Anaconda is package manager and Jupyter is a presentation


layer.

Anaconda tries to solve the dependency hell in python—where different projects


have different dependency versions—so as to not make different project
dependencies require different versions, which may interfere with each other.

Jupyter tries to solve the issue of reproducibility in analysis by enabling an


iterative and hands-on approach to explaining and visualizing code; by using rich
text documentations combined with visual representations, in a single solution.
Anaconda is similar to pyenv, venv and minconda; it’s meant to achieve a python
environment that’s 100% reproducible on another environment, independent of
whatever other versions of a project’s dependencies are available. It’s a bit similar
to Docker, but restricted to the Python ecosystem.

Jupyter is an amazing presentation tool for analytical work; where you can present
code in “blocks,” combines with rich text descriptions between blocks, and the
inclusion of formatted output from the blocks, and graphs generated in a well-
designed matter by way of another block’s code.

Jupyter is incredibly good in analytical work to ensure reproducibility in


someone’s research, so anyone can come back many months later and visually
understand what someone tried to explain, and see exactly which code drove which
visualization and conclusion.

Python is programming language and Anaconda is a program that consists of


python + couple of python libraries. It also has its own virtual environment and
repository (which u can use alongside python command is : conda install package).

Machine learning is emerging as a key focus area for AI researchers, developers


and investors alike, due to its many potential applications. This sub-category of AI
goes beyond creating rule-based systems to developing algorithms that can be
trained to learn from data – and identify patterns, connections and insights –
without being programmed to work towards these specific conclusions.
Three major types of machine learning are:-
Supervised Learning
Unsupervised Learning
Reinforcement Learning
In supervised learning there is:-
1-Algorithms
2-Sample business use case
Algorithms
Linear regression
1 Highly interpretable, standard method for modeling the past relationship between
independent input variables and dependent output variables (which can have an
infinite number of values) to help predict future values of the output variables.
Logistic regression
2. Extension of linear regression that’s used for classification tasks, meaning the
output variable is binary (e.g. only black or white) rather than continuous (eg, an
infinite list of potential colors)
In Sample business use case linear regression - Understand product-sales drivers
such as competition prices, distribution, advertisement etc. Optimize price points
and estimate product-price elasticities. Classify customers based on how likely
they are to repay a loan predict if a skin lesion is benign or malignant based on its
characteristics (size, shape, color, etc.) it predict client churn and also predict a
sales lead’s likelihood of closing.
In Sample business use case logistic regression - Classify customers based on
how likely they are to repay a loan and also predict if a skin lesion is benign or
malignant based on its characteristics (size, shape, color, etc.)
In Unsupervised learning

1. Algorithms
2. Sample business use cases

In Algorithms
There is K-means clustering which puts data into a number of groups (k) that
each contain data with similar characteristics (as determined by the model, not in
advance by humans).

Gaussian mixture model -A generalization of k-means clustering that


provides more flexibility in the size and shape of groups (clusters).

In Sample business use cases K-means clustering segment customers into


groups by distinct characteristics (e.g. age group)— for instance, to better
assign marketing campaigns or prevent churn.

In Sample business use cases Gaussian mixture model- Segment


customers to better assign marketing campaigns using less-distinct customer
characteristics (e.g., product preferences) and segment employees based on
likelihood of attrition.

Now come to Reinforcement machine learning –It Optimizes the trading


strategy for an options-trading portfolio and Balance the load of electricity
grids in varying demand cycles also it stock and pick inventory using robots
and optimizes the driving behavior of self-driving cars also optimizes the
pricing in real time for an online auction of a product with limited supply.

Deep Learning is also a subset of machine learning in artificial intelligence


(AI) that has networks capable of learning unsupervised from data that is
unstructured or unbaled it is also known as deep neural learning or deep
neural network. Deep learning is a type of machine learning that can process
a wider range of data resources, requires less data preprocessing
by humans, and can often produce more accurate results than traditional
machine-learning approaches. The network can then make a determination
about the data, learn if its determination is correct, and use what it has
learned to make determinations about new data. For example, once it learns
what an object looks like, it can recognize the object in a new image.

Q. Suggest how we can apply Analytics and Data Science in finance/operation/


marketing/ human resource industry?
Roles of data analytics
Ans - In marketing – Analytics is the practice of measuring, managing and
analyzing marketing performance to maximize its effectiveness and optimize
return on investment. Understanding marketing analytics allows marketers to be
more efficient at their jobs and minimize wasted web marketing dollars.
In finance – Financial analytics is a field that gives different views of a company’s
financial data. It helps to gain in depth knowledge and take action against it to
improve the performance of business. Financial analytics plays a very important
role in calculating the profit of a business.
In Human resource - Data analytics is the people analytics also knows an HR
analytics and work force analytics, is the use of analytical process to solve business
problems. HR analysts collect and study info related to jobs, issues and cost that
affect their companies. Many HR information system programs (HRIS) is a
computer software package that helps these professionals manage their data more
efficiently and accurately.
In operations – Data analytics offer power full tools for helping a company make
better operations decisions. In particular, combining data from multiple data
sources and applying time- series, techniques can provide deep insights into
company’s operational strengths and weaknesses.

You might also like