AI Project
AI Project
The 4W’s of Problem Scoping are Who, What, Where and Why. This helps in
identifying and understanding the problem in a better and efficient
manner.
Who - “Who” part helps us in comprehending and categorizing who all
are affected directly and indirectly with the problem and who are called
the Stake Holders
What - “What” part helps us in understanding and identifying thenature
of the problem and under this block, you also gather evidence to prove
that the problem you have selected exists.
Where- "Where” does the problem arise, situation,context,and location.
Why - “Why” is the given problem worth solving.
Problem Statement Template
As the term clearly mentions, this stage is about acquiring data for
the project. Let us first understand what is Data. Data can be a
piece of information or facts and statistics collected together for
reference or analysis. Whenever we want an AI project to be able
to predict an output, we need to train it first using data.
For Example:- For example, If you want to make an Artificially
Intelligent system which can predict the salary of any employee
based on his previous salaries, you would feed the data of his
previous salaries into the machine. This is the data with which the
machine can be trained. Now, once it is ready, it will predict his next
salary efficiently. The previous salary data here is known as Training
Data while the next salary prediction data set is known as the
Testing Data.
Data Exploration
Supervised Learning
In a supervised learning model, the dataset which is fed to the
machine is labelled. In other words, we can say that the dataset is
known to the person who is training the machine only then he/she is
able to label the data.
There are two types of Supervised Learning models:
Classification: Where the data is classified according to the labels.
Regression: Such models work on continuous data.
Types of learning based approach.
Unsupervised Learning
An unsupervised learning model works on un labelled dataset. This means that
the data which is fed to the machine is random and there is a possibility that the
person who is training the model does not have any information regarding it.
Unsupervised learning models can be further divided into two categories:
Clustering: Refers to the unsupervised learning algorithm which can cluster the
unknown data according to the patterns or trends identified out of it.
Dimensionality Reduction: We humans are able to visualise upto 3-Dimensions
only but according to a lot of theories and algorithms, there are various entities
which exist beyond 3-Dimensions
Evaluation