Data Science QnA
Data Science QnA
a) Alphanumeric
(b) Alphabetic
(c) Numeric
31. If temperature is less than Zero, then what is data in the statement?
(a) temperature
(b) temperature <0
(c) 0
(d) <0
32. When an investigator uses the data which has already been collected by
others, such data is called as:
36. This is a library used to create data structures and carry out scientific
calculations.
(a) Python
(b) Numpy
(c) Pandas
(d) Mathplotlib
38. Which of the following statements is False in the case of the KNN Algorithm?
(a) For a very large value of K, points from other classes may be included in the
neighborhood
(b) For the very small value of K, the algorithm is very sensitive to noise
(c) KNN is used only for classification problem statements
(d) KNN is a lazy learner
1. Rohan bated 7 games last weekend. His scores are : 155, 165,
138, 172, 127, 193, 142. What is the range of Rohan‟s score?
Answer : 54
2. If the standard deviation of a data Set is 4, what is the variance?
Answer : 2
3. What is the standard deviation for this given data?
5, 1 0, 7, 12, 0, 20, 15, 22, 8, 2
Answer : 6.89
4. Students wer IIe asked how many hours a night they sleep.
The responses were: 10, 8, 7.5, 6, 5, 5.5, 6.5,9
Find the standard deviation
Answer : 1.638
5. what is the variance of the first 10 natural numbers?
Answer : 9.166
1. The type of data that cannot be counted measured or easily expressed using
numbers
Ans : Qualitative Data
2. This type of data describes qualities or characteristics.
Ans : Qualitative Data
3. A data visualization and graphical plotting library built on NumPy arrays.
Ans : Matplotlib
4. This is the average of numbers in the list.
Ans : Mean
5. The term used to describe the middle of a sorted list of numbers.
Ans : Median
6. The number which appears most often in a sorted array.
Ans : Mode
7. This is defined as the average of the squared differences from the Mean.
Ans : Variance
4. What are the differences between Artificial Intelligence and Data Science?
Ans
5. Explain the term of revisiting the AI Project Cycle?
Ans :
a) Problem Scoping : refers to understanding a problem and finding out
various factors which affect the problem, define the goal or aim of the
project.
The 4W's of Problem Scoping are Who, What, Where and Why. These 4W’s helps
in identifying and understanding the problem in a better and efficient manner.
1) Who - "Who" part helps us in comprehending and categorizing who all are
affected directly and indirectly with the problem and who are called the
Stakeholders
2) What - "What" part helps us in understanding and identifying the nature of
the problem and how do we get to know what helps to get us know the
evidence.
3) Where - "Where" does the problem arises, situation and the location.
4) Why - "Why" is the given problem worth solving.
13. Define the term Data type. What are the file format of Datasets?
Ans :
Data type is termed as a file format which is being used for storing the encoding
data in a computer file.
Commonly used file format of datasets are CSV files, spreadsheets and SQL.
20. What is the difference between Python List and Numpy Array?
Ans
25. Define the term Measures of Central tendency. What are the different
methods to measure?
Ans : Measures of central tendency are numbers that tend to cluster around the
“centre” of a set of values.
There are three methods to measures of central tendency :
a) Mean is the average or the most common value in a collection of numbers.
b) Median is the 'middle' of a sorted list of numbers that occurs in a sorted
array.
c) Mode is simply the number which appears most often in a sorted array.
K-NN algorithm stores all the available data and classifies a new data point
based on the similarity, it means when new data appears then it can be easily
classified into a well suited category by using K-NN algorithm.
K-NN algorithm can be used for Regression as well as for Classification but
mostly it is used for the Classification problems.