Data Visualization Question Bank eDBDA Sept 21
Data Visualization Question Bank eDBDA Sept 21
Contents
EDA............................................................................................................................................................. 1
Numpy, Pandas and Data Visualization ..................................................................................................... 2
Matplot and seaborn ................................................................................................................................. 4
EDA
3. To show a functional relationship between your independent and dependent variables, the graph of choice
would be a
A. line graph. B. histogram. C. pie chart D. scatterplot.
5. In which of the following situations would you not want to use a Pearson correlation coefficient?
A. when the relationship between variables is nonlinear
B. when both of your variables are measured on at least an interval scale
C. when the variances of your distributions are very similar D. all of the above
6. A curve showing a functional relationship that starts off flat, becomes progressively steeper, and shows a single
direction of change is
A. negatively accelerated. B. monotonic
C. positively accelerated. D. both b and c
7. A ________ distribution has most scores collected about the center and is symmetrical about its midpoint.
A. functional B. normal B. monotonic D. bimodal
1
USM’s Shriram Mantri Vidyanidhi Info Tech Academy
Data Visualization Question Bank eDBDA Sept 21
9. A functional graph that shows a uniformly increasing or decreasing functional relationship is said to
be A. monotonic. B. negatively skewed. C. normal. D. positively skewed.
10. If you have discrete group data, such as months of the year, age group, shoe sizes, and animals.
Which is best to explain?
A. Boxplot B. histogram C. bar D. scatterplot
11. Which graph is better used when data needs to be classified or categorize?
A. stack bar B. Pie chart C. histogram D. None of the above
14. From which plot you will come to the distribution of the target variable?
A. histogram B. pie chart C. bar D. Pareto chart
15. TrueFalse: The quantilequantile (qq) plot is a graphical technique for determining if two data sets come from
populations with a common distribution.
A. True B. False
16. TrueFalse: In Boxplot the middle line inside the box display the mean of the
distribution A. True B. False
17. TrueFalse: For Numeric vs Numeric data scatterplot is the best representation. A.
True B. False
18. TrueFalse: For Bivariant data, correlogram or corr plot show the correlation of each
variable. A. True B. False
19. TrueFalse: the height of the bar corresponds to the value of each category. A. True
B. False
20. TrueFalse: The height of the resulting Stacked Bar shows the combined result of the
groups. A. True B. False
2
USM’s Shriram Mantri Vidyanidhi Info Tech Academy
Data Visualization Question Bank eDBDA Sept 21
3) Pandas does easy handling of missing data in floating point as well as nonfloating point data? A.
True B. False
7) Pivot table can aggregate the data and summarize it by grouping the
columns A. True B. False
9) Dimensions should match along the axis you are _______ on.
A. concatenating B. merging C. joining D. collaborating
10) Series can have axis labels and it can be indexed by a label
A. True B. False
3
USM’s Shriram Mantri Vidyanidhi Info Tech Academy
Data Visualization Question Bank eDBDA Sept 21
18) _______ is a visualisation library that provides a highlevel interface to draw attractive statistical graphics.
A. Scrapy B. Seaborn C. Airborn D. Statistica
2. Point out the correct combination with regards to kind keyword for graph plotting:
A. ‘hist’ for histogram B. ‘box’ for boxplot
C. ‘area’ for area plots D. all of the Mentioned
Explanation: The kind keyword argument of plot() accepts a handful of values for plots other than the default Line
plot.
4. You can create a scatter plot matrix using the __________ method in pandas.tools.plotting.
A. sca_matrix B. scatter_matrix C. DataFrame.plot D. all of the Mentioned
Explanation: You can create density plots using the Series/DataFrame.plot.
5. Point out the wrong combination with regards to kind keyword for graph
plotting: A. ‘scatter’ for scatter plots B. ‘kde’ for hexagonal bin plots C.
‘pie’ for pie plots D. none of the Mentioned
Explanation: kde is used for density plots.
6. Which of the following plots are used to check if a data set or time series is
random ?
A. Lag B. Random C. Lead D. None of the Mentioned
Explanation: Random data should not exhibit any structure in the lag plot.
8. Which of the following plots are often used for checking randomness in time series ?
A. Autocausation B. Autorank C. Autocorrelation D. None of the Mentioned
Explanation: If time series is random, such autocorrelations should be near zero for any and all timelag separations.
4
USM’s Shriram Mantri Vidyanidhi Info Tech Academy
Data Visualization Question Bank eDBDA Sept 21