
VIP Question Bank For DPV For Theory Exam

The document is a question bank for data visualization and analysis, covering topics such as Matplotlib, NumPy, data preparation, and statistical concepts. It includes multiple-choice questions, short answer questions, and sections for detailed explanations on various data analysis techniques. The content is aimed at assessing knowledge in data visualization principles, handling missing data, and understanding different types of variables and plots.

Uploaded by

VIKRAMADITYA

Question Bank for DPV

1) _______ will automatically adjust the axis aspect ratio to match the input data in a contour plot.
a) plt.imshow()
b) plt.contour()
c) plt.contourf()
d) All of the mentioned

2) Which principle is important in data visualization design?


a) Complexity
b) Clarity
c) Randomness
d) Ambiguity

3) Which function would you use in Matplotlib to create a line plot?


a) plt.bar()
b) plt.scatter()
c) plt.plot()
d) plt.pie()

4) The data preparation phase includes the ________ activity.


a) Access and combine the data table
b) Look for error
c) Transform data
d) All of the above

5) Which method is commonly used to handle missing values in a dataset?


a) Drop rows
b) Fill with median
c) Replace with a constant
d) All of the above

6) What is NumPy primarily used for in Python?


a) String manipulation
b) Data visualization
c) Numerical computing
d) File handling

7) Each column of the table describes a _________ and each row describes an _________.
a) Variable, attributes
b) Variables, Observation
c) Variable, features
d) All of the above

8) Multivariate data analysis can be visualized using which of the following charts?
a) Scatter plot matrix
b) Star plots
c) Spider web chart
d) All of the mentioned
9) What is the output of the DataFrame.head() method?
a) return only the first row
b) return only headers or column name of the DataFrame
c) return the first five rows of the DataFrame
d) throw an exception as it expects parameter(number) in parenthesis

10) In Python, alpha typically refers to the ________ level of a color.


a) Transparency
b) Visibility
c) Clearance
d) Combination

11) Which of the following methods is used to show topographical features both on land and under
the ocean using the Basemap library?
a) bluemarble()
b) etopo()
c) lcc()
d) None

12) Which of the following array operations is used for getting and setting smaller subarrays within a larger
array?
a) Indexing of arrays
b) Reshaping of arrays
c) Slicing of arrays
d) Joining and splitting of arrays

13) The _________ transformation rescales values to the range between 0 and 1.


a) min–max transformation
b) z-score transformation
c) Standard deviation
d) decimal scaling transformation

14) In a dataset, variables like height and weight are examples of which type of variable?
a) Categorical
b) Continuous
c) Ordinal
d) Nominal

1) In classification, the data are arranged according to:


a) Similarities b) Differences c) Percentages d) Ratios

2) A distribution in which the observations are concentrated at one end of the distribution is called a:
a) Symmetric distribution b) Normal distribution c) Skewed distribution d) Uniform distribution

3) What are the different types of variables?


a) Nominal b) Ordinal c) Quantitative d) All of above

4) Data that summarize all observations in a category are called ____ data.


a) frequency b) summarized c) raw d) none
5) What is the meaning of mean?
a) average b) sum c) difference d) none of these

6) Which metrics measure central tendency?


a) Mean b) Mode c) Median d) All of above

7) What does NumPy stand for?


a) Numerical Python b) Natural Python c) Numeric Program d) Nonlinear Python

8) Which of the following is used to find the indices of the maximum and minimum elements in a NumPy array?
a) argmax() and argmin() b) max() and min() c) amax() and amin() d) None

9)
import numpy as np

a) 0 b) 1 c) 2 d) 3

10) A DataFrame object is value-mutable.

a) True b) False

11) Which of the following is/are not a correct way to access an individual item from the DataFrame 'df'?
a) df.iat[2,2] b) df.loc[2,2] c) df.at[2,2] d) df[0,0]

12) Which of the following libraries is used to plot graphs in Python?
a) Numpy b) Pandas c) Matplotlib d) All of above

13) Which is the correct command to install Matplotlib?


a) pip install matplotlib b) pip install matplotlib.pz
c) pip install matplotlib.* d) All of the above

14) Which function is used to create a histogram in Seaborn?


a) sns.lineplot() b) sns.scatterplot() c) sns.histplot() d) sns.barplot()
Section – I

a) Explain the process of making sense of data. Discuss the various steps involved in data
analysis, from collection to interpretation.
b) Elaborate on the types of hypothesis with examples.
c) What is a scatter plot? Draw the positive and negative relationships.
d) Define fancy indexing in NumPy. How does it differ from regular indexing, and when
is it useful?
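
As a study aid for question d) above, here is a minimal sketch of fancy indexing. The array values and the variable names are made up for illustration and are not part of the question paper.

```python
import numpy as np

# Made-up sample array for illustration
x = np.array([51, 92, 14, 71, 60, 20, 82, 86, 74, 74])

# Regular indexing: one scalar index returns one element
single = x[3]

# Slicing: a contiguous (or regularly strided) range of elements
sliced = x[2:5]

# Fancy indexing: a list or array of indices returns those elements together
idx = [3, 7, 4]
picked = x[idx]

# The result takes the shape of the index array, not the shape of x
idx2d = np.array([[3, 7], [4, 5]])
picked2d = x[idx2d]

# Fancy indexing can also assign to several positions at once
y = x.copy()
y[[0, 1]] = 0
```

One difference worth noting in an answer: a slice such as `x[2:5]` is a view of the original array, while a fancy-indexed result such as `x[idx]` is a copy. Fancy indexing is useful when the wanted elements follow no regular pattern, for example when drawing a random subset of rows.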

a) What are the metrics of central tendency? Explain with examples.
b) How can text be converted to numbers and numbers to text? Explain with examples.
c) How do functions like min(), max(), sum() and prod() help in analyzing data? Provide
practical examples.
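
One possible practical example for question c) above uses the NumPy versions of these aggregates. The test scores and growth factors below are invented for illustration.

```python
import numpy as np

# Made-up test scores: rows are students, columns are tests
scores = np.array([[72, 85, 90],
                   [60, 75, 88]])

lowest = scores.min()              # smallest score in the whole table
highest = scores.max()             # largest score in the whole table
total = scores.sum()               # sum of every score
per_student = scores.sum(axis=1)   # one total per row (per student)

# prod() multiplies elements, e.g. to compound yearly growth factors
growth = np.array([1.10, 1.05, 0.98])
compound = growth.prod()
```

The `axis` argument controls the direction of the aggregation: `axis=0` collapses the rows and gives one value per column, and `axis=1` collapses the columns and gives one value per row.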

Section – I
e) What are the different types of variables?
f) Explain the types of hypothesis with examples.
g) What is a scatter plot? Draw the positive and negative relationships.
h) Define the terms ndim, shape, size, and dtype with syntax.
a) What are the metrics of central tendency? Explain with examples.
d) How can text be converted to numbers and numbers to text? Explain with examples.
e) Given the array array([0,1,2,3,4,5,6,7,8,9]), write Python code to:
1) Display the first five elements.
2) Display the elements after index 5.
3) Display a middle subarray.
4) Display the last 2 elements.
5) Display only the even elements.
6) Display only the odd elements.
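
A possible solution sketch for the array exercise above follows. Two readings are assumptions: "after index 5" is taken to start at index 5 (use `x[6:]` to exclude it), and the "middle subarray" is taken to be indices 4 to 6, since the question does not fix its bounds.

```python
import numpy as np

x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

first_five = x[:5]        # 1) elements at indices 0 to 4
after_five = x[5:]        # 2) from index 5 onward (assumed reading)
middle = x[4:7]           # 3) one possible middle subarray (assumed bounds)
last_two = x[-2:]         # 4) negative index counts from the end
evens = x[x % 2 == 0]     # 5) boolean mask keeps even values
odds = x[x % 2 == 1]      # 6) boolean mask keeps odd values

for name, part in [("first five", first_five), ("after index 5", after_five),
                   ("middle", middle), ("last two", last_two),
                   ("even", evens), ("odd", odds)]:
    print(name, part)
```

For this particular array, `x[::2]` and `x[1::2]` give the same even and odd results, but they select by position rather than by value.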

SECTION-I
a) Define the terms observation and variable. Describe the types of variables in detail with examples.
b) How is missing data handled while preparing the data tables?
c) Write a short note on confidence intervals.
d) Elaborate on the process of making sense of data.
e) What do you mean by fancy indexing?

a) Here are the runs scored by a cricket team in a league of 12 matches: 100, 120, 110, 150, 110,
140, 130, 170, 120, 220, 140, 110. Draw a box plot for the given data and identify the outlier.
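
The numbers behind the box plot in question a) can be checked with a short sketch. It assumes the textbook convention of taking quartiles as medians of the lower and upper halves and the usual 1.5 × IQR rule for outliers; the variable names are illustrative.

```python
from statistics import median

runs = [100, 120, 110, 150, 110, 140, 130, 170, 120, 220, 140, 110]

data = sorted(runs)
n = len(data)
half = n // 2

# Quartiles as medians of the lower and upper halves (assumed convention)
lower, upper = data[:half], data[n - half:]
q1, q2, q3 = median(lower), median(data), median(upper)

# 1.5 * IQR rule: points beyond the fences are flagged as outliers
iqr = q3 - q1
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr
outliers = [r for r in data if r < low_fence or r > high_fence]

print("Q1:", q1, "median:", q2, "Q3:", q3)
print("fences:", low_fence, high_fence)
print("outliers:", outliers)
```

Under these assumptions Q1 is 110, the median is 125, Q3 is 145, and the only score above the upper fence of 197.5 is 220. Passing `runs` to Matplotlib's `plt.boxplot` draws the plot itself; Matplotlib interpolates quartiles, so its box edges can differ slightly from the hand calculation.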

b) What do you mean by central tendency? Explain the following statistics with suitable examples.
(i) Mode, (ii) Median, (iii) Mean, (iv) Range, (v) Variance, (vi) Standard deviation
c) Visualize a strong positive correlation between two variables using the correlation coefficient
method. Use a suitable example.
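
For question c), a minimal sketch of computing the Pearson correlation coefficient is given here. The study-hours and marks data are invented for illustration only.

```python
import numpy as np

# Made-up data: hours studied and marks obtained by eight students
hours = np.array([1, 2, 3, 4, 5, 6, 7, 8])
marks = np.array([35, 42, 50, 53, 61, 68, 72, 80])

# np.corrcoef returns the 2 x 2 correlation matrix;
# the off-diagonal entry is the coefficient r between the two variables
r = np.corrcoef(hours, marks)[0, 1]

print("Pearson r:", round(r, 3))
```

A value of r close to +1 indicates a strong positive linear relationship. To visualize it, the same two arrays can be passed to `plt.scatter(hours, marks)`, where the points should rise from lower left to upper right.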

SECTION-II
a) Define Exploratory Data Analysis and elaborate on the steps of EDA.
b) Elaborate on the features of Pandas.
c) Elaborate on two-way contingency tables with a suitable example.
d) What is hierarchical indexing?
e) What is a frequency distribution table? Draw a frequency distribution table for a suitable example.

a) Why is a contour plot required? Write Python code to design a labelled contour plot along
with a color bar.
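
One way the code part of question a) might be approached is sketched here. The surface Z is a made-up Gaussian bump, and the Agg backend is chosen only so the script runs without a display.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # non-interactive backend (assumption: no display needed)
import matplotlib.pyplot as plt

# Build a grid and a made-up surface z = f(x, y)
x = np.linspace(-3, 3, 100)
y = np.linspace(-3, 3, 100)
X, Y = np.meshgrid(x, y)
Z = np.exp(-(X**2 + Y**2) / 2)

fig, ax = plt.subplots()

# Filled contours give the colour scale; line contours carry the labels
filled = ax.contourf(X, Y, Z, levels=10, cmap="viridis")
lines = ax.contour(X, Y, Z, levels=10, colors="black", linewidths=0.5)
ax.clabel(lines, inline=True, fontsize=8)

# Colour bar tied to the filled contours
fig.colorbar(filled, ax=ax, label="Z value")

ax.set_xlabel("x")
ax.set_ylabel("y")
ax.set_title("Labelled contour plot")
```

In an interactive session, drop the `matplotlib.use("Agg")` line and call `plt.show()` to display the figure, or use `fig.savefig(...)` to write it to a file.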

b) Design a radar chart for product comparison and describe the findings. Let the products be 4
types of smartphones; the axes might include “battery life”, “camera quality”, “screen size”,
“processor speed”, “price”. Also list the benefits and limitations of using a radar chart.
Section – II

a) Explain the operations on data in pandas in detail.
b) How are missing values handled in Pandas? Explain with an example.
c) What is data visualization? Identify the design principles of data visualization.
d) Define Matplotlib and list the libraries for plotting graphs; explain any one in detail with syn
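
For question b) in the list above, a minimal sketch of common missing-value operations in pandas follows. The small table and its column names are made up for illustration.

```python
import numpy as np
import pandas as pd

# Made-up table with one missing value in each column
df = pd.DataFrame({
    "age": [25, np.nan, 31, 40],
    "city": ["Pune", "Mumbai", None, "Delhi"],
})

# Detect: count missing values per column
missing_counts = df.isnull().sum()

# Option 1: drop every row that contains a missing value
dropped = df.dropna()

# Option 2: fill instead of dropping
filled = df.copy()
filled["age"] = filled["age"].fillna(filled["age"].median())  # numeric: median
filled["city"] = filled["city"].fillna("Unknown")             # text: constant

print(missing_counts)
print(dropped)
print(filled)
```

Dropping rows is simple but loses data, while filling keeps every row at the cost of introducing estimated values. Which option is appropriate depends on how much data is missing and why.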

a) What are the steps to combine datasets using append, concat, merge, and join in pandas?
b) Explain the different types of string operations in pandas.
c) Write Python code to draw a bar chart and a histogram using Matplotlib and Seaborn.
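
For question a) in the list above, the combining operations can be sketched as follows. The tables, names, and marks are invented for illustration. Note that `DataFrame.append` was removed in pandas 2.0, so `pd.concat` is used here for appending rows.

```python
import pandas as pd

# Made-up tables for illustration
df1 = pd.DataFrame({"id": [1, 2], "name": ["Asha", "Ravi"]})
df2 = pd.DataFrame({"id": [3, 4], "name": ["Meera", "John"]})
marks = pd.DataFrame({"id": [1, 2, 3], "marks": [78, 85, 91]})

# concat: stack tables with the same columns on top of each other
stacked = pd.concat([df1, df2], ignore_index=True)

# merge: SQL-style join on a shared key column; inner keeps matching ids only
merged = pd.merge(stacked, marks, on="id", how="inner")

# join: combines on the index; left keeps every row of the calling table
joined = stacked.set_index("id").join(marks.set_index("id"), how="left")

print(stacked)
print(merged)
print(joined)
```

Here the inner merge drops id 4 because it has no marks, while the left join keeps it and fills its marks with NaN.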

Section – II

a) Explain the operations on data in pandas in detail.
e) What are pivot tables in Pandas? Explain their role in data analysis and provide an
example of how to create and interpret a pivot table.
f) Explain the difference between univariate and multivariate data visualization. Discuss the types of p
g) Define Matplotlib and list the libraries for plotting graphs; explain any one in detail with syn
d) What are the steps to combine datasets using append, concat, merge, and join in pandas?
e) What are some strategies for dealing with missing values in a dataset using Pandas?
f) Explain the differences between line plots and scatter plots in Matplotlib. When should each type o
