0% found this document useful (0 votes)
24 views5 pages

MCQ FDS (1)

MULTIPLE CHOICE QUESTION FDS

Uploaded by

ramyaproject
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views5 pages

MCQ FDS (1)

MULTIPLE CHOICE QUESTION FDS

Uploaded by

ramyaproject
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

(Academic year 2019-2020)

CS-VAC- Fundamentals of Data Science


1. Which of the following would be more appropriate to be replaced with question
mark in the following figure?

a) Data Analysis
b) Data Science
c) Descriptive Analytics
d) Commerce

2. Which of the following is the most important language for Data Science?
a) Java
b) Ruby
c) R
d) Basic

3.------------------------Shows all individual data points.


a) Box-plot
b) Scatter Plot
c) Line plot
d) Pie chart
4.What is the primary goal of exploratory data analysis (EDA)?

a) Making predictions
b) Summarizing data characteristics
c) Testing hypotheses
d) d)Model deployment

5. What does the term "outlier" refer to in statistics?


a)The mean of a dataset
b)Unusual or extreme values in a dataset
c)The mode of a dataset
d)The standard deviation of a dataset

6.What is the key difference between correlation and causation?


a)Correlation measures the strength of a relationship, causation implies a cause-
and-effect relationship
b)Correlation and causation are synonymous terms
c)Correlation implies a cause-and-effect relationship, causation measures the strength
of a relationship
d)Correlation and causation have no relationship

7.What is the purpose of a histogram in data visualization?


a)Displaying hierarchical relationships
b)Comparing multiple datasets
c) Showing the distribution of a single variable
d)Representing geographical data

8.How do you calculate the median of a dataset?


a)Sorting the data and selecting the middle value
b)Finding the average of the smallest and largest values
c)Summing all values and dividing by the number of values
d)Identifying the most frequently occurring value

9. What is the primary goal of the Data Science Process?


a) Data Retrieval
b) Building Models
c) Presenting Findings
d) Data Preparation

10.What is the primary purpose of a scatter plot?


a)Showing the distribution of a single variable
b)Representing hierarchical relationships
c)Visualizing the relationship between two variables
d)Displaying the summary statistics of a dataset

11.The stem and leaf displaying technique is used to present data in


a) descriptive data analysis
b) exploratory data analysis
c) nominal data analysis
d) ordinal data analysis
12.PANDAS stands for _____________
a. Panel Data Analysis
b. Panel Data analyst
c. Panel Data
d. Panel Dashboard

13.Which of the following are modules/libraries in Python?


a. NumPy
b. Pandas
c. Matplotlib
d. All of the above

14. Which of the following library in Python is used for plotting graphs and visualization.
a. Pandas
b. NumPy
c. Matplotlib
d. None of the above

15. Which of the following command is used to install pandas?


a. pip install pandas
b. install pandas
c. pip pandas
d. None of the above
16. The data label associated with a particular value of Series is called its ______________
a. Data value
b. Index
c. Value
d. None of the above
17. How many values will be there in array1, if given code is not returning any error?
>>> series4 = pd.Series(array1, index = [“Jan”, “Feb”, “Mar”, “Apr”])
a. 1
b. 2
c. 3
d. 4
18. How many elements will be there in the series named “S1”?
S1 = pd.Series(range(5))
print(S1)
a. 5
b. 4
c. 6
d. None of the above

19. Write the output of the following :


S1=pd.Series([14, 7, 9] ,index = range(1, 8, 3))
print(S1)
a.
14 1
7 4
9 7
dtype: int64
b.
1 14
47
79
dtype: int64
c. Error
d. None of the above

20. Which of the following code will generate the following output?
Jan 31
Feb 28
Mar 31
dtype: int64
a.
import pandas as pd
S1 = pd.Series(data = [31,28,31], index=["Jan","Feb","Mar"])
print(S1)
b.
import pandas as pd
S1 = pd.Series([31,28,31], index=["Jan","Feb","Mar"])
print(S1)
c. Both of the above
d. None of the above

21. Which of the following fills the missing values in Series?


a. fill value
b. fill-value
c. fill_value
d. fill_value( )

22. Which of the following method is used to add two series?


a. sum( )
b. addition( )
c. add( )
d. None of the above

23. We can perform _____________ on two series in Pandas.


a. Addition
b. Subtraction
c. Multiplication
d. All of the above

24. Write the output of the following :


import pandas as pd
S1=pd.Series([1,2,3,4])
S2=pd.Series([7,8,9,10])
S2.index=['a','b','c','d']
print((S1+S2).count())

a. 8
b. 4
c. 0
d. 6
25.Write the output of the following code :
import pandas as pd
S1=pd.Series([1,2,3,4])
S2=pd.Series([7,8])
S3=S1+S2
print(S3.size)
a. 2
b. 4
c. 6
d. Error

You might also like