PCED-30-01 Certified Entry-Level Data Analyst With Python Dumps
PCED-30-01 Certified Entry-Level Data Analyst With Python Dumps
2.What does the acronym API stand for in the context of data communication?
A. Application Programming Interface
B. Advanced Python Interface
C. Algorithmic Programming Interface
D. Automated Process Integration
Answer: A
3.What is the process of transforming categorical variables into a numerical format for analysis
called?
A. Normalization
B. Encoding
C. Clustering
D. Dimensionality Reduction
Answer: B
5.Which type of model is commonly used for anomaly detection in data analysis?
A. Neural network
B. Support Vector Machine
C. Random forest
D. Isolation Forest
Answer: D
6.In data analysis and modeling, what is the process of identifying and handling missing values in a
dataset called?
A. Data preprocessing
B. Feature engineering
C. Outlier detection
D. Data imputation
Answer: D
7.Which machine learning algorithm is commonly used for clustering in data modeling?
A. Decision tree
B. K-means
C. Linear regression
D. Random forest
Answer: B
8.What is the result of the following Python code? ```python a = 10 b = 5 result = a % b print(result) ```
A. 2
B. 5
C. 0
D. 1
Answer: D
10.In which phase of the data modeling process is model validation typically performed?
A. Data cleaning
B. Data preprocessing
C. Model building
D. Model evaluation
Answer: D
11.Which of the following is a valid way to import a module named 'math' in Python?
A. include math
B. import math
C. from math import *
D. using math
Answer: B
14.Which keyword is used in Python to exit a loop before it completes all its iterations?
A. break
B. continue
C. pass
D. stop
Answer: A
15.What type of plot is commonly used to visualize the relationship between two continuous
variables?
A. Histogram
B. Scatter plot
C. Bar chart
D. Pie chart
Answer: B
16.In data visualization, what is the purpose of adding annotations to a chart or graph?
A. To make the chart more colorful
B. To add text descriptions to data points
C. To show categorical data
D. To adjust the size of the chart
Answer: B
17.Which of the following is NOT a commonly used method for data pre-processing?
A. Data cleaning
B. Data integration
C. Data visualization
D. Data transformation
Answer: C
20.In a line chart, which axis typically represents the time or chronological sequence of events?
A. X-axis
B. Y-axis
Answer: A
21.In a line plot, what do the points connected by a line represent?
A. Mean values
B. Median values
C. Maximum values
D. Sequential data points
Answer: D
22.Which type of visualization is commonly used to show the relationship between two numerical
variables?
A. Scatter plot
B. Bar chart
C. Line chart
D. Box plot
Answer: A
23.What statistical test is used to compare means between more than two groups or conditions?
A. Independent samples t-test
B. Paired t-test
C. ANOVA
D. Wilcoxon signed-rank test
Answer: C
24.What does the following Python code snippet do? ```python numbers = [1, 2, 3, 4, 5] squared =
[x**2 for x in numbers] print(squared) ```
A. Prints the original list of numbers
B. Prints the square of each number in the list
C. Prints the sum of the numbers in the list
D. Prints the average of the numbers in the list
Answer: B
25.Which statistical measure is used to assess the spread of data around the mean?
A. Median
B. Range
C. Standard deviation
D. Variance
Answer: C
28.In a line plot, which axis typically represents the independent variable?
A. X-axis
B. Y-axis
Answer: A
29.What is the process of collecting raw data from various sources before it is cleaned and
transformed for analysis?
A. Data Visualization
B. Data Mining
C. Data Acquisition
D. Data Modeling
Answer: C
31.Which of the following correlation coefficients indicates a perfect negative linear relationship
between two variables?
A. 0.75
B. -0.50
C. 0
D. -1
Answer: D
32.Which of the following libraries in Python is commonly used for creating interactive visualizations?
A. NumPy
B. Pandas
C. Matplotlib
D. Scikit-learn
Answer: C
33.1.In the context of data acquisition, what does ETL stand for?
A. Extract, Transform, Load
B. Extract, Transfer, Load
C. Edit, Transform, Load
D. Enhance, Transform, Load
Answer: A
34.When evaluating a classification model, which metric can be used to assess the balance between
precision and recall?
A. Accuracy
B. F1 score
C. Mean squared error
D. R-squared
Answer: B
35.Which Python function can be used to merge two datasets based on a common column in data pre-
processing?
A. merge()
B. concat()
C. join()
D. append()
Answer: A
36.Which statistical method is commonly used in data analysis to determine the relationship between
two continuous variables?
A. T-test
B. ANOVA
C. Pearson correlation
D. Chi-square test
Answer: C