0% found this document useful (0 votes)
5 views

DPT Week 4

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views

DPT Week 4

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Week 4

Questions:
Explain the principles of EDA.

Answer:

Exploratory Data Analysis (EDA) is a critical phase in the data analysis process that focuses on
understanding the underlying patterns, trends, and relationships within a dataset before formal
modeling. The main principles of EDA include:

1. Visualization: EDA emphasizes the use of graphical representations such as histograms,


box plots, scatter plots, and pair plots to identify distributions, outliers, and correlations
among variables.
2. Summary Statistics: It involves calculating summary statistics, such as mean, median,
mode, variance, and standard deviation, which provide insights into the central
tendencies and variability of the data.
3. Identifying Patterns and Relationships: EDA seeks to uncover relationships between
variables, helping to identify potential predictors or groups within the data.
4. Handling Missing Values and Outliers: EDA assists in detecting and addressing missing
data and outliers, which can skew analysis and affect model performance.
5. Iterative Process: It is an iterative approach where initial findings prompt further
investigation, leading to deeper insights into the data.

By employing these principles, EDA helps data scientists and analysts gain a comprehensive
understanding of the data, setting a solid foundation for more formal analysis and modeling.

You might also like