Chatgpt Unit - 1
4. Review of Probability
Fundamental Concepts:
Random Variables: Assign numerical values to the outcomes of random phenomena.
Probability Distribution: A function that describes the likelihood of outcomes.
Conditional Probability: Probability of an event given another event has occurred.
Bayes' Theorem:
P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}
Applications in ML:
Spam filtering.
Recommendation systems.
Medical diagnosis.
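As a worked illustration of Bayes' theorem in the medical-diagnosis setting, the sketch below computes the probability of disease given a positive test. The function name `bayes_posterior` and the numbers (1% prevalence, 95% sensitivity, 5% false-positive rate) are assumptions chosen for illustration, not values from these notes.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(A|B) from P(A), P(B|A), and P(B|not A)."""
    # Law of total probability: P(B) = P(B|A)P(A) + P(B|not A)P(not A)
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)
    # Bayes' theorem: P(A|B) = P(B|A)P(A) / P(B)
    return likelihood * prior / evidence


# Hypothetical numbers: A = "has disease", B = "test is positive".
posterior = bayes_posterior(prior=0.01, likelihood=0.95, false_positive_rate=0.05)
print(f"P(disease | positive test) = {posterior:.3f}")
```

With these assumed inputs the posterior is roughly 0.16: even an accurate test yields mostly false positives when the condition is rare, because the prior P(A) is small.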
7. Data Preprocessing
Steps:
1. Data Cleaning:
o Handle missing values (mean imputation, removal).
o Remove duplicates.
2. Normalization: Scale features to a consistent range.
3. Encoding Categorical Variables: Convert labels to numbers.
4. Outlier Detection: Identify and handle anomalies.
5. Feature Selection: Retain only relevant features.
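The first three steps above can be sketched in plain Python on a toy dataset. The records, the variable names, and the choice of min-max scaling and integer label codes are assumptions for illustration. Outlier detection and feature selection are omitted for brevity.

```python
from statistics import mean

# Hypothetical toy records: (age, city); None marks a missing value.
rows = [(25, "Paris"), (None, "Lima"), (35, "Paris"), (25, "Paris"), (30, "Lima")]

# 1. Data cleaning: drop exact duplicates (keeping the first occurrence),
#    then fill missing ages with the mean of the known ages.
deduped = list(dict.fromkeys(rows))
age_mean = mean(age for age, _ in deduped if age is not None)
cleaned = [(age if age is not None else age_mean, city) for age, city in deduped]

# 2. Normalization: min-max scale ages to the range [0, 1].
ages = [age for age, _ in cleaned]
lo, hi = min(ages), max(ages)
scaled_ages = [(age - lo) / (hi - lo) for age in ages]

# 3. Encoding: map each city label to an integer code.
codes = {city: i for i, city in enumerate(sorted({c for _, c in cleaned}))}
encoded = [codes[city] for _, city in cleaned]

print(scaled_ages)
print(encoded)
```

Integer codes imply an ordering between categories that may not exist. One-hot encoding is a common alternative when the labels are unordered.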
9. Function Approximation
Process:
1. Select a mathematical function.
2. Minimize error between predictions and actual outputs.
Examples:
Linear regression.
Neural networks.
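The two-step process above can be sketched for the linear regression case: the chosen function is a line y = w*x + b, and the error minimized is the mean squared error, which has a closed-form solution here. The helper names and the data points are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = w*x + b; returns (w, b)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Slope = covariance(x, y) / variance(x); the intercept follows from the means.
    cov = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    var = sum((x - x_mean) ** 2 for x in xs)
    w = cov / var
    b = y_mean - w * x_mean
    return w, b


def mse(xs, ys, w, b):
    """Mean squared error between predictions w*x + b and actual outputs."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical data lying close to y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 6.8, 9.0]

w, b = fit_line(xs, ys)
print(f"w = {w:.2f}, b = {b:.2f}, MSE = {mse(xs, ys, w, b):.4f}")
```

A neural network follows the same pattern with a more flexible function family, and the error is usually minimized iteratively (for example by gradient descent) rather than in closed form.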
10. Overfitting
What is Overfitting?
Occurs when the model memorizes training data rather than learning general patterns.
Prevention Techniques:
Use more data.
Apply regularization.
Employ simpler models.
Perform cross-validation.
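A minimal sketch of how cross-validation exposes overfitting, under assumed toy data: a "memorizer" (a 1-nearest-neighbour lookup that reproduces its training targets exactly) is compared with a simpler straight-line model. All function names, the noise values, and the choice of 3 interleaved folds are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Simple model: least-squares line, returned as a prediction function."""
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    cov = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    var = sum((x - x_mean) ** 2 for x in xs)
    w = cov / var
    b = y_mean - w * x_mean
    return lambda x: w * x + b


def fit_memorizer(xs, ys):
    """Memorizing model: returns the target of the nearest training point."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]


def mse(model, xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def cross_val_mse(fit, xs, ys, k=3):
    """Average validation MSE over k interleaved folds."""
    n = len(xs)
    scores = []
    for fold in range(k):
        val_idx = set(range(fold, n, k))
        train = [(xs[i], ys[i]) for i in range(n) if i not in val_idx]
        val = [(xs[i], ys[i]) for i in range(n) if i in val_idx]
        model = fit(*zip(*train))  # fit on the training folds only
        scores.append(mse(model, *zip(*val)))  # score on the held-out fold
    return sum(scores) / k


# Hypothetical data: y = 2x + 1 plus fixed, hand-picked noise.
noise = [0.8, -0.5, 0.3, -0.9, 0.6, -0.2, 0.9, -0.7, 0.1, -0.4, 0.5, -0.6]
xs = [float(i) for i in range(12)]
ys = [2 * x + 1 + e for x, e in zip(xs, noise)]

for name, fit in [("memorizer", fit_memorizer), ("line", fit_line)]:
    train_err = mse(fit(xs, ys), xs, ys)
    cv_err = cross_val_mse(fit, xs, ys)
    print(f"{name}: train MSE = {train_err:.3f}, cross-validated MSE = {cv_err:.3f}")
```

The memorizer has zero training error yet a much larger cross-validated error than the line, which is the signature of overfitting: the training score alone would have favoured the wrong model.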