
Generative models

Generative modelling

● Purpose: It models how data is generated by learning the joint probability distribution P(x, y), or just P(x) for unlabeled data. It aims to understand the underlying data structure and generate new, realistic samples.
●​ How it Works: It learns both the features and their correlations, allowing it to generate new
data points similar to the training set. It tries to answer, "How likely is this input?"
●​ Examples:
○​ GANs (Generative Adversarial Networks): Generate realistic images, videos, or
sounds by pitting two neural networks (Generator and Discriminator) against each
other.
○​ VAEs (Variational Autoencoders): Learn efficient data representations to generate
new data samples with controlled variation.
●​ Applications: Image synthesis, text generation, data augmentation, and anomaly detection.
●​ Advantages: Can generate new, unseen data samples, useful for creative tasks and data
augmentation.
●​ Disadvantages: Often harder to train and prone to issues like mode collapse (repetitive
outputs).
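
As a toy illustration of what "learning P(x) and sampling from it" means, the sketch below (with made-up data and variable names) fits a single Gaussian to a dataset and then draws new points from the learned distribution. Real generative models such as GANs and VAEs learn far more flexible distributions, but the idea is the same.

```python
import numpy as np
from scipy.stats import multivariate_normal

# Toy "training set": 500 two-dimensional points (illustrative data only).
rng = np.random.default_rng(0)
X = rng.normal(loc=[2.0, -1.0], scale=[1.0, 0.5], size=(500, 2))

# "Learn" P(x) by estimating the parameters of a single Gaussian.
mu = X.mean(axis=0)              # estimated mean
cov = np.cov(X, rowvar=False)    # estimated covariance

# Generate new, unseen samples from the learned distribution.
new_samples = rng.multivariate_normal(mu, cov, size=10)
print(new_samples)

# Answer "How likely is this input?" with the learned density.
print(multivariate_normal(mean=mu, cov=cov).pdf([2.0, -1.0]))
```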

Discriminative modelling

● Purpose: It models the decision boundary between classes by learning the conditional probability P(y|x), focusing on distinguishing between different categories.
●​ How it Works: It learns to map inputs to their corresponding labels, directly optimizing for
classification accuracy. It tries to answer, "Which class does this input belong to?"
●​ Examples:
○​ Logistic Regression: Classifies data into two categories by estimating probabilities.
○​ SVM (Support Vector Machine): Finds the optimal hyperplane that separates
different classes with the maximum margin.
○​ Neural Networks: Learn complex decision boundaries for tasks like image
classification and speech recognition.
●​ Applications: Image classification, spam detection, speech recognition, and fraud detection.
●​ Advantages: Usually simpler to train and more accurate for classification tasks.
● Disadvantages: Cannot generate new data samples and may require large labeled datasets.
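
For contrast, here is a minimal discriminative sketch (scikit-learn assumed available, data made up for illustration): it learns only P(y|x), i.e. a decision boundary, and cannot be used to generate new inputs.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Made-up two-class data: class 0 centred at (0, 0), class 1 at (3, 3).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, size=(100, 2)),
               rng.normal(3, 1, size=(100, 2))])
y = np.array([0] * 100 + [1] * 100)

# Learn the conditional distribution P(y | x) directly.
clf = LogisticRegression().fit(X, y)

# "Which class does this input belong to?"
print(clf.predict([[2.5, 2.0]]))         # predicted label
print(clf.predict_proba([[2.5, 2.0]]))   # P(y=0 | x), P(y=1 | x)
```
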
Generative VS discriminative modelling

Significance of generative models


●​ Data Generation: Generative models create new, realistic data samples that
resemble the original dataset.
●​ Data Augmentation: They generate variations of existing data to improve model
robustness and accuracy.
●​ Simulation and Training: Help simulate real-world scenarios for training AI agents in
robotics, games, and more.
●​ Natural Language Generation: Power language models like GPT to generate fluent,
context-aware text.
●​ Representation Learning: Capture meaningful features from data for tasks like
classification or clustering.
●​ Anomaly Detection: Identify unusual patterns by modeling what normal data should
look like.
●​ Personalization and Recommendations: Create customized content or
suggestions tailored to individual users.

Challenges of generative models


●​ Mode Collapse: In models like GANs, the generator may produce limited variations,
ignoring parts of the data distribution.
●​ Training Instability: Generative models, especially GANs, can be difficult to train
and may not converge reliably.
●​ High Computational Cost: They often require large datasets and significant
computational resources to train effectively.
●​ Evaluation Difficulty: It’s hard to objectively measure the quality and diversity of
generated outputs.
●​ Overfitting: Generative models may memorize training data instead of learning to
generalize.
●​ Security Risks: Generated data can be misused for deepfakes or identity fraud.
●​ Data Privacy Concerns: If not trained carefully, models might leak sensitive
information from the training set.
●​ Poor Interpretability: It’s often unclear what exactly the model has learned or why it
generates specific outputs.

GAN VS VAE

Probabilistic models

GMM
A Gaussian Mixture Model (GMM) is a probabilistic model that assumes all the data points
are generated from a mixture of several Gaussian (normal) distributions with unknown
parameters.​
Each component in the mixture represents a cluster, and the model assigns probabilities to
each point for belonging to a particular cluster.

Working of GMM
GMM works by modeling the data as a weighted sum of multiple Gaussian distributions,
each having its own mean and covariance.

🔁 Steps:
1. Initialization:
○ Choose the number of components (clusters), say K.
○ Randomly initialize the means μ_k, covariances Σ_k, and mixing coefficients π_k.

2. E-Step (Expectation):
○ For each data point, calculate the probability of it belonging to each Gaussian using the current parameters (soft clustering).

3. M-Step (Maximization):
○ Update the parameters (means, covariances, and mixing coefficients) to maximize the likelihood of the data under the current assignments.

4. Repeat E and M steps until convergence (i.e., the parameters stop changing significantly).

This is done using the Expectation-Maximization (EM) algorithm.
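
As a brief sketch of this workflow, scikit-learn's GaussianMixture runs the initialization, E-step, and M-step internally; the data and settings below are purely illustrative.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

# Illustrative data drawn from two different Gaussian clusters.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0.0, 1.0, size=(200, 2)),
               rng.normal(5.0, 1.5, size=(200, 2))])

# K = 2 components; EM (initialization, E-step, M-step, repeat) runs inside fit().
gmm = GaussianMixture(n_components=2, covariance_type="full", random_state=0)
gmm.fit(X)

print(gmm.means_)                # learned means mu_k
print(gmm.weights_)              # learned mixing coefficients pi_k
print(gmm.predict_proba(X[:3]))  # soft assignments from the E-step
print(gmm.score_samples(X[:3]))  # log-likelihoods (low values can flag anomalies)
```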

Advantages
● Soft Clustering: Unlike K-Means, GMM assigns probabilities rather than hard labels, which is better for overlapping clusters.
● Flexible Shapes: Can model elliptical (non-spherical) clusters due to covariance matrices.
● Probabilistic Framework: Can handle uncertainty and provides likelihoods.
● Works with EM Algorithm: Efficient estimation of parameters.

Limitations
● Number of Components Must Be Predefined: You need to specify the number of Gaussians in advance.
● Sensitive to Initialization: Bad initialization may lead to poor convergence.
● Prone to Overfitting: Especially with many components or small datasets.
● Assumes Gaussian Distributions: Not ideal if the actual data distribution is non-Gaussian.

Applications
●​ Image Segmentation: Separating objects based on pixel intensities.​

●​ Anomaly Detection: Points with low probability are flagged as anomalies.​

●​ Speech Recognition: Modeling acoustic features using GMMs.​

●​ Clustering: An alternative to K-Means when data has different variances.​

●​ Finance: Modeling returns of assets which often follow a mixture distribution.

HMM

A Hidden Markov Model (HMM) is a statistical model that describes systems that are influenced by
hidden (unobservable) states but produce observable outputs. It models the probabilistic
relationship between a sequence of hidden states and corresponding observations, helping to make
predictions or understand patterns when the underlying system is not directly visible. It involves:

●​ Hidden States: Unobservable internal states (e.g., weather conditions like Sunny or Rainy).
●​ Observations: Visible outputs influenced by hidden states (e.g., Dry or Wet ground).

Key Components:

1. Initial State Probabilities: Probabilities of starting in each hidden state.
2. Transition Probabilities: Probabilities of moving from one hidden state to another.
3. Emission Probabilities: Probabilities of observing a specific output given a hidden state.

Example: Weather Prediction

Imagine you want to predict the weather (Sunny or Rainy) based on observed conditions (Dry or
Wet). In this case:

●​ Hidden States: The actual weather (Sunny, Rainy) which you cannot observe directly.
●​ Observations: The ground condition (Dry, Wet) that you can see.

Using HMM:

●​ Start Probabilities: Initial chances of the weather being Sunny (60%) or Rainy (40%).
●​ Transition Probabilities: Chances of moving from one weather state to another. For example,
if it's Sunny today, there's a 70% chance it will be Sunny tomorrow and a 30% chance of Rain.
●​ Emission Probabilities: Chances of observing Dry or Wet conditions given the hidden state.
For example, if it's Sunny, there's a 90% chance the ground is Dry.

With this setup, HMM can predict the most likely sequence of weather conditions given a sequence
of observed ground states.
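
Below is a minimal sketch of that prediction using the Viterbi algorithm written out by hand. The numbers given in the example above are used where available; the remaining transition and emission probabilities (the Rainy rows) are assumed purely for illustration.

```python
import numpy as np

states = ["Sunny", "Rainy"]
observations = ["Dry", "Wet"]

# Probabilities from the example; the Rainy-row values are assumed for illustration.
start_p = np.array([0.6, 0.4])            # P(Sunny), P(Rainy)
trans_p = np.array([[0.7, 0.3],           # from Sunny
                    [0.4, 0.6]])          # from Rainy (assumed)
emit_p = np.array([[0.9, 0.1],            # Sunny -> Dry / Wet
                   [0.2, 0.8]])           # Rainy -> Dry / Wet (assumed)

def viterbi(obs_seq):
    """Most likely hidden-state sequence for a list of observation indices."""
    T, N = len(obs_seq), len(states)
    delta = np.zeros((T, N))               # best path probability ending in each state
    psi = np.zeros((T, N), dtype=int)      # back-pointers
    delta[0] = start_p * emit_p[:, obs_seq[0]]
    for t in range(1, T):
        for j in range(N):
            scores = delta[t - 1] * trans_p[:, j]
            psi[t, j] = np.argmax(scores)
            delta[t, j] = scores.max() * emit_p[j, obs_seq[t]]
    # Trace the back-pointers from the best final state.
    path = [int(np.argmax(delta[-1]))]
    for t in range(T - 1, 0, -1):
        path.insert(0, psi[t, path[0]])
    return [states[i] for i in path]

obs = [observations.index(o) for o in ["Dry", "Dry", "Wet"]]
print(viterbi(obs))   # -> ['Sunny', 'Sunny', 'Rainy'] with these numbers
```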

Uses of Hidden Markov Models

● Weather Forecasting: Predicting weather patterns based on observed conditions.
● Speech Recognition: Mapping audio signals to words.
● Bioinformatics: DNA sequence analysis and gene prediction.
● Finance: Modeling stock market trends.
● Natural Language Processing: Part-of-speech tagging and language translation.

Advantages

● Versatility: Applicable to a wide range of sequential data problems.
● Probabilistic Framework: Handles uncertainty and noise in observations.
● Efficient Algorithms: Algorithms like Viterbi and Baum-Welch efficiently find hidden states and train the model.

Disadvantages

●​ Assumption of Markov Property: Assumes that the current state only depends on the
previous state, which may not hold in complex scenarios.
●​ Parameter Estimation: Requires careful estimation of transition and emission probabilities.
●​ Hidden State Limitations: Number of hidden states must be predefined, which might
oversimplify complex systems.

Problems: see "Hidden Markov Model Clearly Explained! Part - 5" for worked examples.

MRF

A Markov Random Field (MRF), also known as an Undirected Graphical Model or Markov Network,
is a probabilistic model that uses an undirected graph to represent the dependencies between
random variables. In an MRF:

● Nodes represent random variables.
● Edges represent direct interactions between variables.

Key Features

●​ Undirected Edges: Unlike Bayesian networks, MRFs use undirected edges, indicating that the
relationship between connected nodes is mutual without a directional cause-and-effect flow.
●​ No Conditional Probability Distribution: Edges in an MRF show potential interactions but are
not associated with conditional probabilities.
●​ Local Interactions: Two nodes interact directly only if they are connected by an edge.

How It Works

1.​ Graph Structure: Nodes are variables (e.g., pixels), and edges show dependencies.
2.​ Potential Functions: These define how connected variables influence each other.
3.​ Joint Probability: Calculated using these functions to find the likelihood of a certain
configuration.
4.​ Inference: Used to predict unknown variables (e.g., labeling image regions).
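
As a tiny illustrative sketch of steps 2 and 3: two binary variables joined by one edge, a pairwise potential that favours agreement (the potential values are assumed, not taken from any standard model), and the joint probability obtained by normalising the product of potentials.

```python
import itertools

# Two binary variables x1, x2 joined by a single undirected edge.
# Pairwise potential: higher value when the neighbours agree (assumed numbers).
def potential(x1, x2):
    return 2.0 if x1 == x2 else 1.0

# Partition function Z: sum of the potential over every configuration.
configs = list(itertools.product([0, 1], repeat=2))
Z = sum(potential(a, b) for a, b in configs)

# Joint probability of each configuration = potential / Z.
for a, b in configs:
    print(f"P(x1={a}, x2={b}) = {potential(a, b) / Z:.3f}")
# Agreeing configurations (0,0) and (1,1) come out more likely: 0.333 each vs 0.167.
```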

Applications

● Image Segmentation: Classifying each pixel into objects or regions.
● Denoising: Cleaning noisy images while preserving edges.
● NLP Tasks: Part-of-speech tagging and named entity recognition.
● Spatial Analysis: Modeling geographical data dependencies.

Advantages

● Captures complex dependencies without assuming direction.
● Efficient calculations due to localized interactions.

Disadvantages

● Inference is computationally expensive.
● Needs large datasets for accurate parameter estimation.
● Accuracy depends on the chosen graph structure.

Bayesian network

A Bayesian Network (also known as a Belief Network) is a probabilistic graphical model that
represents a set of variables and their conditional dependencies using a directed acyclic graph
(DAG). It is based on Bayes' theorem and is used to model uncertainty in complex systems.

Components of a Bayesian Network:

1.​ Nodes: Represent random variables, which can be observable quantities, latent variables, or
unknown parameters.
2. Edges: Directed edges between nodes represent conditional dependencies. If there is an edge from node A to node B, then A directly influences B.
3. Conditional Probability Tables (CPTs): Each node has a CPT that quantifies the effect of the parent nodes on that node.

Example Scenario:

●​ B: Burglary occurred
●​ E: Earthquake occurred
●​ A: Alarm went off
●​ J: John called to report the alarm
●​ M: Mary called to report the alarm

The dependencies are:

● A burglary or an earthquake can trigger the alarm.
● If the alarm goes off, there's a chance that John and Mary will call.
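
A minimal sketch of this network in plain Python is shown below. It encodes the chain-rule factorisation P(B, E, A, J, M) = P(B) P(E) P(A|B, E) P(J|A) P(M|A); the CPT numbers are illustrative values chosen for this example, not figures from the notes.

```python
from itertools import product

# CPTs (illustrative values).
P_B = {True: 0.001, False: 0.999}                    # P(Burglary)
P_E = {True: 0.002, False: 0.998}                    # P(Earthquake)
P_A = {  # P(Alarm | Burglary, Earthquake)
    (True, True): 0.95, (True, False): 0.94,
    (False, True): 0.29, (False, False): 0.001,
}
P_J = {True: 0.90, False: 0.05}                      # P(John calls | Alarm)
P_M = {True: 0.70, False: 0.01}                      # P(Mary calls | Alarm)

def joint(b, e, a, j, m):
    """Joint probability of one full assignment, via the chain rule."""
    pa = P_A[(b, e)] if a else 1 - P_A[(b, e)]
    pj = P_J[a] if j else 1 - P_J[a]
    pm = P_M[a] if m else 1 - P_M[a]
    return P_B[b] * P_E[e] * pa * pj * pm

# Example query by enumeration: P(Burglary | John calls, Mary calls).
num = sum(joint(True, e, a, True, True) for e, a in product([True, False], repeat=2))
den = sum(joint(b, e, a, True, True) for b, e, a in product([True, False], repeat=3))
print(num / den)   # roughly 0.28 with these illustrative numbers
```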

Applications:

●​ Security Systems: To determine the likelihood of a break-in based on multiple sensor alerts.
●​ Medical Diagnosis: Inferring diseases from symptoms and test results.
●​ Fault Detection: In engineering systems based on observed failures.

Advantages:
●​ Compact Representation: Efficiently represents joint distributions.
●​ Causal Relationships: Clearly shows dependencies and causal structures.
●​ Flexible Inference: Can calculate probabilities given any evidence.

Challenges:

● Complex Inference: Exact calculations can be computationally expensive.
● Dependency Knowledge: Requires knowledge of conditional dependencies.
● Parameter Estimation: CPTs need accurate probabilities, which might not always be available.

EM algorithm
The Expectation-Maximization (EM) algorithm is an iterative method used to find
maximum likelihood estimates of parameters in probabilistic models when the data has
missing or hidden (latent) variables.

Where It's Used


●​ Gaussian Mixture Models (GMM)​

●​ Hidden Markov Models (HMM)​

●​ Missing data problems​

●​ Clustering with soft assignments

How EM Works
It alternates between two steps:

1. Expectation Step (E-step)

Estimate the expected value of the latent variables (like cluster assignments) given the
current parameters of the model.

Example in GMM: Compute the probability that each data point belongs to each
Gaussian (soft assignment).

2. Maximization Step (M-step)

Update the model parameters (like mean, variance, mixing coefficients) to maximize the
expected log-likelihood found in the E-step.
Example in GMM: Update the means, covariances, and weights of each
Gaussian using the soft assignments.

Repeat until convergence (i.e., changes in parameters or log-likelihood are minimal).
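
The sketch below spells out the two steps for a one-dimensional, two-component Gaussian mixture; the toy data, initial guesses, and fixed iteration count are all assumptions made for brevity.

```python
import numpy as np
from scipy.stats import norm

# Toy 1-D data drawn from two Gaussians (illustrative only).
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(0, 1, 300), rng.normal(5, 1, 300)])

# Initialization: guesses for means, variances, and mixing weights.
mu = np.array([-1.0, 1.0])
var = np.array([1.0, 1.0])
pi = np.array([0.5, 0.5])

for _ in range(50):
    # E-step: responsibility of each component for each point (soft assignment).
    dens = pi * norm.pdf(x[:, None], loc=mu, scale=np.sqrt(var))
    resp = dens / dens.sum(axis=1, keepdims=True)

    # M-step: re-estimate parameters to maximize the expected log-likelihood.
    Nk = resp.sum(axis=0)
    mu = (resp * x[:, None]).sum(axis=0) / Nk
    var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / Nk
    pi = Nk / len(x)

print(mu, np.sqrt(var), pi)   # means should approach roughly 0 and 5
```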

Advantages
●​ Handles missing or hidden data effectively.​

●​ Can work with soft (probabilistic) assignments.​

●​ Guaranteed to converge to a local optimum.

Limitations
●​ May converge to a local maximum, not necessarily the global one.​

●​ Sensitive to initialization.​

●​ Can be computationally expensive for large datasets.

Applications
●​ Clustering (e.g., Gaussian Mixture Models)​

●​ Natural Language Processing (e.g., topic models)​

●​ Image restoration​

●​ Bioinformatics​

●​ Anomaly detection
