Alignment Lecture 4
Alignment Lecture 4
Introduction to
Alignment
Approaches
Pairwise Sequence Alignment - I
Alignment
• The process of
inexact matching
while keeping in
view the conserved
residues is called
sequence
“Alignment”
Introduction to Alignment Approaches
Background
• Sequences vary due
to insertions,
deletions and
substitutions
• To calculate the
similarity between
two sequences, we
need to align the
sequences
Introduction to Alignment Approaches
Introduction
• Global Alignment
• Local Alignment
Introduction to Alignment Approaches
Local Alignment
“Match one sequence with a portion
of the second sequence”
Why Local Alignments?
Other Advantages
To calculate the
similarity between • Different length
two sequences, sequences can be
compared
we need to align
the sequences • Conserved domains
can be elicited from
proteins belonging
Local alignments have to different families
the power to detect • Common functional
small regions of high features can thus be
similarity between two identified
sequences
Introduction to Alignment Approaches
Global Alignment
“An end to end match of one sequence with the
second sequence”
Pairwise Sequence Alignment - II
• Local alignment
focuses on the
highly matching sub
regions within the
sequences
• Global alignment
creates an end-to-
end matching
between two
sequences
END
Pairwise Sequence Alignment -
Background
Pairwise sequence alignment is therefore
alignment of a pair of two sequences
• No change? match
• Insertions or deletions ("indels") result in
gaps in alignments
• Substitutions result in mismatches
Pairwise Sequence Alignment - III
Optimal Alignment
Align 1 & 2
1: THIS IS A RATHER LONGER SENTENCE THAN THE NEXT.
2: THIS IS A SHORT SENTENCE.
Conclusion
• Global and local
pairwise alignments
are two types of
alignment and they
can lead to different
results
• Substitutions lead to
mismatches
END
Sequence Analysis
Multiple
Sequence
Alignment
Multiple Sequence Alignment
Introduction
• Multiple sequence
alignment involves
comparison of three
of more sequences
• Pairwise alignment
was either local or
global
• Multiple alignment is
mostly global
Multiple Sequence Alignment
MQVKLFTPLHDKSDHGKYH
MQVKIFTPLHDKS-HGKSH
MQVHLY-PLHDKS-TGKSH
MQVHLF-PLHDKSDTGKSH
MQVKLYTPLHDKSDHGKYH
Multiple Sequence Alignment
MSA vs Pairwise
• For pairwise
alignments, we used
Dynamic
Programming
• Solution:
Progressive
Alignment
Multiple Sequence Alignment
Conclusion
• MSA can help align
multiple sequences
• Dynamic
Programming
cannot be used as it
becomes expensive
• Progressive
alignment can help
perform MSA!
• CLUSTAL – Online
tool
END
More on Multiple Sequence Alignment
Application of
MSA
• Predict secondary
and tertiary
structures of new
protein sequences
• Evaluate
evolutionary order
of species or
“Phylogeny”
More on Multiple Sequence Alignment
Methodology
• Pairwise alignment
is the alignment of
two sequences
• MSA can be
performed by
repeated application
of pairwise
alignment
More on Multiple Sequence Alignment
Perform clustering of
Create a hierarchy using
pairwise aligned
binary trees
sequences
Conclusion
• MSA can help align
multiple sequences
• Progressive
alignment can help
perform MSA!
• Need to remove
sequences with
>80% similarity
• CLUSTAL – Online
tool
END
Sequence Analysis
MSA - Example
MSA - Example
Background
• MSA involves
progressive
alignment of
sequences
• Doing so many
progressive
alignments can be
slow
• Let’s take a look at
an example
MSA - Example
Hbb_human
Glb5_petma
Hba_human
Myg_phyca
Lgb2_lupla
Hbb_horse
Hba_horse
Hbb_human 1 -
Hbb_horse 2 .17 -
Hba_human 3 .59 .60 -
Hba_horse 4 .59 .59 .13 -
Myg_phyca 5 .77 .77 .75 .75 -
Glb5_petma 6 .81 .82 .73 .74 .80 -
Lgb2_lupla 7 .87 .86 .86 .88 .93 .90 -
Hba_human
1
Hba_horse
2
Hbb_horse 3
Hbb_human 4
Glb5_petma 5
Myg_phyca 6
Lgb2_lupla
.081
Hbb_human
.226
2.084
.061 Hbb_horse
3 .055
Hba_human
.015
4
.219
1.065
Hba_horse
.062 .398
5 Myg_phyca
6 .389
Glb5_petma
.442
Lgb2_lupla
Alignment results
Conclusion
• MSA can be better
performed using
clustering strategies
followed by
alignment of the
alignments later
• CLUSTAL is a free
online tool that does
all of this for us!
END