BLAST - A Heuristic Algorithm
Anjali Tiwari
Pannaben Patel
Pushkala Venkataraman
BLAST: Basic Local Alignment Search Tool
Rapid searching of protein and nucleotide databases (DBs)
Seeking similar sequences
Databases: GenBank, SwissProt, PIR, PDB, PRF, nr (nr = non-redundant)
Program   Query        Database     Search level
Blastp    Amino acid   Amino acid   Amino acid
Blastn    Nucleotide   Nucleotide   Nucleotide
Blastx    Nucleotide   Amino acid   Amino acid
Tblastn   Amino acid   Nucleotide   Amino acid
Tblastx   Nucleotide   Nucleotide   Amino acid

BLAST: 3-step algorithm
1. Compile words
2. Scan DB
3. Extend
Some definitions
Alignment: the process of lining up 2 or more sequences to assess similarity.
BLOSUM62: a 20 x 20 substitution matrix for amino acids.
Gap: a space introduced into an alignment to compensate for insertions/deletions in one sequence relative to another.
Similarity Measures
Similarity matrix: BLOSUM
Local search algorithms
Unlikely replacements = negative (-ve) scores
Search flow:
Query input -> 1000s of sequences -> Calculate HSP (high-scoring segment pair) -> Calculate MSP (maximal segment pair) -> Display output
(Diagram labels: Step 2, Step 3)
Example
Step 1
Query: QQGPHUIQEGQQGKEEDPP
Words of length 3: w = QQG, QGP, GPH, PHU, HUI, ...
Take the first triple, QQG.
Make neighborhood words: w' = QQG, QEG, GQG
Keep the high-scoring triples, i.e. those with Blosum(w, w') > T, where T is the threshold parameter.
Suppose:
Blosum(QQG, QEG) = 18
Blosum(QQG, GQG) = 12
Blosum(QQG, QQG) = 16
T = 13
Choose QQG and QEG, since their Blosum values exceed T.
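Step 1 can be sketched in a few lines of Python. This is a minimal illustration, not BLAST's actual implementation: the helper names `words` and `high_scoring_neighbors` are hypothetical, and the scores are the "Suppose" values from the example rather than real BLOSUM62 lookups.

```python
def words(seq, k=3):
    """Return every overlapping word of length k in seq, in order."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


# Hypothetical scores copied from the example ("Suppose ..."),
# not looked up in a real BLOSUM62 matrix.
EXAMPLE_SCORES = {"QEG": 18, "GQG": 12, "QQG": 16}


def high_scoring_neighbors(scores, threshold):
    """Keep neighborhood words whose score is strictly greater than T."""
    return sorted(w for w, s in scores.items() if s > threshold)


query = "QQGPHUIQEGQQGKEEDPP"
query_words = words(query)
kept = high_scoring_neighbors(EXAMPLE_SCORES, threshold=13)

print(query_words[:5])  # ['QQG', 'QGP', 'GPH', 'PHU', 'HUI']
print(kept)             # ['QEG', 'QQG']
```

GQG is dropped because its assumed score of 12 does not exceed T = 13.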
Step 2
Suppose the database sequence is PKLMMQQGKQEGM.
Scan it for the words chosen in Step 1 (QQG and QEG).
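A minimal sketch of the scan, assuming exact string matching of the words kept in Step 1 against the example database sequence. The function name `find_hits` is hypothetical, and a real BLAST implementation uses a precomputed lookup structure rather than repeated string searches.

```python
def find_hits(db_seq, word_list):
    """Return (word, offset) pairs for every exact occurrence in db_seq."""
    hits = []
    for word in word_list:
        start = db_seq.find(word)
        while start != -1:
            hits.append((word, start))
            # Continue searching one position further to catch overlaps.
            start = db_seq.find(word, start + 1)
    return hits


db_seq = "PKLMMQQGKQEGM"
hits = find_hits(db_seq, ["QQG", "QEG"])
print(hits)  # [('QQG', 5), ('QEG', 9)]
```

Offsets are 0-based, so QQG starts at the sixth residue of the database sequence and QEG at the tenth.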
Step 3
Query:       QQGPHUIQEGQQGKEEDPP
DB sequence: PKLMMQQGKQEGM
Extend the hit QQG one residue at a time and score each extension:
Blosum(QQG, QQG) = 16
Blosum(QQGK, QQGK) = 21
Blosum(QQGKE, QQGKQ) = 23
Blosum(QQGKEE, QQGKQE) = 28
Blosum(QQGKEED, QQGKQEG) = 27
The score peaks at 28 (QQGKEE against QQGKQE) and falls to 27 on the next extension.
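The extension scores can be reproduced with a short sketch. Several things here are assumptions: the function names are hypothetical, the hit is taken to be the second QQG in the query (0-based offset 10) aligned with QQG at offset 5 in the database sequence, and only the handful of BLOSUM62 entries needed for this example are included. The fixed number of steps is also a simplification; BLAST itself stops extending when the score falls a set amount below the best score seen so far.

```python
# Only the BLOSUM62 entries this example needs; a real implementation
# would load the full 20 x 20 matrix.
BLOSUM62_SUBSET = {
    ("Q", "Q"): 5, ("G", "G"): 6, ("K", "K"): 5,
    ("E", "Q"): 2, ("E", "E"): 5, ("D", "G"): -1,
}


def pair_score(a, b):
    """Symmetric lookup in the partial matrix."""
    if (a, b) in BLOSUM62_SUBSET:
        return BLOSUM62_SUBSET[(a, b)]
    return BLOSUM62_SUBSET[(b, a)]


def extend_right(query, db_seq, q_start, d_start, word_len=3, steps=4):
    """Return the running scores of an ungapped extension to the right."""
    score = sum(
        pair_score(query[q_start + i], db_seq[d_start + i])
        for i in range(word_len)
    )
    scores = [score]
    for i in range(word_len, word_len + steps):
        score += pair_score(query[q_start + i], db_seq[d_start + i])
        scores.append(score)
    return scores


query = "QQGPHUIQEGQQGKEEDPP"
db_seq = "PKLMMQQGKQEGM"
scores = extend_right(query, db_seq, q_start=10, d_start=5)
print(scores)  # [16, 21, 23, 28, 27]

# Index of the best-scoring extension (first maximum).
best = max(range(len(scores)), key=scores.__getitem__)
print(query[10:10 + 3 + best], scores[best])  # QQGKEE 28
```

The last step adds the D/G pair, which scores -1 and pulls the total down from 28 to 27.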
ADVANTAGES
DISADVANTAGES
Gapped BLAST
Normal alignment is
ACGTA
ACATA
Alignment with a gap:
AC-ATA
CTGACAAGGGTATCG (figure: sequence annotated with a mismatch and a gap)
PSI-BLAST
1. Query sequence
2. BLAST search of DB
3. Sequences with high scores collected
4. Multiple alignment and profile made
5. DB searched with profile
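The "profile made" step can be illustrated with a deliberately simplified sketch that turns a set of already-aligned sequences into per-column residue frequencies. This is an assumption-laden toy: the function name `column_profile` and the three aligned hits are made up for illustration, and PSI-BLAST itself builds a position-specific scoring matrix with sequence weighting and pseudocounts rather than raw frequencies.

```python
from collections import Counter


def column_profile(aligned_seqs):
    """Return per-column residue frequencies for equal-length aligned sequences."""
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("aligned sequences must have equal length")
    profile = []
    for col in range(length):
        counts = Counter(s[col] for s in aligned_seqs)
        total = sum(counts.values())
        profile.append({res: n / total for res, n in counts.items()})
    return profile


# Hypothetical high-scoring hits, invented for illustration only.
aligned_hits = ["QQGKEE", "QQGKQE", "QEGKEE"]
profile = column_profile(aligned_hits)
print(profile[0])  # {'Q': 1.0}
```

In this toy profile the first column is fully conserved, while the second column mixes Q and E, so a search with the profile would tolerate either residue there.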