Protein Structure Prediction Using Homology Modeling
Protein Structure Prediction Using Homology Modeling
egories
Protein structure
Every 1D protein sequence string folds into 3D structures. These 3D protein structures are
determine how a protein responds to various environments and which other molecules it
interacts with, and hence is critical in the ability of the protein to perform its functions.
The 3D structure of protein is described by providing the coordinates (x-y-z) of every atom
in the protein, in 3D space.
Homology Modeling
One of most popular computational methods for protein structure prediction is Homology
Modeling. Homology modeling leverages the property of evolutionary conservation of
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 1/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
protein structures to predict 3D structures of proteins. Two proteins that have evolved
egories
from the same common ancestor (homology) tend to have similar 3D structures. In
homology modeling, this property of conservation of protein structure is used to predict
structures of newly discovered protein sequences whose structures cannot be resolved
using traditional experimental methods.
The main idea is that protein sequence with unknown structure is searched against the
sequence database of proteins where structures of all proteins are experimentally known
and the unknown structure is modeled from the evolutionarily closest or best matching
protein from the database.
In this article, we describe the approach and methodology of homology modeling, i.e. how
it works. We also describe how to use the SWISS-MODEL tool for performing homology
modeling.
For predicting the structure of the protein, we'll rst predict the co-ordinates of N, Ca, Cb
(backbone) and then the co-ordinates of the R-group (side-chain) of each amino acid.
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 2/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
Four amino acids in a chain. Notice the backbone comprising of N, Ca, Cb atoms and side-chain
egories
comprised of R-groups.
Given that we have already found an initial template, we can now use more expensive
alignment algorithms to nd a better alignment. For example, we can use multiple
sequence alignment algorithms for this step. Multiple sequence alignment are useful for
identifying regions that are highly divergent, and hence better detecting the appropriate
locations for insertions and deletions.
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 3/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
The secondary structure of the protein consists of helices, strands and loops. Since
conformational changes implied by insertions and deletions can't happen in helices and
strands, they must happen in the loops.
There are two main approaches to model loops: knowledge based and energy based. The
former approach searches for conformations of loops that have similar sequences and
endpoints as the target in the database of known structures. The latter models the loop
conformation in an ab initio manner by predicting the loop structures with lowest
structural energies using force eld functions and molecular dynamics. These methods
provide fairly accurate results for short loops with up to 5-8 residues.
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 4/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
Target-template recognition
First step is to search the target sequence against the database of sequences with known
protein structures. Paste accession number into the window and hit the “Search For
Templates” button
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 5/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
The search results show different protein structure templates that can be used to predict
egories
the protein structure for our target sequence. These templates are ranked according to
how well their sequences align with the target protein sequence. The rst selected
structure template is the best matching (99% identity). The second template matches the
target sequence with 51% identity. The superimposed protein structures of the two top
matching templates can be seen in the window on the right.
We will use the top two results to build or predict two structures for the target sequence
and then select the best predicted structure.
The modeling results can be seen below. The two predicted structures are ranked
according to the quality of their models. Next, we will assess the quality of both the
predicted structures to see which is the best.
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 6/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
egories
The QMEAN is one of the primary measures used to assess the model quality. QMEAN is
a composite scoring function based on different geometrical properties of protein
structures and provide both global (i.e.for the entire structure) and local (i.e. per residue)
absolute quality estimates.
QMEAN consists of four individual terms. The four individual terms of the global QMEAN
quality scores are also listed. The white area in the bar-plots (numerical values close to
zero) indicates that the property is similar to what is observed in experimental structures.
Positive values indicate that the model scores higher than experimental structures on
average, negative numbers indicate that the model scores lower than experimental
structures on average.
For the rst model (built using 2ef0.1.A as a template) the QMEAN terms mostly fall
within the white region.
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 7/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
egories
However, for the second model (built using 3gd5.1.A as a template) most QMEAN terms
signi cantly differ from the optimal
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 8/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
egories
Therefore, the structure predicted by the template 2ef0.1.A is the most optimal model
and can be used as the predicted structure for our target sequence.
Resources
SWISS-MODEL
MODELLER
References
"Homology Modeling" by Elmar Krieger, Sander B. Nabuurs, and Gert Vriend
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 9/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
egories Comment
MARK COMPLETED
★★★★★ Leave a Review
PREVIOUS
Detecting Mutations with Read Mapping and Suf x Trees
Akshay Yadav
Masters in Bioinformatics and 6 years of work experience in Bioinformatics & Computational Biology
Keshav Dhandhania
BSc, MSc 2014 @ MIT (AI, Deep Learning). Former Competitive Programmer.
Add a comment
Back to top
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 10/11
3/25/2020 Protein structure prediction using homology modeling | CommonLounge
Cryptography
egories
ABOUT US
Our Mission
How to Contribute
Help and FAQ
CommonLounge Meta
Team
Privacy, Terms & Refunds
GET IN TOUCH
[email protected]
Facebook
Twitter
San Francisco, CA, USA
+1 844 318 7406
https://www.commonlounge.com/discussion/73dd4037cc00473fb37bf4c52ec5bf30#_=_ 11/11