Slides 1
Slides 1
Senior Lecturer
Academic Unit of Respiratory Medicine
LU108
[email protected]
I am a respiratory cell biologist
• A cell is dynamic
– Responds to external conditions
– Most cells follow a cell cycle of division
Proteomics
My work principally involves the identification and study
of novel genes and the comparative analysis of well
established genes.
These include
EST analysis
Proteomics
These include:
EST analysis
Proteomics
HE4
De novo gene prediction
Output often require manual annotation and editing!
HE4
HE4 was identified in 1991 as a human epididymis-
specific DNA by the use of differential screening
Sequence analysis revealed that HE4 shared sequence homology with
WAP proteins and therefore it was suggested that it is a antiproteinase!
They have no functional evidence.
Subsequently it was used as a epididymal marker and cloned from dog
and rabbit.
More recently HE4 was identified as a putative marker of human
ovarian tumour by the use of a variety of expression array studies.
These papers used three different techniques; DNA and
oligonucleotide arrays as well as Serial Analysis of Gene Expression
(SAGE)
Sequence analysis revealed that HE4 shared sequence homology with
WAP proteins and therefore it was suggested that it is a antiproteinase!
They have no functional evidence.
Subsequently it was used as a epididymal marker and cloned from dog
and rabbit.
More recently HE4 was identified as a putative marker of human
ovarian tumour by the use of a variety of expression array studies.
These papers used three different techniques; DNA and
oligonucleotide arrays as well as Serial Analysis of Gene Expression
(SAGE)
http://www.embl-heidelberg.de/info/sage/
HE4 may be differentially expressed in distinct types of ovarian
cancers
The human WFDC protein locus on chromosome 20 contains at
least 15 genes
Most are completely uncharacterised (other than by PCR)
HE4
Searching ensembl with the term HE4 provides access
to expression and sequence information as well as to
gene structure predictions
Much of the data is derived from other public databases and represents
the “best effort” of the sequencing and annotation communities - hence
they are not always correct!
>ENST00000217425
CCTGCACCCCGCCCGGGCATAGCACCATGCCTGCTTGTCGCCTAGGCCCG
CTAGCCGCCGCCCTCCTCCTCAGCCTGCTGCTGTTCGGCTTCACCCTAGT
CTCAGGCACAGGAGCAGAGAAGACTGGCGTGTGCCCCGAGCTCCAGGCTG
ACCAGAACTGCACGCAAGAGTGCGTCTCGGACAGCGAATGCGCCGACAAC
CTCAAGTGCTGCAGCGCGGGCTGTGCCACCTTCTGCTCTCTGCCCAATGA
TAAGGAGGGTTCCTGCCCCCAGGTGAACATTAACTTTCCCCAGCTCGGCC
TCTGTCGGGACCAGTGCCAGGTGGACAGCCAGTGTCCTGGCCAGATGAAA
TGCTGCCGCAATGGCTGTGGGAAGGTGTCCTGTGTCACTCCCAATTTCTG
AGCTCCAGCCACCACCAGGCTGAGCAGTGAGGAGAGAAAGTTTCTGCCTG
GCCCTGCATCTGGTTCCAGCCCACCTGCCCTCCCCTTTTTCGGGACTCTG
TATTCCCTCTTGGGCTGACCACAGCTTCTCCCTTTCCCAACCAATAAAGT
AACCACTTTCAGCA
Output of blast analysis of HE4
vs human EST database
Mouse-over to show defline and scores. Click to show alignments
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A.
Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David
J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of
protein database search
programs", Nucleic Acids Res. 25:3389-3402.
RID: 1002902122-2668-16916
|HUMAN| 60 KCCSAGCATFCSLPN---------------------------------------------
|DOG| 59 KCCQAGCATICHLPN---------------------------------------------
|PIG| 59 KCCKAGCVTICQMPN---------------------------------------------
|RABBIT| 61 KCCRAGCSAICSIPN---------------------------------------------
|MOUSE| 60 KCCQAGCSSVCSKPNGPSEGELSGTDTKLSETGTTTQSAGLDHTTKPPGGQVSTKPPAVT
|RAT| 47 KCCQAGCGSVCSKNGPLSEGKLS-----RTATGTTTLSAGLARTSPLSRGQVSTKPPVVT
|HUMAN| 75 -------DKEGSCPQVNINFPQLGLCRDQCQVDSQCPGQMKCCRNGCGKVSCVTPNF
|DOG| 74 -------EKEGSCPQVNTDFPQLGLCQDQCQVDSHCPGLLKCCYNGCGKVSCVTPIF
|PIG| 74 -------EKEGSCPQVDIAFPQLGLCLDQCQVDSQCPGQLKCCRNGCGKVSCVTPVF
|RABBIT| 76 -------EKEGSCP--SIDFPQLGICQDLCQVDSQCPGKMKCCLNGCGKVSCVTPNF
|MOUSE| 120 REGLGVREKQGTCP--SVDIPKLGLCEDQCQVDSQCSGNMKCCRNGCGKMACTTPKF
|RAT| 102 KE-GGNGEKQGTCP--SVDFPKLGLCEDQCQMDSQCSGNMKCCRNGCGKMGCTTPKF
We characterised the gene and could show that the
human gene can undergo complex alternative splicing
which potentially generates a number of distinct protein
products (Oncogene, 2002)
a. 1 2 3b 3a 3 4b 4a 4 5
FL
127 124 >331 129 136 >128 290 153 162 V4
V1
* V2
V3
V1 * N WAP
V2 * C WAP SP
N -W A P
V4 *
C -W A P
U n iq ue
V3 *
We characterised the gene and could show that the
human gene can undergo complex alternative splicing
which potentially generates a number of distinct protein
products (Oncogene, 2002)
a. 1 2 3b 3a 3 4b 4a 4 5
FL
127 124 >331 129 136 >128 290 153 162 V4
V1
* V2
V3
V1 * N WAP
V2 * C WAP SP
N -W A P
V4 *
C -W A P
U n iq ue
V3 *
V1 * N WAP
V2 * C WAP
V4 *
V3 *
Gene identification now often uses a combination of
bioinfomatic and “wet” labaratory techniques.
But all of the genomics based information still required much additional
functional studies not least to determine if the protein has true
antiproteinase activities.