Slides 1

Uploaded by

Phlip Ong

Me and my bioinformatics hat!

Dr Colin Bingle
Senior Lecturer
Academic Unit of Respiratory Medicine
LU108
I am a respiratory cell biologist.

I am interested in components of the pulmonary innate immune response, in normal and diseased lung.

My studies involve understanding the following questions:

• Why are certain genes expressed in certain cells?

• What factors control the expression of these genes?
• How are these genes processed?
• What do the gene products do?
• These are all questions that involve gene hunting,
transcriptional and proteomic analysis
Genome is fixed – Cells are dynamic
• A genome is static
– Every cell in our body has a copy of same genome

• A cell is dynamic
– Responds to external conditions
– Most cells follow a cell cycle of division

• Cells differentiate during development and can alter their transcriptome/proteome
1. Gene Hunting
2. Transcriptome and Proteome
Outline: I will use examples of my own research to illustrate the ways in which genes are increasingly identified and studied.
Gone (essentially) are the days when studies were initiated from traditional experimental techniques.
Genes are now predominantly identified based on where and when they are expressed, or by purely bioinformatic approaches.
We will study how genes are identified (basic techniques).
What constitutes the transcriptome and the proteome, how each varies and how it can be studied.
You will see how such techniques can be used in individual projects.
The six steps at which eukaryotic gene expression can be controlled can be divided between the genomic/transcriptomic and the proteomic levels.
Genomics, transcriptomics and proteomics should be considered as different spokes of the same wheel: information from each provides support to the others.
[Figure: wheel with spokes labelled Genomics, Transcriptomics and Proteomics]
My work principally involves the identification and study of novel genes and the comparative analysis of well-established genes.

Such genes have been identified through a variety of approaches. These include:

• EST analysis
• SAGE and array studies
• Proteomics
• De novo gene predictions followed by cloning

Do not trust everything you read

There is much to learn and many genes to discover!

AN EXERCISE IN FUNCTIONAL GENOMICS: THE
BIOLOGY OF HE4
For the past few years we have been studying the biology of two low molecular weight antiproteinases which play a role in host defence in the airways.
These proteins, elafin and secretory leukocyte proteinase inhibitor (SLPI), are members of the whey acidic protein (WAP)/four-disulphide core family of proteins.
WAP domains comprise units of ≈50 amino acids that include 8 conserved cysteines.
WAP proteins are typically small secretory proteins.
In SLPI and elafin the precise arrangement of the WAP domain confers the antiproteinase activity.
SLPI and elafin are co-localised within 75 kb of each other on chromosome 20.
Because of this, we have been looking to see whether they are co-ordinately regulated.
In searching the surrounding regions of chromosome 20 we were able to locate a number of additional WAP domain-containing proteins, including HE4.
WAP domains are present in multiple proteins - and
individual proteins can contain more than 1 domain
The human WFDC protein locus on chromosome 20 contains at
least 15 genes
Most are completely uncharacterised (other than by PCR)

HE4
De novo gene prediction
Output often requires manual annotation and editing!

However, it can be used as the basis for subsequent PCR and cloning.
HE4 was identified in 1991 as a human epididymis-specific cDNA by the use of differential screening.
Sequence analysis revealed that HE4 shared sequence homology with WAP proteins, and it was therefore suggested to be an antiproteinase!
There was no functional evidence for this.
Subsequently it was used as an epididymal marker and cloned from dog and rabbit.
More recently HE4 was identified as a putative marker of human ovarian tumour by the use of a variety of expression array studies.
These papers used three different techniques: DNA and oligonucleotide arrays, as well as Serial Analysis of Gene Expression (SAGE).

These unbiased approaches all identify genes based on differential expression in tissues or cells.
A DNA microarray can allow us to observe a genome’s gene expression program.
Each cell in our bodies expresses a specific set of genes according to a precisely
controlled genetic script that gives that cell its distinctive design and functional
capabilities.
The gene expression program that unfolds during a developmental or physiological
or pathological process can be read as a kind of a script for that process.
Uses known or unknown cDNA or oligo sequence
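
As a rough illustration of how array data are read for differential expression, the sketch below ranks genes by the log2 ratio of their signal in two samples. The gene names and intensity values are invented for illustration, and the pseudocount and the two-fold threshold are assumptions rather than anything prescribed in these slides.

```python
import math

# Hypothetical spot intensities (arbitrary units) for two samples.
# Gene names and values are invented for illustration only.
intensities = {
    "gene_A": {"tumour": 1500.0, "normal": 100.0},
    "gene_B": {"tumour": 210.0, "normal": 200.0},
    "gene_C": {"tumour": 80.0, "normal": 640.0},
}


def log2_ratio(test, reference, pseudocount=1.0):
    """Return log2((test + pseudocount) / (reference + pseudocount)).

    The pseudocount avoids division by zero for spots with no signal.
    """
    return math.log2((test + pseudocount) / (reference + pseudocount))


def differentially_expressed(data, threshold=1.0):
    """Return (gene, log2 ratio) pairs whose absolute log2 ratio meets the threshold.

    A threshold of 1.0 corresponds to a two-fold change in either direction.
    """
    hits = []
    for gene, values in data.items():
        ratio = log2_ratio(values["tumour"], values["normal"])
        if abs(ratio) >= threshold:
            hits.append((gene, ratio))
    # Largest changes first, regardless of direction.
    return sorted(hits, key=lambda item: abs(item[1]), reverse=True)


if __name__ == "__main__":
    for gene, ratio in differentially_expressed(intensities):
        print(f"{gene}: log2 ratio = {ratio:+.2f}")
```

A real analysis would also need normalisation and replicate-based statistics; this only shows the basic comparison behind "differential expression".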
Serial Analysis of Gene Expression, or SAGE, is
also designed to gain a quantitative measure of gene
expression. The SAGE technique itself includes several
steps utilizing molecular biological, DNA sequencing
and bioinformatics techniques. These steps are
used to produce short sequence "tags", which are then
matched against sequence databases to assign gene descriptions.
Generates data on unknown sequences
Three principles underlie the SAGE methodology:
1) A short sequence tag (10-14 bp) contains sufficient information to uniquely identify a transcript, provided that the tag is obtained from a unique position within each transcript;
2) Sequence tags can be linked together to form long serial molecules that can be cloned and sequenced; and
3) Quantitation of the number of times a particular tag is observed provides the expression level of the corresponding transcript.

http://www.embl-heidelberg.de/info/sage/
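
The three principles can be sketched in a few lines of Python. This is a toy model, not the laboratory protocol: it assumes the tag is the 10 bp immediately 3' of the last CATG site in each transcript (CATG is the recognition site of NlaIII, a commonly used SAGE anchoring enzyme), and the function names are hypothetical.

```python
from collections import Counter

ANCHOR_SITE = "CATG"  # assumed anchoring-enzyme site (NlaIII)
TAG_LENGTH = 10       # assumed tag length, within the 10-14 bp range


def sage_tag(transcript, anchor=ANCHOR_SITE, tag_length=TAG_LENGTH):
    """Return the tag immediately 3' of the last anchor site, or None.

    Taking the 3'-most site gives every copy of a transcript the same,
    uniquely positioned tag (principle 1).
    """
    sequence = transcript.upper()
    site = sequence.rfind(anchor)
    if site == -1:
        return None  # no anchor site: this transcript yields no tag
    start = site + len(anchor)
    tag = sequence[start:start + tag_length]
    return tag if len(tag) == tag_length else None


def count_tags(transcripts):
    """Count how often each tag is observed (principle 3).

    In the real method the tags are concatenated and sequenced in series
    (principle 2); here a plain list of sequences stands in for that step.
    """
    tags = (sage_tag(t) for t in transcripts)
    return Counter(tag for tag in tags if tag is not None)
```

Calling `count_tags` on a list of transcript sequences returns a `Counter` mapping each tag to the number of times it was seen, which is the quantity SAGE uses as the expression level.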
HE4 may be differentially expressed in distinct types of ovarian
cancers
Searching Ensembl with the term HE4 provides access to expression and sequence information as well as to gene structure predictions.

Much of the data is derived from other public databases and represents the “best effort” of the sequencing and annotation communities; hence it is not always correct!

Transcript cDNA Sequence

Total length: 564 bp No. Exons: 4

>ENST00000217425
CCTGCACCCCGCCCGGGCATAGCACCATGCCTGCTTGTCGCCTAGGCCCG
CTAGCCGCCGCCCTCCTCCTCAGCCTGCTGCTGTTCGGCTTCACCCTAGT
CTCAGGCACAGGAGCAGAGAAGACTGGCGTGTGCCCCGAGCTCCAGGCTG
ACCAGAACTGCACGCAAGAGTGCGTCTCGGACAGCGAATGCGCCGACAAC
CTCAAGTGCTGCAGCGCGGGCTGTGCCACCTTCTGCTCTCTGCCCAATGA
TAAGGAGGGTTCCTGCCCCCAGGTGAACATTAACTTTCCCCAGCTCGGCC
TCTGTCGGGACCAGTGCCAGGTGGACAGCCAGTGTCCTGGCCAGATGAAA
TGCTGCCGCAATGGCTGTGGGAAGGTGTCCTGTGTCACTCCCAATTTCTG
AGCTCCAGCCACCACCAGGCTGAGCAGTGAGGAGAGAAAGTTTCTGCCTG
GCCCTGCATCTGGTTCCAGCCCACCTGCCCTCCCCTTTTTCGGGACTCTG
TATTCCCTCTTGGGCTGACCACAGCTTCTCCCTTTCCCAACCAATAAAGT
AACCACTTTCAGCA
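
As a quick sanity check on a database transcript such as this one, the sketch below finds the first ATG and translates up to the first in-frame stop codon using the standard genetic code. Treating the first ATG as the start codon is a simplifying assumption (it is not how Ensembl annotates coding sequences), and the function and variable names are illustrative only.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Transcript ENST00000217425 exactly as listed above.
HE4_CDNA = "".join([
    "CCTGCACCCCGCCCGGGCATAGCACCATGCCTGCTTGTCGCCTAGGCCCG",
    "CTAGCCGCCGCCCTCCTCCTCAGCCTGCTGCTGTTCGGCTTCACCCTAGT",
    "CTCAGGCACAGGAGCAGAGAAGACTGGCGTGTGCCCCGAGCTCCAGGCTG",
    "ACCAGAACTGCACGCAAGAGTGCGTCTCGGACAGCGAATGCGCCGACAAC",
    "CTCAAGTGCTGCAGCGCGGGCTGTGCCACCTTCTGCTCTCTGCCCAATGA",
    "TAAGGAGGGTTCCTGCCCCCAGGTGAACATTAACTTTCCCCAGCTCGGCC",
    "TCTGTCGGGACCAGTGCCAGGTGGACAGCCAGTGTCCTGGCCAGATGAAA",
    "TGCTGCCGCAATGGCTGTGGGAAGGTGTCCTGTGTCACTCCCAATTTCTG",
    "AGCTCCAGCCACCACCAGGCTGAGCAGTGAGGAGAGAAAGTTTCTGCCTG",
    "GCCCTGCATCTGGTTCCAGCCCACCTGCCCTCCCCTTTTTCGGGACTCTG",
    "TATTCCCTCTTGGGCTGACCACAGCTTCTCCCTTTCCCAACCAATAAAGT",
    "AACCACTTTCAGCA",
])


def translate_first_orf(cdna):
    """Translate from the first ATG to the first in-frame stop codon."""
    cdna = cdna.upper()
    start = cdna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for pos in range(start, len(cdna) - 2, 3):
        # Unknown codons (e.g. containing N) are reported as X.
        residue = CODON_TABLE.get(cdna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    protein = translate_first_orf(HE4_CDNA)
    print(len(HE4_CDNA), "bp;", len(protein), "amino acids")
    print(protein)
```

The translation begins MPACRLGP and ends CVTPNF, matching the human sequence in the multiple alignment later in these slides, which is a useful cross-check that the database entry and the alignment describe the same protein.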
Output of BLAST analysis of HE4 vs the human EST database

BLASTN 2.2.1 [Apr-13-2001]

Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

RID: 1002902122-2668-16916

Query= gi|32050|emb|X63187.1|HSHE4MR H.sapiens HE4 mRNA for extracellular
proteinase inhibitor homologue (583 letters)

Database: GenBank Human EST entries
3,832,541 sequences; 1,821,805,599 total letters

Distribution of 194 Blast Hits on the Query Sequence
EST analysis of non-human databases allows the identification of HE4 sequences from mouse, rat and pig.

Multiple alignment reveals a high degree of sequence similarity between species. It is also clear that the two WAP domains in the mouse and rat proteins are separated by a “linker” region not found in the other species. This may have functional relevance.
|HUMAN| 1 MPACRLGPLAAALLLSLLLFG-FTLVSGTGAEKTGVCPELQADQNCTQECVSDSECADNL
|DOG| 1 MPASRPGPLAGALLLGLLLG--LPRVPGTEVEKPGVCPQVSVDLNCTQDCVSDAQCADNL
|PIG| 1 MPACRLGLLVASLLLGLLLG--LPPPTGTGAEKSGVCPAVEVDMNCTQECLSDADCADNL
|RABBIT| 1 MPASRLVLLGAVLLLGLLLLLELPPVTGTGADKPGVCPQVSVDLNCTQDCRADQDCAENL
|MOUSE| 1 MPACRPCLLAAGLLLGLLCGT-PISATGTDAEKPGECPQVEPITDCVLDCTLDKDCADNR
|RAT| 1 -------------LLGLLLFT-PLSATGTRAEKPGVCPQVEPITDCVKACILDNDCQDNY

|HUMAN| 75 -------DKEGSCPQVNINFPQLGLCRDQCQVDSQCPGQMKCCRNGCGKVSCVTPNF
|DOG| 74 -------EKEGSCPQVNTDFPQLGLCQDQCQVDSHCPGLLKCCYNGCGKVSCVTPIF
|PIG| 74 -------EKEGSCPQVDIAFPQLGLCLDQCQVDSQCPGQLKCCRNGCGKVSCVTPVF
|RABBIT| 76 -------EKEGSCP--SIDFPQLGICQDLCQVDSQCPGKMKCCLNGCGKVSCVTPNF
|MOUSE| 120 REGLGVREKQGTCP--SVDIPKLGLCEDQCQVDSQCSGNMKCCRNGCGKMACTTPKF
|RAT| 102 KE-GGNGEKQGTCP--SVDFPKLGLCEDQCQMDSQCSGNMKCCRNGCGKMGCTTPKF
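
The conservation visible in the alignment can be quantified directly. The sketch below takes the second alignment block as shown and reports the columns where every species has a cysteine, plus percent identity to the human sequence. The function names, and the choice to skip columns where either sequence has a gap, are assumptions made for illustration.

```python
# Second alignment block, copied from the rows above (gaps shown as "-").
ALIGNMENT = {
    "HUMAN":  "-------DKEGSCPQVNINFPQLGLCRDQCQVDSQCPGQMKCCRNGCGKVSCVTPNF",
    "DOG":    "-------EKEGSCPQVNTDFPQLGLCQDQCQVDSHCPGLLKCCYNGCGKVSCVTPIF",
    "PIG":    "-------EKEGSCPQVDIAFPQLGLCLDQCQVDSQCPGQLKCCRNGCGKVSCVTPVF",
    "RABBIT": "-------EKEGSCP--SIDFPQLGICQDLCQVDSQCPGKMKCCLNGCGKVSCVTPNF",
    "MOUSE":  "REGLGVREKQGTCP--SVDIPKLGLCEDQCQVDSQCSGNMKCCRNGCGKMACTTPKF",
    "RAT":    "KE-GGNGEKQGTCP--SVDFPKLGLCEDQCQMDSQCSGNMKCCRNGCGKMGCTTPKF",
}


def conserved_columns(alignment, residue="C"):
    """Return 0-based column indices where every sequence has `residue`."""
    rows = list(alignment.values())
    return [
        index
        for index, column in enumerate(zip(*rows))
        if all(aa == residue for aa in column)
    ]


def percent_identity(seq_a, seq_b):
    """Percent identity over columns where neither sequence has a gap."""
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    print("Conserved cysteine columns:", conserved_columns(ALIGNMENT))
    for species, sequence in ALIGNMENT.items():
        if species != "HUMAN":
            identity = percent_identity(ALIGNMENT["HUMAN"], sequence)
            print(f"HUMAN vs {species}: {identity:.1f}% identity")
```

On this block the function finds eight fully conserved cysteine columns, consistent with the eight conserved cysteines expected of a WAP domain.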
We characterised the gene and could show that the
human gene can undergo complex alternative splicing
which potentially generates a number of distinct protein
products (Oncogene, 2002)

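[Figure: (a) exon map of the human HE4 gene (exons 1, 2, 3b, 3a, 3, 4b, 4a, 4 and 5; labelled sizes 127, 124, >331, 129, 136, >128, 290, 153 and 162) showing the full-length (FL) transcript and splice variants V1 to V4, with their predicted protein products (SP, N-WAP, C-WAP and unique regions).]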

Each of these will likely have a distinct function!

In fact it may be more complicated than this!
The HE4 gene probably has at least 3 promoter regions

Each promoter region will be differentially regulated
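[Figure: (a) HE4 exon map (exons 1, 2, 3b, 3a, 3, 4b, 4a, 4 and 5; labelled sizes 127, 124, >331, 129, 136, >128, 290, 153 and 162) with splice variants V1 to V4 and their N-WAP and C-WAP domains.]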
Gene identification now often uses a combination of bioinformatic and “wet” laboratory techniques.

These rely on genomic, transcriptomic and proteomic methodologies.
These studies have added greatly to the understanding of the function of HE4.

But all of this genomics-based information still requires many additional functional studies, not least to determine whether the protein has true antiproteinase activity.

We also require information on the relative levels of expression of the different isoforms, as well as specific functional information on them.
In this study the authors set out to identify mediators
found in the vomeronasal organ (VNO), located at the
base of the nasal septum, responsible for mediating
pheromone information in mice
• It was known that something in soiled bedding could induce gene expression (monitored using the transcription factor c-fos) in the female VNO.
• Such activity was gland specific and could be purified using HPLC
• The protein identified was not previously described
• This shows a recent example of the identification of a novel family of completely unknown proteins present within the “finished” and annotated human genome.
• There are still more to be discovered
