0% found this document useful (0 votes)

9 views

biologicaldatabase-190402034501

The document provides an overview of biological databases, which are collections of machine-readable records of biological data that can be accessed and modified. It classifies databases into primary, secondary, and composite types, detailing examples such as GenBank, DDBJ, and EMBL for nucleotide sequences, as well as Swiss-Prot and UniProt for protein sequences. Additionally, it discusses the significance of these databases in managing and sharing biological information for research purposes.

Uploaded by

benjaminkatiyo76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

biologicaldatabase-190402034501

Uploaded by

benjaminkatiyo76

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 26

BIOLOGICAL DATABASE

Dr. Nusaifa Beevi.P

Associate Professor & HOD,
PG Department of Botany,
Iqbal College, Peringammala
INTRODUCTION
 BIOLOGICAL DATABASES
 Collection of files containing records of
biological data in machine readable form
 Can be accessed, added, retrieved,
manipulated and modified
 Store, manage, connect and distribute
data
 Data are arranged by sets of rules which
are programmed into software that
manages the data called Database
Management System or DBMS.
Classification based on type of
data stored
 Primary Databases: Contain original data in
the form of primary sequence data or structural
data as submitted by the scientific community.
 Secondary Databases: Contain information
that has been processed and derived from the
raw data available in primary database.eg:
PROSITE, PRINTS, BLOCKS etc..
 Composite Databases: Collect and present
data after comparing and filtering them from
different primary databases and exhibit only
the non-redundant sequences
PRIMARY DATABASES
 Nucleic acid databases: Gen Bank,
EMBL,DDBJ

 Protein sequence databases: PIR, Swiss-

Prot, UNIPROT

 Protein structure database: PDB

 Metabolic databases: KEGG

Nucleotide sequence
database
 Composed of a group of nucleotide sequence
entries.
 Data repositories that accept nucleic acid
sequence data and make it freely available to
the public.
 GenBank, EMBL,DDBJ are principal
nucleotide databases.
 All the three are members of the
International Nucleotide Sequence
Database Consortium (INSDC) and
interchange data.
Gen Bank of NCBI
 Hosted by National Centre for Biotechnology
Information (NCBI), situated at the campus of US National
Institute of Health, USA.
 Gen Bank offers all publicly available nucleotide sequences,
their protein translation, and their annotated information.
 It also facilitate direct submission of sequence data by a
user friendly process.
 Researchers from anywhere can submit their data to Gen
Bank.
 An accession number is given to the submitted sequence
and then released to the public database after the quality
assurance check.
 This information can be retrieved using the Entrez
retrieval system.
 We can access the data in NCBI over the internet through
their site, http://www.ncbi.nlm.nih.gov/genbank
Home page of GenBank
DNA DATABANK OF JAPAN (DDBJ)
 Started in 1986, hosted now at National Institute of
Genetics, Japan.
 Gather data mainly from scientists in Japan and from
researchers all over the world.
 This can also share nucleotide sequence data with Gen
Bank and EMBL.
 About 99% of the nucleotide data in INSDC submitted by
Japanese researchers through DDBJ, and enhances the
quality of INSDC.
 It includes details of sequences, submitters details,
biological significance , and the scientific name and
taxonomy of the organism. In addition, features that
identify coding region, transcription units, mutation sites
etc. are also displayed in a feature table.
DDBJ Contd…..
 Major activities of the DDBJ include, providing
internationally recognized accession
numbers to sequences, bioinformatics
database management, developing tools for
the analysis and visualization of biological
data, and also conducting courses for
beginners to reduce the complexity in the
biological data analysis.
 DDBJ can be accessed through homepage,

http://www.ddbj.nig.ac.jp/.
DDBJ homepage
EMBL database
 European Molecular Biology Laboratory Nucleotide
Sequence Database, first established in 1974.
 Hosted at UK by the EMBL European Bioinformatics
Institute.
 EMBL is a non-profit research institution supported by 20
European countries and Australia, for Molecular Biology
Research.
 EMBL collects nucleotide sequence data from individual
researchers, genome sequence projects and patent
applications.
 Sequences are stored in this database as they would exist
in the biological state.
 The stored data correspond to wild type sequences
without mutation or genetic manipulation.
 Accessed through the URL, http://www.ebi.ac.uk/embl
EMBL homepage
PROTEIN SEQUENCE
DATABASES

 An array of amino acid sequence entries

arranged according to the identification
number.
 Well known protein sequence databases

available on www are

◦ Swiss-Prot
◦ PIR
◦ UNIPROT
Swiss-Prot
 Developed by the Swiss Institute of Bioinformatics (SIB)
and European Bioinformatics Institute(EBI).
 High quality, manually annotated protein sequence
database created in 1986.
 It provides high level annotations with functions of protein
and post transcriptional modifications.
 It provide all known relevant information about a
particular protein.
 Consists of two sections:- UniProt KB/Swiss-Prot, which is
manually annotated and is reviewed, and Uni
ProtKB/TrEMBL, which is automatically annotated and not
reviewed.
 Available at http://www.expasy.ch/sprot
Swiss-Prot homepage
PIR
 Protein Information Resource database
 Established in 1984, by National Biomedical Research
Foundation (NBRF).
 It is an integrated public bioinformatics resource that
support genomic and proteomic research, and scientific
studies.
 It assists researchers in the identification and
interpretation of protein sequence information.
 PIR can be searched for entries or sequence similarity
searches.
 Can be downloaded at
http://www.pir.georgetown.edu/.
 PIR offers a variety of resources mainly oriented to
assist the propagation and standardization of
protein annotation.
PIR homepage
UNIPROT
 It provide a comprehensive, high quality and
freely accessible resource of protein
sequence.
 Entries are derived from genome
sequencing projects.
 The Uniprot consortium comprises the
European Bioinformatics Institute(EBI),the
Swiss Institute of Bioinformatics(SIB), And
the Protein Information Resourse(PIR).
 Uniprot is composed of four components,

each optimized for different uses.

COMPONENTS OF UNIPROT
 1. UniProt Knowledge Base (UniProtKB)-
For extensive curated protein information
with two sections-UniProt KB/Swiss-
Prot, which is manually annotated and
is reviewed, and Uni ProtKB/TrEMBL,
which is automatically annotated and not
reviewed.
 2. UniProt Reference Clusters (UniRef)
 3. UniProt Archive (UniParc)
 4. UniProt Metagenomic and

Environmental Sequences (UniMes)

UNIPROT homepage
PROTEIN STRUCTURE DATABASE
 Many proteins which exhibit a common
evolutionary origin, show structural
similarities.
 Dissimilar proteins exhibit changes in

primary, secondary, teritiary and

quarternary structures.
 Similar or dissimilar protein structure

can be predicted with structure

database.
 These databases store a collection of

three dimensional structures of

PROTEIN DATA BANK (PDB)
 Understanding the shape of a molecule helps to
understand how it works.
 PDB is the main primary database used for the
prediction of 3D Structures of proteins and nucleic
acids.
 The single world wide archive of structural data.
 Maintained by the Research Collaboratory for
structural bioinformatics (RCSB)
 The data obtained from X-ray chrystallography and
NMR-spectroscopy, are submitted to the PDB.
 Then, these structures are annotated as per the
depositors specifications.
 Freely available and accessed through URL
http://www.pdb.org/
PDB homepage
MODEL ORGANISM
DATABASE
 MODs are also called Organism – specific

databases. They describe genome and other

information about well studied experimental
organisms in life sciences.
 They store large volumes of data and allow users
to analyse results and interpret datasets and data
they generated. ( organism of their own interest).
 Examples:
 Fly Base- database of Drosophilla melanogaster
 SGD- Sacharomyces Genome Database
 AGR- Arabidopsis Genome Resource
 HGP- Human Genome Project
 RGD- Rat Genome Database etc…
BIODIVERSITY DATABASE
 Provide information on the biodiversity of a particular area or
group of living organisms.
 They may store genus level information, species level
information, information on nomenclature or any
combination of the three.
 Species 2000
◦ Established in September 1994, by the International Union of
Biological Sciences(IUBS), in co-operation with the committee on
Data for science and technology(CODATA) and the International
Union of Microbiological Sciences(IUMS).
◦ It is a Federation of database organizations working closely with
users, taxonomists and sponsoring agencies.
◦ It plans to create an array of participant global species databases
covering each of the major groups of organisms(plants, animals,
fungi and microbes)
◦ The goal of species 2000 is to provide a uniform and validated
quality index of names of all known species for use as a practical
tool.
Thank You

Kami Export - CH 11 Reading Guide PDF
100% (3)
Kami Export - CH 11 Reading Guide PDF
14 pages
Biological Databases Lec 2,3
No ratings yet
Biological Databases Lec 2,3
49 pages
Biomolecules One Shot Bounceback
100% (1)
Biomolecules One Shot Bounceback
144 pages
Biotechnology and Genetic Engineering Summary PPT Igcse Caie
No ratings yet
Biotechnology and Genetic Engineering Summary PPT Igcse Caie
16 pages
Databases - Final
No ratings yet
Databases - Final
50 pages
Biological Data and Database
No ratings yet
Biological Data and Database
13 pages
6.1 Bioinformatics Databases and Tools - Introduction: Lecture 6: December, 28, 2001
No ratings yet
6.1 Bioinformatics Databases and Tools - Introduction: Lecture 6: December, 28, 2001
31 pages
Biological Data Bases
No ratings yet
Biological Data Bases
36 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
Module 2 Biodata
No ratings yet
Module 2 Biodata
36 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
Databases Bioinformatics
No ratings yet
Databases Bioinformatics
42 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Unit II Bioinformatics
No ratings yet
Unit II Bioinformatics
25 pages
Databases 2025
No ratings yet
Databases 2025
50 pages
Bioinformatics Databases
No ratings yet
Bioinformatics Databases
10 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
Database
No ratings yet
Database
40 pages
Biological Databases Genbank
No ratings yet
Biological Databases Genbank
31 pages
Presentation 11
No ratings yet
Presentation 11
20 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Biological Information on Artificial Intelligence
No ratings yet
Biological Information on Artificial Intelligence
20 pages
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
No ratings yet
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
105 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
Anjali-1
No ratings yet
Anjali-1
16 pages
Sequence and Structure Retrieval
No ratings yet
Sequence and Structure Retrieval
9 pages
CH12
No ratings yet
CH12
8 pages
Unit II Major Databases in Bioinformatics
No ratings yet
Unit II Major Databases in Bioinformatics
54 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
Lecture 4 Nucleic Acid Sequence Database
No ratings yet
Lecture 4 Nucleic Acid Sequence Database
21 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Nucleic_Acid_Databases
No ratings yet
Nucleic_Acid_Databases
37 pages
Lecture 2 Introduction To The Computational Tools
No ratings yet
Lecture 2 Introduction To The Computational Tools
15 pages
Day 1
No ratings yet
Day 1
38 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
Lecture 3-Uniprot-Biological Information Repository.
No ratings yet
Lecture 3-Uniprot-Biological Information Repository.
15 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
Unit-2 (2)
No ratings yet
Unit-2 (2)
36 pages
Adv Bi Unit 1
No ratings yet
Adv Bi Unit 1
39 pages
Protein Database
No ratings yet
Protein Database
3 pages
M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
Bioinformatics. CH 3 Databases (Summarized Notes)
50% (2)
Bioinformatics. CH 3 Databases (Summarized Notes)
5 pages
Bioinformatics Day2
No ratings yet
Bioinformatics Day2
3 pages
Lecture Topic: Protein Databases: Topics Covered
No ratings yet
Lecture Topic: Protein Databases: Topics Covered
67 pages
Introduction To Databases - NCBI, PDB and Uniprot
No ratings yet
Introduction To Databases - NCBI, PDB and Uniprot
5 pages
Biological Database
No ratings yet
Biological Database
8 pages
Data Base in Bioinformatics
No ratings yet
Data Base in Bioinformatics
30 pages
Bioinfo Lecture 2
No ratings yet
Bioinfo Lecture 2
29 pages
CMSC 838T - Lecture 9: Bioinformatics Databases
No ratings yet
CMSC 838T - Lecture 9: Bioinformatics Databases
65 pages
Lecture 5- DataBase
No ratings yet
Lecture 5- DataBase
18 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Rese Rach
No ratings yet
Rese Rach
37 pages
module 4 merged
No ratings yet
module 4 merged
283 pages
9. Biological Databases
No ratings yet
9. Biological Databases
17 pages
Biological Databases PDF
No ratings yet
Biological Databases PDF
13 pages
Biological Databases
No ratings yet
Biological Databases
13 pages
UNIT II
No ratings yet
UNIT II
23 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
WhatsApp Chat With Mahima~
No ratings yet
WhatsApp Chat With Mahima~
3 pages
Forest Biodiversity Conservation Book Chapter
No ratings yet
Forest Biodiversity Conservation Book Chapter
26 pages
Tomorrow PDF
No ratings yet
Tomorrow PDF
7 pages
bacterialgrowthcurve-200719160626
No ratings yet
bacterialgrowthcurve-200719160626
14 pages
blast-170122070200
No ratings yet
blast-170122070200
22 pages
Craig's Area
No ratings yet
Craig's Area
2 pages
Vricella2017
No ratings yet
Vricella2017
20 pages
Design and Fabrication of Major Components of Turbojet Engine
No ratings yet
Design and Fabrication of Major Components of Turbojet Engine
7 pages
Biology - Enzymes Grade 11
No ratings yet
Biology - Enzymes Grade 11
20 pages
Micro (Volume 1)
No ratings yet
Micro (Volume 1)
67 pages
Resporation Worksheet
No ratings yet
Resporation Worksheet
6 pages
1098D Pharm 313 Mabbayad-John-Nikko PDF
No ratings yet
1098D Pharm 313 Mabbayad-John-Nikko PDF
2 pages
Lecture 13 Genes and How They Work
No ratings yet
Lecture 13 Genes and How They Work
31 pages
Regulation of Erythropoiesis
No ratings yet
Regulation of Erythropoiesis
3 pages
Nondestructive DNA Extraction From Sperm Whale Teeth and Scrimshaw
No ratings yet
Nondestructive DNA Extraction From Sperm Whale Teeth and Scrimshaw
4 pages
Probiotics Mechanism of Action
No ratings yet
Probiotics Mechanism of Action
7 pages
WBCS ANTHROPOLOGY Syllabus
No ratings yet
WBCS ANTHROPOLOGY Syllabus
8 pages
Characterization of Intact Protein and Acid Hydrolyzate From Casein Using Color Reaction Tests
No ratings yet
Characterization of Intact Protein and Acid Hydrolyzate From Casein Using Color Reaction Tests
7 pages
As 104 Lecture Notes Protein Degradation
No ratings yet
As 104 Lecture Notes Protein Degradation
32 pages
Biotechnology and Its Applications
No ratings yet
Biotechnology and Its Applications
1 page
Protein Lab Report
No ratings yet
Protein Lab Report
9 pages
CHLOROPLAST
No ratings yet
CHLOROPLAST
4 pages
B.sc. (H) Zoology 4th Semester 2017
No ratings yet
B.sc. (H) Zoology 4th Semester 2017
25 pages
Protein Metabolism Overview, Animation
No ratings yet
Protein Metabolism Overview, Animation
21 pages
2017 Book SH2Domains
No ratings yet
2017 Book SH2Domains
546 pages
Statement of purpose Institute of molecular medicine
No ratings yet
Statement of purpose Institute of molecular medicine
2 pages
DNA Replication
No ratings yet
DNA Replication
3 pages
Naturally Occurring Allele Diversity Allows Potato Cultivation in Northern Latitudes
No ratings yet
Naturally Occurring Allele Diversity Allows Potato Cultivation in Northern Latitudes
7 pages
TB_4647feedback_668e35918d1394.668e3593154317.03598962
No ratings yet
TB_4647feedback_668e35918d1394.668e3593154317.03598962
6 pages
AP Biology Practice Test 15 Interactions Lead To Complex Properties
No ratings yet
AP Biology Practice Test 15 Interactions Lead To Complex Properties
6 pages
Guide To Electropherogram v3
No ratings yet
Guide To Electropherogram v3
12 pages
Lesson Plan Activity Sheet
No ratings yet
Lesson Plan Activity Sheet
3 pages
Practice Biology Mid-Term Test
No ratings yet
Practice Biology Mid-Term Test
6 pages
CH 11 Biotechnology Principle & Process
No ratings yet
CH 11 Biotechnology Principle & Process
12 pages
2.4 Protein - Haemoglobin and Collagen
No ratings yet
2.4 Protein - Haemoglobin and Collagen
36 pages

biologicaldatabase-190402034501

Uploaded by

biologicaldatabase-190402034501

Uploaded by

BIOLOGICAL DATABASE

Dr. Nusaifa Beevi.P

 Protein sequence databases: PIR, Swiss-

 Protein structure database: PDB

 Metabolic databases: KEGG

 An array of amino acid sequence entries

available on www are

each optimized for different uses.

Environmental Sequences (UniMes)

primary, secondary, teritiary and

can be predicted with structure

three dimensional structures of

databases. They describe genome and other

You might also like