Lecture 5- DataBase

Uploaded by

aletimanaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Lecture 5- DataBase

Uploaded by

aletimanaswini

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 18

ISC 211 Introduction

to Bioinformatics
Lecture 5 – Bioinformatics DataBase
Dr. Athira B
Asst. Professor, CSE
IIIT Kottayam
Motivation
• Key concept in Molecular Biology is the information flow
DNA →RNA→ Protein
• From a data point of view: we have multiple omic data:
Genomics → Trancriptomics → Proteomic → Metabolomisc
• This vast amount of data needs to be stored and organized for easy
access around the globe
Motivation-Human Genome Project
• A landmark global scientific effort whose signature goal was to
generate the first sequence of the human genome (almost all genes in
human)
• Identified 1,00,000 genes in DNA
• more than 3 Billion base pairs were extracted
• The goals were:
• Alert patients that are at risk of certain diseases
• Reliably predict course of disease
• Precise diagnose and treatment
• Developing new treatments at molecular level
• Milestone in Biomedical Research
• https://www.genome.gov/about-genomics/educational-resources/
fact-sheets/human-genome-project.
Motivation-Biological Big Data
• Advancement in sequencing techniques generated good amount of
Biological data
• Similar to human, genetic data of other model organisms are also
generated:
• Yeast (Saccharomyces cerevisiae)
• Fruit fly (Drosophila melanogaster)
• Nematode worm (Caenorhabditis elegans)
• Western clawed frog (Xenopus tropicalis)
• Mouse (Mus musculus)
• Zebrafish (Danio rerio)
• How to store these data so that researchers can easily retrieve data
efficiently
Databases
• Database stores and organizes related data for easy retrieval
Eg: Your Phone contact book
• Most common form of Database is relational database (SQL)
• There are many other databases- column databases, graph databases,
etc
• Biological databases stores biological data and associated knowledge
• These knowledge bases are fundamentals to the survival of science
Biological Databases
• Store and handle the staggering volume of Biological information
through the establishment and use of computer databases
• Current biological databases use all three types of database
structures: flat files, relational, and object oriented
• Based on their contents, biological databases can be roughly divided
into three categories: primary databases, secondary databases, and
specialized databases.
Primary Databases
• Contain original biological data. They are archives of raw sequence or
structural data submitted by the scientific community
• GenBank, the European Molecular Biology Laboratory (EMBL)
database, Protein Data Bank (PDB) and the DNA Data Bank of
Japan (DDBJ)
Secondary Databases
• Secondary databases contain computationally processed or manually
curated information, based on original information from primary
databases.
• Translated protein sequence databases containing functional
annotation belong to this category
SWISS-PROT
Specialized Databases
• Specialized databases normally serve a specific research community
or focus on a particular organism
• The content of these databases may be sequences or other types of
information
• Examples include Flybase, WormBase, AceDB, Microarray gene
expression database, and TAIR
Composite Databases
• Variety of primary databases combined
• One place for different primary databases
Information Retrieval from Biological
Databases
• The most popular retrieval systems for biological databases are
Entrez and Sequence Retrieval Systems (SRS)
• Join a series of keywords using logical terms such as AND, OR, and
NOT to indicate relationships between the keywords used in a search
• Entrez3, a biological database retrieval system by NCBI
• For a complex search, a user can use the Boolean operators
• Online Mendelian Inheritance in Man (OMIM) accessible from Entrez,
which is a non-sequence-based database of human disease genes and
human genetic disorders
GenBank
• GenBank is the most complete collection of annotated nucleic acid
sequence data for almost every organism.
• The content includes genomic DNA, mRNA, cDNA, ESTs, high
throughput raw sequence data, and sequence polymorphisms
• There is also a GenPept database for protein sequences
GenBank: Sequence Format
Header
• origin of the sequence, identification of organism, unique identifiers
• Locus: unique database identifier
• Sequence length and molecule type(DNA or RNA)
• Three-letter code eg: PLN for plant, BCT for bacteria…
• Definition : name of the sequence, name and source of organism,
whether sequence is partial or complete
• Accession number : number cited in publications
• Version number : to identify the current version, if the sequence is
revised at a later stage
• Organism: source of organism with the scientific name of the species
• Reference : author and title information, contact information
Gene information
• Features : annotation information
• Source: length of sequence, scientific name of organism
• Gene : nucleotide coding sequence and its name
• CDS : information about boundaries of the sequence that can be
translated into amino acids. For eukaryotic, locaton of exons also
mentioned
DNA SEQUENCE
• ORIGIN: sequence itself; ends with two forward slashes (“//”)

• In retrieving the DNA sequence, search can be limited to “organism”,

“accession number”, “author”, “publication date”.
Fasta: Sequence Format
Reading Assignment
• Read more on Biological Databases:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4411498/
[ZMYZ15]
• Practice: explore various databases
• Assessment
• Bring your laptops
• Explore Entrez: https://www.ncbi.nlm.nih.gov/search/
• Explore NCBI databases
• Read Chapter 2, Essential Bioinformatics by Jin Xiong[Xio06]

3 Physiology Notes PDF
100% (3)
3 Physiology Notes PDF
119 pages
Biological Databases Lec 2,3
No ratings yet
Biological Databases Lec 2,3
49 pages
Sec1 Introduction to Bioinformatics
No ratings yet
Sec1 Introduction to Bioinformatics
20 pages
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
No ratings yet
Bioinformatics and Omics Topic: Database and Biological Database With Examples Assignment-3
5 pages
Bioinformatics Lecture Notes Database
No ratings yet
Bioinformatics Lecture Notes Database
28 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
Module 2 (Bioinformatics)
No ratings yet
Module 2 (Bioinformatics)
81 pages
Bio PPT
No ratings yet
Bio PPT
35 pages
Introduction To Bioinformatics (Databases)
No ratings yet
Introduction To Bioinformatics (Databases)
28 pages
Biological Databases: - Bio-Informatics
No ratings yet
Biological Databases: - Bio-Informatics
16 pages
Bioinformatics Biological Database
No ratings yet
Bioinformatics Biological Database
31 pages
Capture D'écran . 2023-03-14 À 00.15.22
No ratings yet
Capture D'écran . 2023-03-14 À 00.15.22
54 pages
BCH 516-1
No ratings yet
BCH 516-1
32 pages
Bioinformatics PPT Section B Data Storage and Retrival Group 3
No ratings yet
Bioinformatics PPT Section B Data Storage and Retrival Group 3
36 pages
Bio in For Ma Tics
No ratings yet
Bio in For Ma Tics
52 pages
UNIT II
No ratings yet
UNIT II
23 pages
Bioinformatics Database and Applications
100% (3)
Bioinformatics Database and Applications
82 pages
M Lec 01 & 02 Biological Database
No ratings yet
M Lec 01 & 02 Biological Database
50 pages
2024.HF_BioInformatics_Lec3p
No ratings yet
2024.HF_BioInformatics_Lec3p
11 pages
Generating Structural Data Analysis
No ratings yet
Generating Structural Data Analysis
8 pages
Day 1
No ratings yet
Day 1
38 pages
Database
No ratings yet
Database
40 pages
Tics - A Brief Introduction
No ratings yet
Tics - A Brief Introduction
4 pages
Databases in Bioinformatics - An Introduction
No ratings yet
Databases in Bioinformatics - An Introduction
11 pages
Lec2 Databases
No ratings yet
Lec2 Databases
135 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Online Biological Databases: A/Prof. Ly Le
No ratings yet
Online Biological Databases: A/Prof. Ly Le
64 pages
9. Biological Databases
No ratings yet
9. Biological Databases
17 pages
Essential Info Notes-1
No ratings yet
Essential Info Notes-1
57 pages
Database
No ratings yet
Database
16 pages
CH12
No ratings yet
CH12
8 pages
Bioinfo U2 KD 2
No ratings yet
Bioinfo U2 KD 2
3 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
#1 L1 BioDatabases
No ratings yet
#1 L1 BioDatabases
89 pages
Bio in For Matics
No ratings yet
Bio in For Matics
26 pages
"MBG1002 Biological Databases Week II
No ratings yet
"MBG1002 Biological Databases Week II
37 pages
Biological Database 1
No ratings yet
Biological Database 1
50 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
4Bioinformaticsdatabases
No ratings yet
4Bioinformaticsdatabases
71 pages
Bioinformatics Lab Notebook: Comsats University, Islamabad
No ratings yet
Bioinformatics Lab Notebook: Comsats University, Islamabad
27 pages
Biol BDs Singapore
No ratings yet
Biol BDs Singapore
24 pages
BIOINFORMATICS - eNOTES
No ratings yet
BIOINFORMATICS - eNOTES
23 pages
Bif501 Handouts PDF Bif
No ratings yet
Bif501 Handouts PDF Bif
197 pages
8024 Bio Info
No ratings yet
8024 Bio Info
28 pages
Bioinformatics Overview
100% (1)
Bioinformatics Overview
18 pages
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
No ratings yet
FALLSEM2019-20 BIT2001 ETH VL2019201000690 Reference Material I 11-Jul-2019 Unit I New
48 pages
Basics of Bioinformatics in Biological Research
No ratings yet
Basics of Bioinformatics in Biological Research
5 pages
Biological Databases: DR Z Chikwambi Biotechnology
No ratings yet
Biological Databases: DR Z Chikwambi Biotechnology
47 pages
Lab 1
No ratings yet
Lab 1
39 pages
المحاضرة 2
No ratings yet
المحاضرة 2
16 pages
1. Databases
No ratings yet
1. Databases
34 pages
Bioinformatics
No ratings yet
Bioinformatics
47 pages
A Review Article On Bioinformatics Tools and Software
No ratings yet
A Review Article On Bioinformatics Tools and Software
14 pages
Biological Databases (1)
No ratings yet
Biological Databases (1)
41 pages
Data Base in Bioinformatics
No ratings yet
Data Base in Bioinformatics
30 pages
CMSC 838T - Lecture 9: Bioinformatics Databases
No ratings yet
CMSC 838T - Lecture 9: Bioinformatics Databases
65 pages
Nucleic_Acid_Databases
No ratings yet
Nucleic_Acid_Databases
37 pages
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
No ratings yet
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
105 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Bioinformatics Unveiled
From Everand
Bioinformatics Unveiled
Joan Melody
No ratings yet
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
ELS Benefits of Genetically Modified Organisms
No ratings yet
ELS Benefits of Genetically Modified Organisms
17 pages
237 Full
No ratings yet
237 Full
5 pages
BIOL2120 3 Transport Across Membranes
No ratings yet
BIOL2120 3 Transport Across Membranes
60 pages
Lecture 13 Recombination and Transposition From Internet
No ratings yet
Lecture 13 Recombination and Transposition From Internet
112 pages
Bio Final
No ratings yet
Bio Final
8 pages
Occupancy Theory
No ratings yet
Occupancy Theory
9 pages
MCQs About Cell Biology
No ratings yet
MCQs About Cell Biology
3 pages
Biology For The IB Diploma Chapter 3 Summary
No ratings yet
Biology For The IB Diploma Chapter 3 Summary
7 pages
Lesson 4-7: Molecular Structure of DNA, RNA, and Proteins: Rhoda S.R. Cayanan, RPH, LPT
100% (2)
Lesson 4-7: Molecular Structure of DNA, RNA, and Proteins: Rhoda S.R. Cayanan, RPH, LPT
75 pages
Plants 09 01355
No ratings yet
Plants 09 01355
47 pages
Science10 - Q3 - Mod4 - Protein Synthesis - Ver3
50% (2)
Science10 - Q3 - Mod4 - Protein Synthesis - Ver3
28 pages
Notes - Module 1 Cells As The Basis of Life Sibel
No ratings yet
Notes - Module 1 Cells As The Basis of Life Sibel
10 pages
Mutation SHO
No ratings yet
Mutation SHO
6 pages
GRE Syllabus
No ratings yet
GRE Syllabus
18 pages
Phylogenetics PDF
No ratings yet
Phylogenetics PDF
21 pages
GEN-BIO-2-Q3-REVIEWER
No ratings yet
GEN-BIO-2-Q3-REVIEWER
11 pages
General Biology 1
No ratings yet
General Biology 1
32 pages
Regulation of Cell Cycle
No ratings yet
Regulation of Cell Cycle
5 pages
Full download Food Authenticity and Traceability 1st Edition Michele Lees pdf docx
100% (12)
Full download Food Authenticity and Traceability 1st Edition Michele Lees pdf docx
60 pages
1. Introduction to Pharmaceutical Biotechnology
No ratings yet
1. Introduction to Pharmaceutical Biotechnology
16 pages
Immunology Assignment 1
100% (1)
Immunology Assignment 1
3 pages
Protein Library
No ratings yet
Protein Library
11 pages
GenElute™ HP High Performance Plasmid Midiprep Kit (NA0200) - Bulletin
100% (1)
GenElute™ HP High Performance Plasmid Midiprep Kit (NA0200) - Bulletin
7 pages
Genetics and Nutrition: Review
No ratings yet
Genetics and Nutrition: Review
7 pages
2020 Real-Time PCR Assay For Detecting Lactobacillus Plantarum Group Using Species Subspecies Specific Genes Identified by Comparative Genomics
No ratings yet
2020 Real-Time PCR Assay For Detecting Lactobacillus Plantarum Group Using Species Subspecies Specific Genes Identified by Comparative Genomics
37 pages
m
No ratings yet
m
7 pages
Paper 4
No ratings yet
Paper 4
9 pages
4. Molecular Cloning Methods
No ratings yet
4. Molecular Cloning Methods
66 pages
Cells: Structure and Function
No ratings yet
Cells: Structure and Function
31 pages

Lecture 5- DataBase

Uploaded by

Lecture 5- DataBase

Uploaded by

ISC 211 Introduction

• In retrieving the DNA sequence, search can be limited to “organism”,

You might also like