0% found this document useful (0 votes)

30 views

Ref Seq

The RefSeq database is a curated collection of nucleotide sequences and protein sequences maintained by the National Center for Biotechnology Information. It provides a single record for each biological molecule for major organisms ranging from viruses to bacteria to eukaryotes. RefSeq aims to provide separate and linked records for genomic DNA, gene transcripts, and protein products. It currently represents over 121,000 named organisms.

Uploaded by

william919

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views

Ref Seq

Uploaded by

william919

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

RefSeq

The Reference Sequence (RefSeq) database[1]

Refseq
is an open access, annotated and curated
collection of publicly available nucleotide
sequences (DNA, RNA) and their protein
products. RefSeq was first introduced in
2000.[2][3] This database is built by National
Center for Biotechnology Information (NCBI), Content
and, unlike GenBank, provides only a single Description curated non-redundant sequence
record for each natural biological molecule (i.e.
database of genomes.
DNA, RNA or protein) for major organisms
ranging from viruses to bacteria to eukaryotes. Contact
Research center National Center for Biotechnology
For each model organism, RefSeq aims to
Information
provide separate and linked records for the
[1]
genomic DNA, the gene transcripts, and the Primary citation Pruitt KD & al. (2005)
proteins arising from those transcripts. RefSeq Access
is limited to major organisms for which
Website https://www.ncbi.nlm.nih.gov/RefSeq
sufficient data are available (121,461 distinct
"named" organisms as of July 2022),[4] while
GenBank includes sequences for any organism submitted (approximately 504,000 formally described
species).[5]

RefSeq categories
RefSeq collection comprises different data types, with different origins, so it is necessary to establish
standard categories and identifiers to store each data type. The most important categories are:

RefSeq accession categories and molecule types

Category Description

NC Complete genomic molecules

NG Incomplete genomic region

NM mRNA

NR ncRNA
NP Protein

XM predicted mRNA model

XR predicted ncRNA model

XP predicted Protein model (eukaryotic sequences)

WP predicted Protein model (prokaryotic sequences)

For more details and more categories, see Table 1 (https://www.ncbi.nlm.nih.gov/books/NBK21091/table/c
h18.T.entrez_queries_to_retrieve_sets_o) in Chapter 18 of the book The Reference Sequence (RefSeq)
Database (https://www.ncbi.nlm.nih.gov/books/NBK21091).

RefSeq Projects
Several projects to improve RefSeq services are currently in development by the NCBI, often in
collaboration with research centers such as EMBL-EBI:

Consensus CDS (CCDS): This project aims to identify a core set of human and mouse
protein-coding regions and standardize sets of genes with high and consistent levels of
genomic annotation quality. This project was announced in 2009 and is still in
development.[6][7]
RefSeq Functional Elements (RefSeqFE): It is focused on describing non-genic functional
elements which are gene regulatory regions such as: enhancers, silencers, DNase I
hypersensitive regions, DNA replication origins etc.). The current scope of this project is
restricted to the human and mouse genomes.[8]
RefSeqGene: Its main goal is to define genomic sequences to be used as reference
standards for well-characterized genes. Previously described mRNA, protein and
chromosome sequences have the weaknesses of not providing explicit genomic coordinates
of gene flanking and intronic regions as well as showing awkwardly large coordinates that
change with every new genome assembly. The RefSeqGene project is designed to
eliminate these errors.[9]
Targeted Loci: This project records molecular markers, specially protein-coding and
ribosomal RNA loci that are used for phylogenetic and barcoding analysis. The scope of this
project includes sequences for Archaea, Bacteria and Fungi organisms, accessible via
Entrez and BLAST queries. It also includes GenBank sequences for Animals, Plants and
Protists, accessible via BLAST queries.[10]
Virus Variation (ViV): It is a specific resource of sequence data processing pipelines and
analysis tools for display and retrieval of sequences from several viral groups such as
influenza virus, ebolavirus, MERS coronavirus or Zika virus. New viruses, processing
pipelines, tools and other features are included regularly.[11]
RefSeq Select: This project aims to select datasets of RefSeq Select transcripts, as the
most representative for every protein-coding gene, based on multiple criteria: prior use in
clinical databases, transcript expression, evolutionary conservation of the coding region etc.
Since many genes are represented by multiple RefSeq transcripts/proteins due to the
biological process of alternative splicing, this complexity is problematic for studies such as
comparative genomics or exchange of clinical variant data.[12]
MANE (Matched Annotation from the NCBI and EMBL-EBI): It is a collaborative project
between NCBI and EMBL-EBI whose main goal is to define a set of transcripts and their
proteins for all the protein-coding genes in the human genome. By doing that, the differences
in transcripts annotation between RefSeq and Ensembl/GENCODE annotation systems are
reduced. A MANE Select transcripts set are created as a useful universal standard for
clinical reporting and comparative or evolutionary genomics. A second MANE Plus Clinical
set are also created with additional transcripts to report all Pathogenic (P) or Likely
Pathogenic (LP) clinical variants available in public resources.[13] This project was
announced in 2018 and is expected to finish in 2022.

Statistics
According to the RefSeq release 213 (July 2022), the number of species represented in the database by
counting distinct taxonomic IDs are as follows:[4]

Taxonomic ID Species

Archaea 1443
Bacteria 69122

Complete 121461

Fungi 16869
Invertebrate 5715

Mitochondrion 13648

Plant 9177
Plasmid 6073

Plastid 9430

Protozoa 746
Vertebrate (mammalian) 1509

Viral 11620

Vertebrate (other) 5237

Other 4

The counts of accession and basepairs per molecule type are:[4]

Molecule type Accessions Basepairs/residues

Genomics 40,758,769 2.923212393984 × 1012

RNA 45,781,716 1.22253022047 × 1011

Protein 234,520,053 9.129062394 × 1010

See also
GenBank
Sequence analysis
Sequence profiling tool
Sequence motif
UniProt
List of sequenced eukaryotic genomes
List of sequenced archaeal genomes

References
1. Pruitt KD, Tatusova T, Maglott DR (January 2005). "NCBI Reference Sequence (RefSeq): a
curated non-redundant sequence database of genomes, transcripts and proteins" (https://ww
w.ncbi.nlm.nih.gov/pmc/articles/PMC539979). Nucleic Acids Research. 33 (Database
issue): D501–D504. doi:10.1093/nar/gki025 (https://doi.org/10.1093%2Fnar%2Fgki025).
PMC 539979 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC539979). PMID 15608248 (http
s://pubmed.ncbi.nlm.nih.gov/15608248).
2. Maglott DR, Katz KS, Sicotte H, Pruitt KD (January 2000). "NCBI's LocusLink and RefSeq"
(https://www.ncbi.nlm.nih.gov/pmc/articles/PMC102393). Nucleic Acids Research. 28 (1):
126–128. doi:10.1093/nar/28.1.126 (https://doi.org/10.1093%2Fnar%2F28.1.126).
PMC 102393 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC102393). PMID 10592200 (http
s://pubmed.ncbi.nlm.nih.gov/10592200).
3. Pruitt KD, Katz KS, Sicotte H, Maglott DR (January 2000). "Introducing RefSeq and
LocusLink: curated human genome resources at the NCBI". Trends in Genetics. 16 (1): 44–
47. doi:10.1016/s0168-9525(99)01882-x (https://doi.org/10.1016%2Fs0168-9525%2899%29
01882-x). PMID 10637631 (https://pubmed.ncbi.nlm.nih.gov/10637631).
4. RefSeq Release 213 Statistics (http://ftp.ncbi.nlm.nih.gov/refseq/release/release-notes/)
(Report). National Library of Medicine. 11 July 2022. Retrieved 20 July 2022.
5. Sayers EW, Cavanaugh M, Clark K, Pruitt KD, Schoch CL, Sherry ST, Karsch-Mizrachi I
(January 2022). "GenBank" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8690257).
Nucleic Acids Research. 50 (D1): D161–D164. doi:10.1093/nar/gkab1135 (https://doi.org/10.
1093%2Fnar%2Fgkab1135). PMC 8690257 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC
8690257). PMID 34850943 (https://pubmed.ncbi.nlm.nih.gov/34850943).
6. Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, et al. (July 2009). "The
consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set
for the human and mouse genomes" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC270443
9). Genome Research. 19 (7): 1316–1323. doi:10.1101/gr.080531.108 (https://doi.org/10.110
1%2Fgr.080531.108). PMC 2704439 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC27044
39). PMID 19498102 (https://pubmed.ncbi.nlm.nih.gov/19498102).
7. Pujar S, O'Leary NA, Farrell CM, Loveland JE, Mudge JM, Wallin C, et al. (January 2018).
"Consensus coding sequence (CCDS) database: a standardized set of human and mouse
protein-coding regions supported by expert curation" (https://www.ncbi.nlm.nih.gov/pmc/artic
les/PMC5753299). Nucleic Acids Research. 46 (D1): D221–D228. doi:10.1093/nar/gkx1031
(https://doi.org/10.1093%2Fnar%2Fgkx1031). PMC 5753299 (https://www.ncbi.nlm.nih.gov/p
mc/articles/PMC5753299). PMID 29126148 (https://pubmed.ncbi.nlm.nih.gov/29126148).
8. Farrell CM, Goldfarb T, Rangwala SH, Astashyn A, Ermolaeva OD, Hem V, et al. (January
2022). "RefSeq Functional Elements as experimentally assayed nongenic reference
standards and functional interactions in human and mouse" (https://www.ncbi.nlm.nih.gov/p
mc/articles/PMC8744684). Genome Research. 32 (1): 175–188. doi:10.1101/gr.275819.121
(https://doi.org/10.1101%2Fgr.275819.121). PMC 8744684 (https://www.ncbi.nlm.nih.gov/pm
c/articles/PMC8744684). PMID 34876495 (https://pubmed.ncbi.nlm.nih.gov/34876495).
9. Gulley ML, Braziel RM, Halling KC, Hsi ED, Kant JA, Nikiforova MN, et al. (June 2007).
"Clinical laboratory reports in molecular pathology". Archives of Pathology & Laboratory
Medicine. 131 (6): 852–863. doi:10.5858/2007-131-852-CLRIMP (https://doi.org/10.5858%2
F2007-131-852-CLRIMP). PMID 17550311 (https://pubmed.ncbi.nlm.nih.gov/17550311).
10. "NCBI RefSeq Targeted Loci Project" (https://www.ncbi.nlm.nih.gov/refseq/targetedloci/).
www.ncbi.nlm.nih.gov. Retrieved 2022-07-27.
11. Hatcher EL, Zhdanov SA, Bao Y, Blinkova O, Nawrocki EP, Ostapchuck Y, et al. (January
2017). "Virus Variation Resource - improved response to emergent viral outbreaks" (https://w
ww.ncbi.nlm.nih.gov/pmc/articles/PMC5210549). Nucleic Acids Research. 45 (D1): D482–
D490. doi:10.1093/nar/gkw1065 (https://doi.org/10.1093%2Fnar%2Fgkw1065).
PMC 5210549 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5210549). PMID 27899678
(https://pubmed.ncbi.nlm.nih.gov/27899678).
12. "NCBI RefSeq Select" (https://www.ncbi.nlm.nih.gov/refseq/refseq_select/).
www.ncbi.nlm.nih.gov. Retrieved 2022-07-27.
13. Morales J, Pujar S, Loveland JE, Astashyn A, Bennett R, Berry A, et al. (April 2022). "A joint
NCBI and EMBL-EBI transcript set for clinical genomics and research" (https://www.ncbi.nl
m.nih.gov/pmc/articles/PMC9007741). Nature. 604 (7905): 310–315. doi:10.1038/s41586-
022-04558-8 (https://doi.org/10.1038%2Fs41586-022-04558-8). PMC 9007741 (https://www.
ncbi.nlm.nih.gov/pmc/articles/PMC9007741). PMID 35388217 (https://pubmed.ncbi.nlm.nih.
gov/35388217).

Sources
This article incorporates public domain material from NCBI Handbook (https://www.ncbi.nl
m.nih.gov/books/bv.fcgi?call=bv.View..ShowTOC&rid=handbook.TOC&depth=2). National
Center for Biotechnology Information.

External links
RefSeq (https://www.ncbi.nlm.nih.gov/RefSeq)
GenBank, RefSeq, TPA and UniProt: What's in a Name? (https://www.ncbi.nlm.nih.gov/book
s/NBK21105/#ch1.Appendix_GenBank_RefSeq_TPA_and_UniP)

Retrieved from "https://en.wikipedia.org/w/index.php?title=RefSeq&oldid=1145721600"

Hmm202 Practical 1 Worksheet
No ratings yet
Hmm202 Practical 1 Worksheet
7 pages
5.1.6. Alternative Methods For Control of Microbiological Quality
No ratings yet
5.1.6. Alternative Methods For Control of Microbiological Quality
10 pages
Chapter 6
No ratings yet
Chapter 6
22 pages
BIOINFORMATICS PRACTICAL FILE
No ratings yet
BIOINFORMATICS PRACTICAL FILE
12 pages
NCBI Handbook
No ratings yet
NCBI Handbook
391 pages
Molecular Genetics - Lab Manual - 22 May 2021
No ratings yet
Molecular Genetics - Lab Manual - 22 May 2021
36 pages
Database Dalam Bioinformatika
No ratings yet
Database Dalam Bioinformatika
34 pages
NCBI Genome
No ratings yet
NCBI Genome
37 pages
Bioinformatics Unit I
No ratings yet
Bioinformatics Unit I
6 pages
Bioinformatics Day3
No ratings yet
Bioinformatics Day3
4 pages
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
No ratings yet
Bioinformatics Tools For Nucleotide Sequence Analysis and Database Exploration
75 pages
4Bioinformaticsdatabases
No ratings yet
4Bioinformaticsdatabases
71 pages
Introduction to Bioinformatics, Sequence and Genome Analysis
From Everand
Introduction to Bioinformatics, Sequence and Genome Analysis
Jerry H. Swift
No ratings yet
Adv Bi Unit 1
No ratings yet
Adv Bi Unit 1
39 pages
GlOsario Bioinformatica
No ratings yet
GlOsario Bioinformatica
5 pages
Bioinfi U3 Part -1
No ratings yet
Bioinfi U3 Part -1
4 pages
Lec 3 Terms and Definitions in Bioinformatics
No ratings yet
Lec 3 Terms and Definitions in Bioinformatics
8 pages
National Center For Biotechnology Information
No ratings yet
National Center For Biotechnology Information
4 pages
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Genomics
No ratings yet
Genomics
8 pages
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
No ratings yet
Impact of Gene Annotation On RNA-seq Data Analysis Shanrong Zhao and Baohong Zhang
23 pages
Data Retrieval
67% (3)
Data Retrieval
17 pages
Module 1_Session 3_Part 1
No ratings yet
Module 1_Session 3_Part 1
17 pages
Genbank: National Center For Biotechnology Information
No ratings yet
Genbank: National Center For Biotechnology Information
5 pages
Biological Data and Database Biological Data
No ratings yet
Biological Data and Database Biological Data
10 pages
A Review Article On Bioinformatics Tools and Software
No ratings yet
A Review Article On Bioinformatics Tools and Software
14 pages
Lista de Bases de Datos
No ratings yet
Lista de Bases de Datos
13 pages
List of Biological Databases
No ratings yet
List of Biological Databases
9 pages
Online Biological Databases: A/Prof. Ly Le
No ratings yet
Online Biological Databases: A/Prof. Ly Le
64 pages
Databases 2 Kd
No ratings yet
Databases 2 Kd
4 pages
Genome Annotation
No ratings yet
Genome Annotation
24 pages
Gen Bank
No ratings yet
Gen Bank
6 pages
NCBI Resources
No ratings yet
NCBI Resources
13 pages
Techniques and Analysis
No ratings yet
Techniques and Analysis
1 page
Lecture 5- DataBase
No ratings yet
Lecture 5- DataBase
18 pages
Chapter 1: Genbank: The Nucleotide Sequence Database: Ilene Mizrachi
No ratings yet
Chapter 1: Genbank: The Nucleotide Sequence Database: Ilene Mizrachi
14 pages
GKE024
No ratings yet
GKE024
4 pages
Lec 2 Bioinformatics Glossary
No ratings yet
Lec 2 Bioinformatics Glossary
6 pages
Group # 13
No ratings yet
Group # 13
49 pages
System Biology Assignment
No ratings yet
System Biology Assignment
17 pages
Generating Structural Data Analysis
No ratings yet
Generating Structural Data Analysis
8 pages
DNA Code Basics
From Everand
DNA Code Basics
Zara Sagan
No ratings yet
Bioinformatics Definition
No ratings yet
Bioinformatics Definition
11 pages
Genome Project (1)
No ratings yet
Genome Project (1)
11 pages
COMP90016 2023 06 Data Sources
No ratings yet
COMP90016 2023 06 Data Sources
64 pages
Nucleic_Acid_Databases
No ratings yet
Nucleic_Acid_Databases
37 pages
NT Seq Database
No ratings yet
NT Seq Database
4 pages
Introduction To Databases - NCBI, PDB and Uniprot
No ratings yet
Introduction To Databases - NCBI, PDB and Uniprot
5 pages
Complete_Bulk_RNA_Sequencing_Presentation
No ratings yet
Complete_Bulk_RNA_Sequencing_Presentation
10 pages
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
No ratings yet
Bioinform-Tica-Pdf-May-6-2010-12-38-Pm-3-5-Meg
105 pages
Class 03-04-03
No ratings yet
Class 03-04-03
123 pages
Biological Databases Genbank
No ratings yet
Biological Databases Genbank
31 pages
DATAbases1KD
No ratings yet
DATAbases1KD
5 pages
Bioinformatics Databases
No ratings yet
Bioinformatics Databases
10 pages
Unit I
No ratings yet
Unit I
28 pages
Structure and Function of Sars-Cov-2 Spike Protein: A Multiple Sequence Alignment (Msa) Study
No ratings yet
Structure and Function of Sars-Cov-2 Spike Protein: A Multiple Sequence Alignment (Msa) Study
11 pages
Datos de Bases de Enzimas
No ratings yet
Datos de Bases de Enzimas
2 pages
Databases Bioinformatics
No ratings yet
Databases Bioinformatics
42 pages
Bioinformatics (STH Sir)
No ratings yet
Bioinformatics (STH Sir)
13 pages
4.2
No ratings yet
4.2
18 pages
BCH 505 Bioinformatics 3(2 2) Databases
No ratings yet
BCH 505 Bioinformatics 3(2 2) Databases
17 pages
ok
No ratings yet
ok
29 pages
17373.selected Works in Bioinformatics by Xuhua Xia PDF
No ratings yet
17373.selected Works in Bioinformatics by Xuhua Xia PDF
190 pages
Swiss Model
No ratings yet
Swiss Model
4 pages
Similarity Matrix of Proteins
No ratings yet
Similarity Matrix of Proteins
2 pages
Mi RBase
No ratings yet
Mi RBase
2 pages
CATH Database
No ratings yet
CATH Database
3 pages
Structural Classification of Proteins Database
100% (1)
Structural Classification of Proteins Database
8 pages
Protein Data Bank
No ratings yet
Protein Data Bank
5 pages
UCSC Malaria Genome Browser
No ratings yet
UCSC Malaria Genome Browser
2 pages
National Cancer Institute
No ratings yet
National Cancer Institute
12 pages
Rat Genome Database
No ratings yet
Rat Genome Database
11 pages
Bioinformatic Harvester
No ratings yet
Bioinformatic Harvester
3 pages
Ensembl Genome Database Project
No ratings yet
Ensembl Genome Database Project
8 pages
23 and Me
No ratings yet
23 and Me
17 pages
International HapMap Project
No ratings yet
International HapMap Project
7 pages
United States Plant Patent: Firoozbadly Et Al. Aug. 4, 2015
No ratings yet
United States Plant Patent: Firoozbadly Et Al. Aug. 4, 2015
19 pages
2ndQ SECOND SUMMATIVE TEST EARTH AND LIFE
No ratings yet
2ndQ SECOND SUMMATIVE TEST EARTH AND LIFE
2 pages
Animal Breeding
No ratings yet
Animal Breeding
20 pages
Exercise and Problems Types of Nucleic Acids (Section 22.1)
No ratings yet
Exercise and Problems Types of Nucleic Acids (Section 22.1)
7 pages
Earth and Life Sci QRTR 2 Module 5 Perpetuation of Life Student Edition Grade 11 Descartes
100% (1)
Earth and Life Sci QRTR 2 Module 5 Perpetuation of Life Student Edition Grade 11 Descartes
46 pages
Sangwan Et Al., 2015
No ratings yet
Sangwan Et Al., 2015
12 pages
Buttner - JAP 07 - Microarray
No ratings yet
Buttner - JAP 07 - Microarray
12 pages
Bioabsorption of Metals
No ratings yet
Bioabsorption of Metals
26 pages
AP +Midterm+Review+Questions Boichem
No ratings yet
AP +Midterm+Review+Questions Boichem
30 pages
Sordaria Lab Report
No ratings yet
Sordaria Lab Report
8 pages
Dna The Book of Life
100% (1)
Dna The Book of Life
9 pages
Chap 17 From Gene To Protein Fix
No ratings yet
Chap 17 From Gene To Protein Fix
30 pages
Instant Access to Crop Evolution and Genetic Resources Agricultural and Horticultural Crops 1st Edition Darbeshwar Roy ebook Full Chapters
100% (8)
Instant Access to Crop Evolution and Genetic Resources Agricultural and Horticultural Crops 1st Edition Darbeshwar Roy ebook Full Chapters
37 pages
Ppt. Sex Linkage
No ratings yet
Ppt. Sex Linkage
38 pages
Rubric Genetic Disorder Project
No ratings yet
Rubric Genetic Disorder Project
4 pages
Transformation: Avery, Macleod and Mccarty in 1944
No ratings yet
Transformation: Avery, Macleod and Mccarty in 1944
7 pages
Gene: Fine Structure of Gene
No ratings yet
Gene: Fine Structure of Gene
92 pages
hgp
No ratings yet
hgp
34 pages
Edexcel International GCSE Biology Chapter 16 Learning Plan
No ratings yet
Edexcel International GCSE Biology Chapter 16 Learning Plan
2 pages
(Edu - Joshuatly.com) Trial Malacca STPM 2012 Biology Paper 1 (A6DCE16B) PDF
No ratings yet
(Edu - Joshuatly.com) Trial Malacca STPM 2012 Biology Paper 1 (A6DCE16B) PDF
23 pages
ANGRAU Journal - Vol 49 (2) April-June, 2021 My Article
No ratings yet
ANGRAU Journal - Vol 49 (2) April-June, 2021 My Article
164 pages
2019 Plant Science JRF
No ratings yet
2019 Plant Science JRF
45 pages
Dna: The Genetic Code: Gabriel A. Abrigonda, L.Agr
No ratings yet
Dna: The Genetic Code: Gabriel A. Abrigonda, L.Agr
15 pages
L1. Exploring Life-2024
No ratings yet
L1. Exploring Life-2024
39 pages
Zoology PG Syllabus MSC
No ratings yet
Zoology PG Syllabus MSC
33 pages
Evolution and Genetics For Psychology 1st Edition Nettle All Chapter Instant Download
100% (6)
Evolution and Genetics For Psychology 1st Edition Nettle All Chapter Instant Download
84 pages
TNPSC Vas: NEW Syllabus
No ratings yet
TNPSC Vas: NEW Syllabus
12 pages

Ref Seq

Uploaded by

Ref Seq

Uploaded by

RefSeq

The Reference Sequence (RefSeq) database[1]

RefSeq accession categories and molecule types

NC Complete genomic molecules

XM predicted mRNA model

XR predicted ncRNA model

WP predicted Protein model (prokaryotic sequences)

Vertebrate (other) 5237

The counts of accession and basepairs per molecule type are:[4]

Molecule type Accessions Basepairs/residues

Genomics 40,758,769 2.923212393984 × 1012

RNA 45,781,716 1.22253022047 × 1011

Protein 234,520,053 9.129062394 × 1010

Retrieved from "https://en.wikipedia.org/w/index.php?title=RefSeq&oldid=1145721600"

You might also like