0% found this document useful (0 votes)
34 views7 pages

lncRNome A Comprehensive Knowledgebase of Human Long Noncoding RNAs

Uploaded by

Bahlibi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views7 pages

lncRNome A Comprehensive Knowledgebase of Human Long Noncoding RNAs

Uploaded by

Bahlibi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Database, Vol. 2013, Article ID bat034, doi:10.

1093/database/bat034
.............................................................................................................................................................................................................................................................................................

Original article
lncRNome: a comprehensive knowledgebase
of human long noncoding RNAs
Deeksha Bhartiya1,y, Koustav Pal2,y, Sourav Ghosh3, Shruti Kapoor3, Saakshi Jalali1,
Bharat Panwar4, Sakshi Jain2, Satish Sati3, Shantanu Sengupta3, Chetana Sachidanandan3,
Gajendra Pal Singh Raghava4, Sridhar Sivasubbu3 and Vinod Scaria1,*
1
GN Ramachandran Knowledge Center for Genome Informatics, CSIR Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India,
2
CSIR Open Source Drug Discovery Unit, Council of Scientific and Industrial Research, Anusandhan Bhavan, Delhi 110001, India, 3Genomics and

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


Molecular Medicine, CSIR Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India and 4Bioinformatics Centre, CSIR Institute
of Microbial Technology, Sector 39-A, Chandigarh 160036, India

*Corresponding author: Tel: +91 9650466002; Fax: +91 11 27667471; Email: [email protected]
y
These authors contributed equally to this work.

Submitted 17 October 2012; Revised 8 April 2013; Accepted 11 April 2013

Citation details: Bhartiya,D., Pal,K., Ghosh,S., et al. lncRNome: a comprehensive knowledgebase of human long noncoding RNAs. Database (2013)
Vol. 2013: article ID bat034; doi:10.1093/database/bat034.

.............................................................................................................................................................................................................................................................................................

The advent of high-throughput genome scale technologies has enabled us to unravel a large amount of the previously
unknown transcriptionally active regions of the genome. Recent genome-wide studies have provided annotations of a
large repertoire of various classes of noncoding transcripts. Long noncoding RNAs (lncRNAs) form a major proportion of
these novel annotated noncoding transcripts, and presently known to be involved in a number of functionally distinct
biological processes. Over 18 000 transcripts are presently annotated as lncRNA, and encompass previously annotated
classes of noncoding transcripts including large intergenic noncoding RNA, antisense RNA and processed pseudogenes.
There is a significant gap in the resources providing a stable annotation, cross-referencing and biologically relevant infor-
mation. lncRNome has been envisioned with the aim of filling this gap by integrating annotations on a wide variety of
biologically significant information into a comprehensive knowledgebase. To the best of our knowledge, lncRNome is one
of the largest and most comprehensive resources for lncRNAs.
Database URL: http://genome.igib.res.in/lncRNome
.............................................................................................................................................................................................................................................................................................

The noncoding RNA transcripts have been classified into a


Introduction number of subclasses, with the most popular classification
The availability of technology to annotate transcriptomes being based on their size, such as the class of small noncod-
at the genome-scale and single-nucleotide resolution has in ing RNAs, which include the well-annotated microRNAs
the recent years provided a new outlook at the transcribed (miRNAs) (6), small nucleolar RNAs (snoRNAs), long noncod-
regions within the Human genome (1–3). Contrary to the ing RNAs (lncRNAs) and so on.
popular belief, a large number of genomic loci have been Long noncoding RNAs (lncRNAs), by definition, are tran-
presently annotated to be transcriptionally active (4). scripts that are >200 nucleotides in length and do not have
Many of these regions do not have the potential to the potential to encode for proteins exceeding lengths of
encode for functional proteins and thus constitute a class 30 amino acids (7, 8). Transcriptome annotation in recent
of transcripts, popularly annotated as noncoding RNA (5). years has significantly expanded the repertoire of lncRNAs,

.............................................................................................................................................................................................................................................................................................
ß The Author(s) 2013. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/
licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly
cited. Page 1 of 6
(page number not for citation purposes)
Original article Database, Vol. 2013, Article ID bat034, doi:10.1093/database/bat034
.............................................................................................................................................................................................................................................................................................

not just in humans, but also in other model systems like per the needs of a user. To this end, the structure was
mouse (9) and zebrafish (10, 11). Although noncoding tran- designed following consultation with a number of experi-
scripts with >200 nucleotide lengths have been clubbed mental and computational biologists. We created the data-
together in a general classification of lncRNAs, the mem- base to serve as a comprehensive, user-friendly and
bers of this class have significant differences in their biolo- biologically relevant knowledgebase on human lncRNAs
gical function, genomic loci and regulation. This class built on MySQL 5.6 and having a PHP-based web interface.
includes previously known classes of ncRNAs including the In brief, each lncRNA gene has a single page with
large intergenic noncoding RNA, transcribed pseudogenes, basic linkouts to other relevant databases, annotation sets
antisense transcripts and several others, including the and relevant categories of information linked in tabs. Five
annotated classes of functionally distinct transcripts such categories of information are presently available linked
as Xist, which is involved in X inactivation (12) and Hotair with each lncRNA, which includes (i) General Information,
(13), involved in epigenetic regulation. (ii) Sequence and Structure, (iii) Interactions and Processing,
Functionally, the lncRNA class encompasses a wide var- (iv) Variations and Conservation and (v) Epigenetic
iety of distinct functions like X-chromosome inactivation, Modifications. These categories are connected to the
modulation of chromatin structure, regulation of transcrip- genome browser along with the conservation scores of all
tional and posttranscriptional processes and epigenetic lncRNA transcripts (Supplementary File S1).
modifications (14). The biological function of lncRNAs is The category ‘General Information’ hosts information

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


modulated through interaction with other biomolecules like the gene name, Ensembl gene ID, gene type, gene
in the cell, such as DNA, RNA and proteins (15). Recent status, Ensembl transcript ID, transcript name, transcript
evidence has also indicated putative regulatory roles type, transcript status, chromosome, strand and genomic
for smaller RNAs processed from lncRNAs, as well as loci, all of which have been fetched from Gencode release
for lncRNAs themselves that harbor regulatory motifs 12 (http://www.gencodegenes.org) (34). The gene names
(16, 17). lncRNAs could be regulated in a different way were used to map the HGNC ID, Refseq ID, Havana gene
than protein-coding genes (18). Recent evidence also sug- ID, Havana transcript ID, NCBI ID and chromosomal loci
gests the role of lncRNAs in several diseases including a from HUGO Gene Nomenclature Committee website (35).
number of cancers like lung cancers, colorectal and blood The length was calculated using the genomic loci. The
neoplasia (7, 19). Candidate lncRNAs like NEAT2 and details about lncRNA description, disease associations,
MALAT1 have been studied in detail with their relations interactions, overexpression and references were manually
with metastasis in cancers (20–22). Additional candidates curated through literature. The alternate transcripts were
like ANRIL have been implicated in diseases like atheroscler- derived using in-house scripts and all lncRNAs were pro-
osis, (23, 24) while a number of candidate genome-wide vided stable internal IDs.
association loci map to regions presently annotated as The lncRNA sequences were downloaded from UCSC
lncRNA genes (25). It has been also suggested that a con- Genome Browser Database (36), and the structures were
ceptual understanding of lncRNA as a function of the bio- predicted using RNAfold version 1.8.5. Both the parenthesis
logical interactions would help to understand disease structure and the minimum free energy structure predicted
processes and develop potential drug targets (26). using the default parameters have been provided.
There are several comprehensive databases for other The third category comprises lncRNA interactions with
ncRNAs like miRNAs (27–30), snoRNAs (31); however, proteins and other RNAs, lncRNA processing, predicted
there is a paucity of such databases integrating biologically open reading frames (ORFs) and various motifs. The data-
significant annotations for lncRNAs. Although there are base hosts 937 quadruplex and 40 hairpins motifs present in
lncRNA databases coming up like lncRNAdb (32), lncRNAs. Both the motifs have been predicted using tools
NONCODE (33), etc., the extent of lncRNA annotations developed ‘in-house’, Quadfinder (37) and HairpinFetcher,
still remains stringent. lncRNome has been formulated to respectively. It also hosts 3716 miRNA binding sites on
integrate annotations on a wide variety of biologically sig- lncRNA. More than 10 000 binding sites for nine other pro-
nificant information into a comprehensive knowledgebase. teins, which have been summarized in the section ‘Datasets
To the best of our knowledge, lncRNome is one of the and Features’, have been provided. These datasets have
largest catalogs for lncRNAs till date, and is available been mapped using PAR-CLIP (38) and CLIP-Seq datasets
online at the URL: http://genome.igib.res.in/lncRNome. as described in the later sections. There are 6808 predicted
protein-binding sites also provided in the database, which
were predicted using Support Vector Machine–based evalu-
Database design and architecture ation of interaction propensities. The 1692 small RNA
The lncRNome database has been designed keeping in processing sites have also been provided as described in
mind both experimental and computational biologists, so the sections below. The fourth category consists of
as to provide ready access to biologically relevant data as 345 351 genomic variations mapped to lncRNAs. The

.............................................................................................................................................................................................................................................................................................

Page 2 of 6
Database, Vol. 2013, Article ID bat034, doi:10.1093/database/bat034 Original article
.............................................................................................................................................................................................................................................................................................

database of single nucleotide polymorphisms (dbSNP) transcripts (36). RNA structures were computed using
SNPs were downloaded from UCSC genome browser and RNAfold with default parameters, which is part of the
mapped to lncRNAs. Conservation scores of 66 573 sites Vienna RNA package version 1.8.5. Our group has previ-
within lncRNAs have been provided in this category. The ously suggested the presence of G-quadruplex motifs in
fifth category provides 11 790 epigenetic marks in the pro- lncRNAs that could have potential regulatory functions
moters of lncRNAs. The datasets were downloaded from (39). To enable researchers to further take up experiments
the NIH Human Epigenome Roadmap project and in this area, predictions of potential G-quadruplex forming
mapped to lncRNA promoters. The detailed methods are motifs in entire lncRNA transcripts predicted using
available as Supplementary methods. Quadfinder have been included (37), as well as potential
The database also features a comprehensive search hairpin structures in the lncRNA have been identified
option, which enables users to search through lncRNome using HairpinFetcher.
using different keywords, such as, lncRNA names, Ensembl
IDs, known targets, SNPs, diseases, etc. In addition, a separ- lncRNA processing
ate browse option also allows users to browse the database A recent study conducted by our lab has pointed to a subset
through either using the chromosome numbers or different of lncRNAs, which could be potentially processed to small
lncRNA biotypes. The database also features a genome RNAs having downstream regulatory functions by having a
browser, which can be used to browse through the dual transcriptional output (40). The same analysis was

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


genome for representative features and also provides a replicated on the present large datasets of lncRNAs. In
visual representation of the associated genomic annota- brief, smallRNA clusters were derived from DeepBase (41),
tions available within the database mentioned above a comprehensive database of smallRNA annotations
along with the conservation scores of lncRNAs. derived from smallRNA sequencing experiments available
in the public domain and overlaid on the lncRNA annota-
tions to derive information on potential lncRNAs that could
Datasets and features be processed to smallRNAs.
Long noncoding RNA annotations Protein-RNA interactions
lncRNA annotations were derived from Gencode release 12 Recent high-throughput experimental methods for analysis
(http://www.gencodegenes.org) (34), which consists of of interactions through pull down and sequencing tech-
11 790 lncRNA genes and 18 855 transcripts. The lncRNAs niques have provided critical insights into the landscape
transcripts are classified into 10 different biotypes, the stat- of protein–RNA interactions in the human genome (42).
istics of which has been provided in the Figure 1. In add- One of the major datasets of protein–RNA interactions is
ition, the datasets of lncRNAs and their HGNC IDs were derived from PAR-CLIP (38) experiments for Argonaute
derived from the Human Gene Nomenclature Committee (Ago) proteins, which are critical components of the RISC
website (35), which consisted of 1073 lncRNAs. Additional machinery involved in miRNA targeting. A comprehensive
mappings were derived for 99 human lncRNAs from mapping of potential Ago binding sites in the lncRNA tran-
lncRNAdb and from literature through manual curation scriptome is provided by mapping the reads to the human
and overlapped with each other based on genomic coord- transcriptome. Experimental datasets also exist for other
inates (Figure 2). A stable internal ID is also provided for proteins including IGF2BP2, IGF2BP3, IGF2P1, PTB, PUM2,
easy access and to enable cross-referencing between the QKI, TNRC6A, TNRC6B and TNRC6C, which have also been
different IDs regularly used by different sequence data- mapped to the lncRNA transcripts. Because the number of
bases. The consensus IDs forms the primary reference key experimental datasets for protein–RNA interactions is
within lncRNome and has also been used to reference al- scarce, we also incorporated a computational prediction
ternate transcript isoforms. Wherever appropriate, all method involving Support Vector Machine–based predic-
lncRNAs have also been linked back to relevant databases tion of residues in RNA, which could have probable propen-
such as Ensembl, HGNC and NCBI for quick cross-reference. sity to interact with proteins (Panwar and Raghava 2012,
The manual annotation of the functionally characterized unpublished results).
lncRNAs is provided, which includes information about the
disease associations, expression and functional significance. Genomic variations and conservation
The annotations are collected from literature surveys and Genome-wide association studies in the recent past have
manual curations. suggested disease associations, which could be modulated
by lncRNAs (43). In addition, a number of genomic loci pre-
Sequence and structure and motifs viously shown to be associated with diseases have now
The lncRNA sequences were downloaded from UCSC been indicated to fall within lncRNA gene loci. To facilitate
Genome Browser using genomic locations of individual further in-depth analysis and experimental validation of

.............................................................................................................................................................................................................................................................................................

Page 3 of 6
Original article Database, Vol. 2013, Article ID bat034, doi:10.1093/database/bat034
.............................................................................................................................................................................................................................................................................................

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


Figure 1. Distribution of Gencode release 12 lncRNAs according to different biotypes.

effect of variations on lncRNA, we have included a compre- Conclusions and future


hensive mapping of genomic variations in lncRNA loci. In
brief, the variations corresponding to dbSNP 135 were
perspectives
downloaded (44) and mapped to respective genomic loca- lncRNome is designed to primarily serve as an evidence-
tions of lncRNAs. In addition, disease associated variations based resource of lncRNAs and their functionality in
were derived from the NIH Catalog of published genome- humans. To this end, we have provided stable reference
wide association studies and mapped to respective rsIDs. IDs for lncRNA genes and alternate transcript isoforms of
The PhastCons conservation scores were downloaded from a gene with cross-references to other sequence and anno-
UCSC and the genomic loci were mapped to lncRNAs (45). tation databases to ensure interoperability and stable
referencing. The knowledge base integrates biologically
Epigenetic modifications oriented datasets and resources on lncRNA and manual an-
notations wherever applicable with the aim of providing a
A recent report from our group suggests that the pro-
one-stop solution for annotation information on lncRNAs.
moters of lncRNAs could be potentially regulated by mech-
The interface allows an easy access to various features of
anisms that are distinct from protein-coding genes
lncRNAs comprised within five categories and their suble-
supporting the role of lncRNAs in epigenetic regulation of
vels (Supplementary Figure S1). The category ‘General’ pro-
genes. To capture the epigenetic marks, in terms of DNA
vides all the basic annotations of each lncRNA including
methylation and histone marks, we have provided a com-
genomic loci, the associated diseases and various linkouts.
prehensive access to epigenetic marks in the promoters of
The sequences and the predicted structure of the lncRNA
lncRNAs. Briefly the raw datasets were downloaded from
are provided in the category ‘Sequence and Structure’. The
the NIH Human Epigenome Roadmap project and mapped
lncRNA structures are poorly understood and it becomes
and analyzed as described in Sati et al. (46). The epigenetic
indispensable to characterize the structures to elucidate
marks are also available for browsing through the genome
the structure–function relationships. Specific lncRNA struc-
browser. The datasets and genomic mappings are compiled
tures are essential for binding to proteins, RNA and other
in Table 1.
biomolecules, and to have a better mechanistic insight of
lncRNA function, elucidation of its structure becomes
Predicted peptides important. lncRNome provides information of various hair-
The open reading frames were predicted for all the pin and quadruplex motifs in lncRNAs found to be essen-
lncRNAs using the Sixpack (http://www.ebi.ac.uk/Tools/st/ tial for regulation of a lot of biological processes. Both
emboss_sixpack/) tool from EMBOSS. The tool translates experimental and prediction datasets on RNA–protein
the given sequence in six frames and peptides starting interactions have been provided for lncRNAs revealing var-
with Methionine and with length 10 amino acids. ious protein and RNA interacting partners of lncRNAs

.............................................................................................................................................................................................................................................................................................

Page 4 of 6
Database, Vol. 2013, Article ID bat034, doi:10.1093/database/bat034 Original article
.............................................................................................................................................................................................................................................................................................

availability of a large number of transcriptome sequencing


datasets in public domain, lncRNome in the present form
has many gaps. The primary gap being the paucity of infor-
mation on expression of lncRNAs in different tissues. With
the availability of genome-wide transcriptome annotation
of many tissues in the public domain, we would enrich the
database with this information. We intend to collaborate
with other international consortiums to enable cross-
linking and sharing of resources seamlessly. In future, we
envisage the database to be available as a community-
curated and semantically linked interoperable data
resource.

Supplementary Data
Supplementary data are available at Database Online.

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


Figure 2. Comparison of annotations between other data-
bases/datasets on long noncoding RNAs.
Acknowledgements
The authors acknowledge Dr V. Sabareesh, Dr Harleen Kaur
Table 1. Total fields in the database along with the genomic and Ms Ayesha Pasha for reviewing the manuscript. D.B.
loci mapped
and S.S. acknowledge a Senior Research fellowship from
Serial No. Database fields Total genomic CSIR, and S.K. acknowledges support through a Junior
loci mapped Research fellowship from CSIR. The authors acknowledge
the funding for this work from the Council of Scientific
1 Total lncRNAs 18 855
and Industrial Research (CSIR), India through the project
2 Hairpins 40
GENCODE-C (BSC00123).
3 Methylation and histone 11 790
modifications Conflict of interest. None declared.
4 miRNA binding sites 3716
5 Quadruplexes 937
6 Predicted protein-binding 6808 References
sites on lncRNA
1. Ozsolak,F. and Milos,P.M. (2011) RNA sequencing: advances, chal-
7 Small RNA clusters lenges and opportunities. Nat. Rev. Genet., 12, 87–98.
8 Single nucleotide polymorphisms 295 851 2. Pareek,C.S., Smoczynski,R. and Tretyn,A. (2011) Sequencing tech-
nologies and genome sequencing. J. Appl. Genet., 52, 413–435.
3. Zhang,J., Chiodini,R., Badr,A. et al. (2011) The impact of next-gen-
eration sequencing on genomics. J. Genet. Genomics, 38, 95–109.
(Supplementary Figure S1). Although the exact mechanism
4. Clark,M.B., Amaral,P.P., Schlesinger,F.J. et al. (2011) The reality of
how the lncRNA interacts with different partners is still not
pervasive transcription. PLoS Biol., 9, e1000625.
known, our data provide a startup point to the community
5. Ulveling,D., Francastel,C. and Hube,F. (2011) When one is better
to understand the various regulatory interactions of than two: RNA with dual functions. Biochimie., 93, 633–644.
lncRNAs with their respective partners. Genomic variations 6. Osman,A. (2012) MicroRNAs in health and disease–basic science
in lncRNAs have been studied to understand the effect of and clinical applications. Clin. Lab., 58, 393–402.
SNPs on biogenesis and functions of lncRNAs. The disease- 7. Li,X., Wu,Z., Fu,X. et al. (2012) Long noncodingRNAs: insights from
associated SNPs present in lncRNAs might provide informa- biological features and functions to diseases. Med. Res. Rev., 10,
tion about genotype to phenotype associations. The 517–553.

distribution of epigenetic marks like DNA methylation 8. Mercer,T.R., Dinger,M.E. and Mattick,J.S. (2009) Long non-coding
RNAs: insights into functions. Nat. Rev. Genet., 10, 155–159.
and histone modifications across transcription start site
9. Guttman,M., Amit,I., Garber,M. et al. (2009) Chromatin signature
(TSS) of lncRNAs might help in evaluating the effect of chro-
reveals over a thousand highly conserved large non-coding RNAs in
matin modifications on gene expression (Supplementary mammals. Nature, 458, 223–227.
Figure S1). 10. Pauli,A., Valen,E., Lin,M.F. et al. (2012) Systematic identification of
Because the field is emerging and many more lncRNAs long noncoding RNAs expressed during zebrafish embryogenesis.
are being discovered and annotated, thanks to the Genome Res., 22, 577–591.

.............................................................................................................................................................................................................................................................................................

Page 5 of 6
Original article Database, Vol. 2013, Article ID bat034, doi:10.1093/database/bat034
.............................................................................................................................................................................................................................................................................................

11. Ulitsky,I., Shkumatava,A., Jan,C.H. et al. (2011) Conserved function 29. Kin,T., Yamada,K., Terai,G. et al. (2007) fRNAdb: a platform for
of lincRNAs in vertebrate embryonic development despite rapid mining/annotating functional RNA candidates from non-coding
sequence evolution. Cell, 147, 1537–1550. RNA sequences. Nucleic Acids Res., 35(Database issue), D145–D148.
12. Brockdorff,N. (2011) Chromosome silencing mechanisms in 30. Wang,X. (2008) miRDB: a microRNA target prediction and func-
X-chromosome inactivation: unknown unknowns. Development, tional annotation database with a wiki interface. RNA, 14,
138, 5057–5065. 1012–1017.
13. Gutschner,T. and Diederichs,S. (2012) The Hallmarks of Cancer: 31. Ellis,J.C., Brown,D.D. and Brown,J.W. (2010) The small nucleolar
a long non-coding RNA point of view. RNA. Biol., 9, 703–719. ribonucleoprotein (snoRNP) database. RNA, 16, 664–666.
14. Derrien,T., Guigo,R. and Johnson,R. (2011) The long non-coding 32. Amaral,P.P., Clark,M.B., Gascoigne,D.K. et al. (2011) lncRNAdb: a
RNAs: a new (P)layer in the ‘Dark Matter’. Front Genet., 2, 107. reference database for long noncoding RNAs. Nucleic Acids Res.,
15. Wilusz,J.E., Sunwoo,H. and Spector,D.L. (2009) Long noncoding 39(Database issue), D146–D151.
RNAs: functional surprises from the RNA world. Genes Dev., 23, 33. Bu,D., Yu,K., Sun,S. et al. (2012) NONCODE v3.0: integrative anno-
1494–1504. tation of long noncoding RNAs. Nucleic Acids Res., 40(Database
16. Wang,K.C. and Chang,H.Y. (2011) Molecular mechanisms of long issue), D210–D215.
noncoding RNAs. Mol. Cell, 43, 904–914. 34. Harrow,J., Denoeud,F., Frankish,A. et al. (2006) GENCODE: produ-
17. Yan,B. and Wang,Z. (2012) Long Noncoding RNA: its physiological cing a reference annotation for ENCODE. Genome Biol., 7 (Suppl.
and pathological roles. DNA Cell Biol., 31 (Suppl. 1), S34–S41. 1), S4.1–S4.9.
18. Moran,V.A., Perera,R.J. and Khalil,A.M. (2012) Emerging functional 35. Povey,S., Lovering,R., Bruford,E. et al. (2001) The HUGO gene no-
and mechanistic paradigms of mammalian long non-coding RNAs. menclature committee (HGNC). Hum. Genet., 109, 678–680.

Downloaded from http://database.oxfordjournals.org/ by guest on February 6, 2014


Nucleic Acids Res., 40, 6391–6400. 36. Kent,W.J., Sugnet,C.W., Furey,T.S. et al. (2002) The human genome
19. Harries,L.W. (2012) Long non-coding RNAs and human disease. browser at UCSC. Genome Res., 12, 996–1006.
Biochem. Soc. Trans., 40, 902–906. 37. Scaria,V., Hariharan,M., Arora,A. et al. (2006) Quadfinder: server
20. Tani,H., Nakamura,Y., Ijiri,K. et al. (2010) Stability of MALAT-1, a for identification and analysis of quadruplex-forming motifs in nu-
nuclear long non-coding RNA in mammalian cells, varies in various cleotide sequences. Nucleic Acids Res., 34, W683–W685.
cancer cells. Drug Discov. Ther., 4, 235–239. 38. Jayaraj,G.G., Pandey,S., Scaria,V. et al. (2012) Potential G-quadru-
21. Xu,C., Yang,M., Tian,J. et al. (2011) MALAT-1: a long non-coding plexes in the human long non-coding transcriptome. RNA. Biol., 9,
RNA and its important 3’ end functional motif in colorectal cancer 81–86.
metastasis. Int. J. Oncol., 39, 169–175. 39. Ascano,M., Hafner,M., Cekan,P. et al. (2012) Identification of RNA-
22. Kim,Y.S., Hwan,J.D., Bae,S. et al. (2010) Identification of differen- protein interaction networks using PAR-CLIP. Wiley Interdiscip. Rev.
tially expressed genes using an annealing control primer system in RNA, 3, 159–177.
stage III serous ovarian carcinoma. BMC Cancer, 10, 576. 40. Jalali,S., Jayaraj,G.G. and Scaria,V. (2012) Integrative transcriptome
23. Congrains,A., Kamide,K., Katsuya,T. et al. (2012) CVD-associated analysis suggest processing of a subset of long non-coding RNAs to
non-coding RNA, ANRIL, modulates expression of atherogenic small RNAs. Biol. Direct, 7, 25.
pathways in VSMC. Biochem. Biophys. Res. Commun., 419, 612–616. 41. Yang,J.H. and Qu,L.H. (2012) DeepBase: annotation and discovery
24. Holdt,L.M. and Teupser,D. (2012) Recent studies of the human of microRNAs and other noncoding RNAs from deep-sequencing
chromosome 9p21 locus, which is associated with atherosclerosis data. Methods Mol. Biol., 822, 233–248.
in human populations. Arterioscler. Thromb. Vasc. Biol., 32, 42. Konig,J., Zarnack,K., Luscombe,N.M. et al. (2012) Protein-RNA inter-
196–206. actions: new genomic technologies and perspectives. Nat. Rev.
25. Pasmant,E., Sabbagh,A., Vidaud,M. et al. (2011) ANRIL, a long, non- Genet., 13, 77–83.
coding RNA, is an unexpected major hotspot in GWAS. FASEB J., 43. Jin,G., Sun,J., Isaacs,S.D. et al. (2011) Human polymorphisms at long
25, 444–448. non-coding RNAs (lncRNAs) and association with prostate cancer
26. Bhartiya,D., Kapoor,S., Jalali,S. et al. (2012) Conceptual approaches risk. Carcinogenesis, 32, 1655–1659.
for lncRNA drug discovery and future strategies. Expert. Opin. Drug 44. Sherry,S.T., Ward,M.H., Kholodov,M. et al. (2001) dbSNP: the NCBI
Discov., 7, 503–513. database of genetic variation. Nucleic Acids Res., 29, 308–311.
27. Griffiths-Jones,S., Grocock,R.J., van Dongen,S. et al. (2006) miRBase: 45. Siepel,A., Bejerano,G., Pedersen,J.S. et al. (2005) Evolutionarily con-
microRNA sequences, targets and gene nomenclature. Nucleic Acids served elements in vertebrate, insect, worm, and yeast genomes.
Res., 34(Database issue), D140–D144. Genome Res., 15, 1034–1050.
28. Hsu,S.D., Chu,C.H., Tsou,A.P. et al. (2008) miRNAMap 2.0: genomic 46. Sati,S., Ghosh,S., Jain,V. et al. (2012) Genome-wide analysis reveals
maps of microRNAs in metazoan genomes. Nucleic Acids Res., distinct patterns of epigenetic features in long non-coding RNA
36(Database issue), D165–D169. loci. Nucleic Acids Res., 40, 10018–10031.
.............................................................................................................................................................................................................................................................................................

.............................................................................................................................................................................................................................................................................................

Page 6 of 6
Copyright of Database: The Journal of Biological Databases & Curation is the property of
Oxford University Press / USA and its content may not be copied or emailed to multiple sites
or posted to a listserv without the copyright holder's express written permission. However,
users may print, download, or email articles for individual use.

You might also like