Dna Doc Karthik
Dna Doc Karthik
On
DNA COMPUTING
Submitted by
SELAVADI KARTHIK
(189L1A0593)
BACHELOR OF TECHNOLOGY
in
Computer Science and Engineering
CERTIFICATE
This is to certify that the project entitled “DNA COMPUTING” is the bonafide
All endeavors over a long period can be successful only with the advice
and support of many well-wishers. I take this opportunity to express my
gratitude and appreciation to all of them.
SELAVADI KARTHIK
(189L1A0523)
SIDDARTHA EDUCATIONAL ACADEMY GROUP OF INSTITUTIONS
DECLARATION
I hereby declare that the project work entitled “DNA COMPUTING”
SELAVADI KARTHIK
(189L1A0523)
ABSTRACT
In this paper I intend to present the computing technology that has a great future - DNA
COMPUTING. DNA computing can be viewed as a manifestation of an emerging new area
of science made possible by our rapidly developing ability to control the molecular world.
DNA computing is in its infancy and its implications are only beginning to be explored. The
paper begins with a brief description of DNA and its structure. An introduction to DNA
COMPUTING and its origin have been given. Adleman experiment has been discussed,
which gives solution to the “HAMILTONIAN PATH PROBLEM” by the application of
DNA COMPUTING. The salient features of DNA Computer (one that uses DNA
computing as its basic method of problem solving) have been mentioned. An insight into
the advantages, disadvantages, applications and limitations of DNA-computing has been
made. Finally, the paper discusses the various stages in its path of development at present
and the expectations in the near future
CONTENTS
Page No:
1 INTRODUCTION.........…...........................................1 – 2
3. DNA FUNDAMENTALS............................................5 – 10
3.1 What is DNA?
3.2 Structure of DNA
4. PRINCIPLES OF DNA COMPUTING.....................11 – 13
1. INTRODUCTION
The twentieth century will be remembered for three major
achievements – The evolution of computers, decoding of the human
genome and evolution from Newtonian physics to quantum physics.
Since the beginning of time man has performed computations or
calculation. The method and nature of these computations has
however changed from manual in the stone ages to mechanical in the
medieval ages to electronic in the new computer age. Computers, by
definition, are machines which receive input, manipulate and store the
input, and produce an output. They've quickly grown in the size and
processing power. Computers are commonly known to consist of
integrated circuits mainly constructed of silicon; however, a computer
is never considered to be "alive". What is going to be the future of
computing systems? Can we look beyond silicon to embrace other
mediums for computing? Computers inspired by biological or
physical systems are possible alternatives. Microprocessors made of
silicon will eventually reach their limits of speed and miniaturization.
Chip makers need a new material to produce faster computing speeds.
You won't believe where scientists have found the new material they
need to build the next generation of microprocessors. Millions of
natural supercomputers exist inside living organisms, including your
body. DNA (Deoxyribo Nucleic Acid) molecules, the material our
genes are made of, have the potential to perform calculations many
times faster than the world's most powerful human-built computers.
Technological advances however could use these building blocks of
our genome in creating computer processors and data storage, and
catapult processing speeds to incomprehensible levels not possible by
today's standards. A DNA-based computer has solved a logic problem
that no person could complete by hand, setting a new milestone for
this infant technology that could someday surpass the electronic
digital computer in certain areas. DNA might one day be integrated
into a computer chip to create a so-called Biochip that will push
There are two reasons for using molecular biology to solve computational problems. (i) The
information density of DNA is much greater than that of silicon:
1 bit can be stored in approximately one cubic nanometer. Others storage media, such as
videotapes, can store 1 bit in 1,000,000,000,000 cubic nanometer.
(ii) Operations on DNA are massively parallel: a test tube of DNA can contain trillions of
strands. Each operation on a test tube of DNA is carried out on all strands in the tube in
parallel.
3. DNA FUNDAMENTALS
DNA nucleotides differ only by their bases (B). There are two classes of nitrogen bases
called purines (double-ringed structures) and pyrimidines (single-ringed structures). The
four bases in DNA's alphabet are: purines pyrimidines Adenine A Thymine T Guanine G
Cytosine C
Linking of Nucleotides
The DNA monomers can link in two ways: Phosphodiester bond and Hydrogen bond.
Phosphodiester Bond: The 5´-phosphate group of one nucleotide is joined with the 3´-
hydroxyl group of the other - strong (covalent) bond - directionality: 5´—3´ or 3´—5´
Hydrogen Bond: The base of one nucleotide interacts with the base of another - base
pairing (weak bond)
A hydrogen bond is a weak chemical bond that occurs between hydrogen atoms and more
electronegative atoms, like oxygen, nitrogen and fluorine. The participating atoms can be
located on the same molecule (adjacent nucleotides) or on different molecules (adjacent
nucleotides on different DNA strands). Hydrogen bonds do not involve the exchange or
sharing of electrons like covalent and ionic bonds. The weak attraction is like that between
the opposite poles of a magnet. Hydrogen bonds occur over short distances and can be
easily formed and broken. They can also stabilize a molecule.
Strands of DNA are made of the sugar and phosphate portions of the nucleotides, while the
middle parts are made of the nitrogenous bases.
The nitrogenous bases on the two strands of DNA pair up, purine with pyrimidine (A with
T, G with C), and are held together by weak hydrogen bonds. A — T (2 hydrogen bonds) C
— G (3 hydrogen bonds)
James D. Watson and Francis H. C. Crick deduced double-helix structure of DNA in 1953
and got Nobel Prize in 1962. Adenine and thymine always bond together as a pair, and
cytosine and guanine bond together as a pair. The pairs link together like rungs in a ladder:
Watson and Crick discovered that DNA had two sides, or strands, and that these strands
were twisted together like a twisted ladder -- the double helix. The sides of the ladder
comprise the sugar-phosphate portions of adjacent nucleotides bonded together. The
phosphate of one nucleotide is covalently bound (a bond in which one or more pairs of
electrons are shared by two atoms) to the sugar of the next nucleotide. The hydrogen bonds
between phosphates cause the DNA strand to twist. The nitrogenous bases point inward on
the ladder and form pairs with bases on the other side, like rungs. Each base pair is formed
from two complementary nucleotides (purine with pyrimidine) bound together by hydrogen
bonds. The base pairs in DNA are adenine with thymine and cytosine with guanine.
• Solenoid Supercoil
• Chromatin
• Chromosomes
1. Double Helix — the DNA in a single cell contains 2.9 x 109 base pairs and would be a
meter long. 2. Histones / Nucleosomes — DNA is wound around a histone protein core to
form a nucleosome. This gives a 5- to 9- fold reduction in length.
3. Solenoids — Nucleosomes (beads on a string) supercoil and form solenoid structures. 4-
6-fold reduction in length.
4. Chromatin — Solenoid turns loop around a protein-RNA scaffold to form Minibands.
18-fold reduction in length.
5. Chromosomes — Minibands further condense to form Chromosomes, the form of DNA
as seen during cell division and genetics studies.
The amount of information gathered on the molecular biology of DNA over the last 40
years is almost overwhelming in scope. So instead of getting bogged down in biochemical
and biological details of DNA, we'll concentrate on only the information relevant to DNA
computing.
The data density of DNA is impressive. Just like a string of binary data is encoded with
ones and zeros, a strand of DNA is encoded with four bases, represented by the letters A, T,
C, and G. The bases (also known as nucleotides) are spaced every 0.35 nanometers along
the DNA molecule, giving DNA a remarkable data density of nearly 18 Mbits per inch. In
two dimensions, if you assume one base per square nanometer, the data density is over one
million Gbits per square inch. Compare this to the data density of a typical high
performance hard drive, which is about 7 Gbits per square inch -- a factor of over 100,000
smaller.
Another important property of DNA is its double stranded nature. The bases A and T, and
C and G, can bind together, forming base pairs. Therefore every DNA sequence has a
natural complement. For example if sequence S is ATTACGTCG, its complement, S', is
TAATGCAGC. Both S and S' will come together (or hybridize) to form double stranded
DNA. This complementarities makes DNA a unique data structure for computation and can
be exploited in many ways. Error correction is one example. Errors in DNA happen due to
many factors. Occasionally, DNA enzymes simply make mistakes, cutting where they
shouldn't, or inserting a T for a G. DNA can also be damaged by thermal energy and UV
energy from the sun. If the error occurs in one of the strands of double stranded DNA,
repair enzymes can restore the proper DNA sequence by using the complement strand as a
reference.
In this sense, double stranded DNA is similar to a RAID 1 array, where data is mirrored on
two drives, allowing data to be recovered from the second drive if errors occur on the first.
In biological systems, this facility for error correction means that the error rate can be quite
low. For example, in DNA replication, there is one error for every 10^9 copied bases or in
other words an error rate of 10^-9. (In comparison, hard drives have read error rates of only
10^-13 for Reed-Solomon correction)
Operations in parallel
In the cell, DNA is modified biochemically by a variety of enzymes, which are tiny protein
machines that read and process DNA according to nature's design. There is a wide variety
and number of these "operational" proteins, which manipulate DNA on the molecular level.
For example, there are enzymes that cut DNA and enzymes that paste it back together.
Other enzymes function as copiers, and others as repair units. Molecular biology,
Biochemistry, and Biotechnology have developed techniques that allow us to perform many
of these cellular functions in the test tube. It's this cellular machinery, along with some
synthetic chemistry, that makes up the palette of operations available for computation.
Just like a CPU has a basic suite of operations like addition, bit-shifting, logical operators
(AND, OR, NOT NOR), etc. that allow it to perform even the most complex calculations,
DNA has cutting, copying, pasting, repairing, and many others. And note that in the test
tube, enzymes do not function sequentially, working on one DNA at a time. Rather, many
copies of the enzyme can work on many DNA molecules simultaneously. This is the power
of DNA computing, that it can work in a massively parallel fashion.
DNA as Software:
Think of DNA as software, and enzymes as hardware. Put them together in a test tube. The
way in which these molecules undergo chemical reactions with each other allows simple
operations to be performed as a by-product of the reactions. The scientists tell the devices
what to do by controlling the composition of the DNA software molecules. It's a completely
different approach to pushing electrons around a dry circuit in a conventional computer.
To the naked eye, the DNA computer looks like clear water solution in a test tube. There is
no mechanical device. A trillion bio-molecular devices could fit into a single drop of water.
Instead of showing up on a computer screen, results are analyzed using a technique that
allows scientists to see the length of the DNA output molecule. "Once the input, software,
and hardware molecules are mixed in a solution it operates to completion without
intervention," said David Hawksett, the science judge at Guinness World Records. "If you
want to present the output to the naked eye, human manipulation is needed".
A single strand of DNA is similar to a string consisting of a combination of four different
symbols A G C T. Mathematically this means we have at our disposal a letter alphabet, Σ =
{A GC T} to encode information which is more than enough considering that an electronic
computer needs only two digits and for the same purpose. In a DNA computer, computation
takes place in test tubes or on a glass slide coated in 24K gold. The input and output are
both strands of DNA, whose genetic sequences encode certain information. A program on a
DNA computer is executed as a series of biochemical operations, which have the effect of
synthesizing, extracting, modifying and cloning the DNA strands.
A DNA computer, as the name implies, uses DNA strands to store information and taps the
recombinative properties of DNA to perform operations. A small test tube of DNA strands
suspended in a solution could yield millions to billions of simultaneous interactions at
speeds — in theory — faster than today's fastest supercomputers.
DNA computer uses the recombinative property of dna to perform operations.The main
benefit of using DNA computers to solve complex problems is that different possible
solutions are created all at once. This is known as parallel processing. Humans and most
electronic computers attempt to solve the problem one process at a time (linear processing).
DNA itself provides the added benefits of being a cheap, energy-efficient resource.
In a different perspective, more than 10 trillion DNA molecules can fit into an area no
larger than 1 cubic centimeter. With this, a DNA computer could hold 10 terabytes of data
and perform 10 trillion calculations at a time.
In a traditional computer, data are represented by and stored as strings of zeros and ones.
With a DNA computer, a sequence of its four basic nucleotides — adenine, cytosine,
guanine, and thymine — is used to represent and store data on a strand of DNA.
Calculations in a traditional computer are performed by moving data into a processing unit
where binary operations are performed. Essentially, the operations turn miniaturized circuits
off or on corresponding to the zeros and ones that represent the string of data.
As concerning the operations that can be performed on DNA strands the proposed models
of DNA computation are based on various combinations of the following primitive bio-
operations:
synthesizing a desired polynomial-length strand used in all models.
Amplifying (copying): make copies of DNA strands by using the Polymerase Chain
Reaction PCR. The DNA polymerase enzymes perform several functions including
replication of DNA. The replication reaction requires a guiding DNA single-strand called
template, and a shorter oligonucleotide called a primer, that is annealed to it.
Separating the strands by length using a technique called gel electrophoresis that
makes possible the separation of strands by length
Extracting those strands that contain a given pattern as a substring by using affinity
purification.
contains the restriction site within its sequence is cut by the enzyme at that location.
Ligating: paste DNA strands with compatible sticky ends by using DNA ligases.
Indeed, another enzyme called DNA ligase, will bond together, or ``ligate'', the end
of a DNA strand to another strand.
Substituting: substitute, insert or delete DNA sequences by using PCR site-specific
oligonucleotide mutagenesis.
Fig: Substitution
Adleman´s Algorithm:
Input: A directed graph G with n vertices, and designated vertices vin and vout.
To implement step 1, each node of the graph was encoded as a random 20-base strand of
DNA. Then, for each edge of the graph, a different 20- base oligonucleotide was generated
that contains the second half of the source code plus the first half of the target node
To implement step 2, the product of step 1 was amplified by PCR using oligonucleotide
primers representing vin and vout and ligase enzyme. This amplified and thus retained only
those molecules encoding paths that begin with vin and end with vout. ~1014 computations
are carried out in a single second.
For implementing step 3, agarose gel electrophoresis allowed separation and recovery of
DNA strands of the correct length. The desired path, if it exists, would pass through all
seven nodes, each of which was assigned a length of 20 bases. Thus PCR products encoding
the desired path would have to be 140 bp.
Step 4 was accomplished by successive use of affinity purification for each node other than
the start and end nodes.
The solution strand has to be filtered from the test tube:
GCAG TCGG ACTG GGCT ATGT CCGA
Thus we see in a graph with n vertices, there are a possible (n-1)! permutations of the
vertices between beginning and ending vertex. To explore each permutation, a traditional
computer must perform O(n!) operations to explore all possible cycles. However, the DNA
computing model only requires the representative oligos. Once placed in solution, those
oligos will anneal in parallel, providing all possible paths in the graph at roughly the same
time. That is equivalent to O(1) operations, or constant time. In addition, no more space
than what was originally provided is needed to contain the constructed paths.
Silicon microprocessors have been the heart of the computing world for more than 40 years.
In that time, manufacturers have crammed more and more electronic devices onto their
microprocessors. In accordance with Moore's Law, the number of electronic devices put on
a microprocessor has doubled every 18 months. Moore's Law is named after Intel founder
Gordon Moore, who predicted in 1965 that microprocessors would double in complexity
every two years. Many have predicted that Moore's Law will soon reach its end, because of
the physical speed and miniaturization limitations of silicon microprocessors.
Surpassing Silicon?
DNA's key advantage is that it will make computers smaller than any computer that has
come before them, while at the same time holding more data. One pound of DNA has the
capacity to store more information than all the electronic computers ever built; and the
computing power of a teardrop-sized DNA computer, using the DNA logic gates, will be
more powerful than the world's most powerful supercomputer. More than 10 trillion DNA
molecules can fit into an area no larger than 1 cubic centimeter (0.06 cubic inches). With
this small amount of DNA, a computer would be able to hold 10 terabytes of data, and
perform 10 trillion calculations at a time. By adding more DNA, more calculations could be
performed. Unlike conventional computers, DNA computers perform calculations parallel
to other calculations. Conventional computers operate linearly, taking on tasks one at a
time. It is parallel computing that allows DNA to solve complex mathematical problems in
hours, whereas it might take electrical computers hundreds of years to complete them. The
first DNA computers are unlikely to feature word processing, e-mailing and solitaire
programs. Instead, their powerful computing power will be used by national governments
for cracking secret codes, or by airlines wanting to map more efficient routes. Studying
DNA computers may also lead us to a better understanding of a more complex computer --
the human brain. DNA computers will be capable of storing billions of times more data
(say, at a density of 1 bit per cubic nanometer -- a trillion times less space) than your
personal computer. The DNA computer has very low energy consumption, so if it is put
inside the cell it would not require much energy to work. Using DNA logic gates the DNA
computers will be more powerful than the world's most powerful supercomputer. DNA
computers perform calculations parallel to other calculations. Speed: Combining DNA
strands as demonstrated by Dr Adleman, made computations equivalent to 10^9 or
better,arguably over 100 times faster than fastest computer. Minimal storage requirements:
DNA stores memory at a density of about one bit per cubic nanometer where conventional
storage media requires 10^12 cubic nanometers to storage one bit. Minimal power
requirements: No power is required for DNA computing while computation is taking place.
The chemical bonds that are the building blocks of DNA happen without any outside power
source.
Memory Capacity Ultra-High (one bit per cubic High(10^12 cubic nanometers
nanometer) to store one bit)
Data Density One million Gbits per square 7 Gbits per square inch
inch
Computational Power More powerful than any Less powerful
supercomputer
Computer Size Smaller than any computer Larger size
power requirements Not required Required
Cost Cheaper Expensive
9.ADVANTAGES
10. DISADVANTAGES
Generating solution sets, even for some relatively simple problems, may require
impractically large amounts of memory (lots and lots of DNA strands are required)
Many empirical uncertainties, including those involving: actual error rates, the
generation of optimal encoding techniques, and the ability to perform necessary bio-
operations conveniently in vitro (for every correct answer there are millions of
incorrect paths generated that are worthless).
DNA computers could not (at this point) replace traditional computers.
They are not programmable and the average dunce cannot sit down at a familiar
keyboard and get to work.
It requires human assistance.
12. CONCLUSION
Before you trash your silicon-based computer and start trying to process words with DNA,
remember that it'll be a while before the wet computers show up in showrooms. DNA
computers can't be found at your local electronics store yet. The technology is still in
development. Biomolecular computers, made of DNA and other biological molecules, only
exist today in a few specialized labs, remote from the regular computer user. DNA
computer components -- logic gates and biochips -- will take years to develop into a
practical, workable DNA computer. If such a computer is ever built, scientists say that it
will be more compact, accurate and efficient than conventional computers. The current
applications of DNA chips are restricted to the field of medicine. Affymetrix Inc pioneered
the research in the field of DNA medicine. However now many companies such as
Motorola and Corning and the Hewlett-Packard spinoff Agilent Technologies have joined
this rapidly growing technology. Each of these challengers is applying its industrial
expertise to making its own DNA microarrays or chips. DNA chips or arrays have been
used to solve many problems in the field of medicine. These DNA computers can be used in
fluids, such as a sample of blood or in the body, and make decisions at the level of a single
cell. DNA computers could conceivably be implanted in the body to both diagnose and kill
cancer cells or monitor and treat diabetes by dispensing insulin when needed. The first DNA
computers are unlikely to feature word processing, e-mailing and solitaire programs.
Instead, their powerful computing power will be used by national governments for cracking
secret codes, or by airlines wanting to map more efficient routes. Studying DNA computers
may also lead us to a better understanding of a more complex computer -- the human brain.
In the future, some speculate, there may be hybrid machines that use traditional silicon for
normal processing tasks but have DNA co-processors that can take over specific tasks they
would be more suitable for.
13. REFERENCES
[1]. Paun, G., Rozenberg, G., and Salomaa, A., DNA Computing, Springer,1998.
DNA COMPUTING-GRAPH ALGORITHMS [lec-12.pdf] G. P. Raja Sekhar, Dept. of
Mathematics, IIT, Kharagpur
[2]. Leonard M. Adleman, Computing with DNA, Scientific American, August 1998.
From Microsoft to Biosoft Computing with DNA, Lila Kari, Department of Computer
Science University of Western Ontario
[3]. L.Adleman. On constructing a molecular computer. 1st DIMACS workshop on DNA
based computers, Princeton, 1995. In DIMACS series, vol.27 (1996)
[4]. L.Adleman, P.Rothemund, S.Roweis, E.Winfree. On applying molecular computation to
the Data Encryption Standard. 2nd DIMACS workshop on DNA based computers,
Princeton, 1996
WEBSITES:
[1]. http://www.howstuffworks.com/ Structure of DNA
[2]. http://www.rsa.com/ What is DNA computing?
[3]. http://www.features.techworld.com/ Computational nanotechnology
/ Nanobiotechnology
[4]. http://www.networkworld.com/ Are DNA Computer Chips a Reality?
[5]. http://www.ezinearticles.com/ Latest Paper presentation on DNA computing
[6]. http://www.nature.com/Nature Nanotechnology/Nanocomputing/
[7].http://www.latestinfo.com/ Dna-computing.html.