The development of multiple disciplines such as retrosynthetic chemical analysis, microfluidics, whole genome engineering, and protein and enzyme design are reaching the stage of development where the design and production of assembly lines for nanoscale parts seems conceivable. This paper discusses the development of these disciplines and how their convergence may be applied to accelerate the production of nanoscale parts. Utilizing existing and near-term technologies, a strategy for the production of a single assembly line for manufacturing a currently designed nanoscale machine part, containing ~2600 atoms, is estimated at ~$6 million. The developments necessary over the next decade to produce the nanopart assembly lines for the parts in programmable nanoassembly system, containing 6-8 million atoms, for an estimated $200,000 are examined. Continued developments suggest that within 20 years the design of assembly lines for the parts required for manufacturing nanorobots containing billions of atoms for less than $1 million is feasible.
A critical feature a PNA must possess to precisely assemble atoms or small molecules is stiffness -- "In designing machinery for molecular manufacturing, stiffness is a central concern." [Dre92, pg 448]. Unfortunately, the paths from wet (generally not-stiff) solution-phase organic chemistry and/or biotechnology to the dry (stiff) mechanosynthetic chemistry envisioned by Drexler & Merkle are not particularly clear to most observers. Drexler [Dre92, pg 471] provided an outline of a 4 stage development path for increasingly stiff assemblers but it appears, to the author, that there has been little progress along that path to date. As pointed out by Merkle [Mer99b], one of the stumbling blocks is that the complex molecules that we can currently manufacture, DNA, RNA & proteins, are polymers of subunits joined to each other at only two points. To get the structural stiffness in the parts required to build a PNA we will most likely want to be able to position and strongly bond atoms, or larger molecular groups, in 3 dimensions1. Do we know of any ways to do this? Yes. As pointed out by Drexler [Dre81], protein engineering may provide the path to the "second generation of molecular machinery whose components would not be coiled hydrated polypeptide chains but compact structures having three-dimensional covalent bonding". Subsequently he suggested [Dre94], the simultaneous use of multiple stabilization techniques in "designer" proteins, including chemical cross-linking, could produce molecular parts of intermediate stiffness that would move us from Stage 1 to Stage 2 assemblers. Methods for stiffening protein derived nanoparts were also discussed in [Dre92, Sect. 15.3]. A review of the progress made during the 90's [Fre99, Chap. 2], shows that for the most part stiff molecules which we can synthesize, such as pagodane or calixarene, are still at a very primitive level (a few dozen atoms). The largest stiff molecules that we can assemble are buckytubes, but we are using bulk manufacturing processes, not directed molecular assembly, to do this. Scientists are actively engineering proteins, particularly enzymes, but this is typically done for industrial biocatalysis applications [e.g. Cha98], not for the assembly of the parts needed for a PNA. Zyvex is pursuing the development of PNA nanoparts via the dry mechanosynthetic path based on atomic force microscopes but this is generally acknowledged to be a very difficult approach. In 1998, Freitas described a novel path to first generation molecular assemblers [Fre98] and extended this by proposing mechanoenzymes might provide the means of implementing this path [Fre00a]. This paper significantly extends these ideas by showing that small molecule and enzyme design and the bioengineering of microorganisms have advanced sufficiently that we may envision combining small "designer" molecules with larger "designer" enzymes to provide a clear path for the manufacture of PNA parts and their assembly.
One of the difficulties of developing realistic strategies to build
a PNA is the lack of a widespread realistic comprehension of the size scale
of the problem. As Figure 1 documents, a PNA
is significantly larger than the biological assembler, the ribosome.
PNAs have been estimated to require 3-4,000,000 atoms. It has been
estimated a functional PNA with support structures and interfaces to the
external systems providing it with power and instructions could require
~8,000,000 atoms [Fre99, pg 65]2.
An example of a part that could be used as a small subcomponent in a PNA
is a fine-motion controller
with ~2600 atoms, weighing ~30,000 Daltons, approximately the same mass
as an average cellular protein. Ribosomes, the biological assemblers
of proteins, have masses ranging from ~2,700,000 in prokaryotes to ~4,200,000
in eukaryotes [EMB94]. This would equate
to ~200,000 to ~300,000 atoms. Because the biological assembler was
developed with virtually no intelligence or sophisticated computer-aided
design tools we may suppose that the design of a PNA is not beyond the
grasp of the combined capabilities of nanoengineers and software assistants.
The fact that single individuals can design nanoparts, such as the fine-motion
controller, with 1/1000th - 1/1600th of the atoms of a PNA, in a few
person-months [Dre01], suggests that ~150-260
person-years would be required to the design the parts contained in a PNA.
But because each PNA contains many identical or highly similar parts (e.g.
toroidal worm drives, intersegment bearings, drive gears and shafts) that
only need to be designed once, suggests that its design time requirements
may reduced by 5-10×. Thus, teams of 20-50 design engineers
could design one assembly of parts as complex as a PNA each year (using
only the software tools and computing power available in the early 1990s(!)).
One may expect that computer-aided-nanopart-design, or evolved and selected
designs [see Nano@Home],
would reduce required amount of human thought time -- the expensive resource
-- still further. Arguments along these lines lead to the conclusion
that the "design" of PNAs containing millions of atoms, and eventually
nanobots containing billions of atoms, is potentially within our grasp.
![]() Image of Drexler's Nanoassembler arm in relative size with (a) the Ribosome [the biological assembler] and (b) the Fine-motion controller [a small subset of the components that would be required for a full nanoassembly capabilities]. |
So if you can design nanoparts and assemblies of nanoparts -- can you construct them? Maybe. The first problem to be solved is, "How does one assemble the complex parts such as the fine-motion controller?". Solution-based self-assembly, used by multi-component cellular complexes such as the ribosome or the proteosome, seems difficult because you not only have to design the entire part and its subcomponents, but you have to design it to put itself together. That additional complexity makes the problem much harder. And it doesn't solve the problem that we really need a large number of covalent bonds in the assembled part. So we have to be more clever. To bootstrap the assembly of nanoparts we may have to do precisely what nature does -- build tools that assist in the creation of complex covalently bonded structures. There are two possible paths for this we may consider.
One path would require that chemists prepare molecules that would react and bond as desired if their positions and orientations are precisely controlled (e.g. controlled position based assembly). Another path would require that enzymologists design enzymes that can grip molecules, prepare them for the desired reactions and approximately position them such that they have reasonably high probabilities of reacting in the desired fashion (enzymatic assisted assembly). So there may be a tradeoff in the positional accuracy requirements vs. the functional complexity requirements in the design of tools to assemble nanoparts.
Now it becomes appropriate to ask how wide is the divide between where we are and where we would like to be.
![]() |
![]() |
|
|
|
![]() |
|
|
|
|
One point of interest regarding rotaxane and catenane is that they involve threading one molecule through another molecule! This form of complex chemical synthesis is very difficult to accomplish using solution chemistry where thermally induced motion causes the molecules to assume random orientations with regard to one another. How can we solve this problem? Chemists utilize tricks like adding hydrophobic rings to the molecules which will be attracted to each other in a polar solution like water. Another possible solution is to utilize molecular "grippers" engineered into proteins to "catch" the precursors of the final molecules in the solution and then bring them into correct orientation for assembly into the final part. Merkle [Mer99] suggested that the positional control of the building blocks would allow the extension of the normal reactions found in biological systems to include free-radical chemistry. It is becoming clearer that natural enzymes make use of these processes [Hol00]. Thus one can imagine a multi-functional protein being engineered with the following properties:
An alternative, suggested by Rafal Smigrodzki, would be to engineer gripper molecules, such as proteins, which are designed to self-assemble in a precise way. This behavior is similar to the assembly of multimeric enzyme complexes or the assembly of viral capsids. The nanopart subcomponents would be positioned at precise locations on these molecules most likely through designing small pockets or grabbing moeities that precisely hold and orient the subcomponent. The self-assembly properties of the gripper molecules, would precisely position the subcomponents with respect to each other allowing them to bond into a larger nanopart subcomponent. Heating the multi-gripper assembled-subcomponent complex would result in its disassembly freeing the assembeld-subcomponent nanopart. Initial steps towards designed self-assembling proteins have already been demonstrated [Pad01].
What is the range of capabilities that enzymes possess? Well the simplest capability may be that of adding single atom or a small group of atoms (e.g. hydrogenases (EC 1.12.-.-, oxygenases (EC 1.13.-.- ), methyltransferases (EC 2.1.1.- ), phosphotransferases (EC 2.7.-.-) and sulfurtransferases (EC 2.8.1.-)) while more complex enzymes connect larger atomic assemblies (e.g. nucleotidyltransferases (EC 2.7.7.- ) and ligases (EC 6.-.-.-)). Enzymes are known that mediate the transfer of atoms ranging from hydrogen (MW: 1 Dalton (D); e.g. hydrogen dehydrogenase from Rhodococcus opacus) to tungsten (MW: 183D, e.g. aldehyde ferredoxin oxidoreductase from Pyrococcus furiosus)3. The most frequently manipulated atoms range from carbon (MW: 12D) to zinc (MW: 65D). More complex molecular groups are also manipulated by enzymes, including amino acids (MW: 89-204D), and DNA and RNA bases (MW: 111-151D). Most enzymes in cells are involved in manipulating small atoms and molecules, MW < ~250D. There are enzymes however that are involved in manufacturing complex covalently bound molecules, such as vitamins, enzyme cofactors, antibiotics, toxins, etc. These are discussed further in Appendix A. These molecules may have masses up to several thousand Daltons. Molecules larger than this, such as the proteins themselves, messenger RNA and DNA are manipulated by complex machinery utilizing multiple proteins and other cofactors such as transfer RNA. These include the spliceosome, the ribosome, the proteosome and the DNA replication complex.
Natural proteins are built from only 20 amino acids. But these are a small subset of the amino acids that are known. Various chemical suppliers sell dozens of synthetic amino acids and variants [Bio01, Chi01, Pre99, Syn01]. The incorporation of artificial amino acids into proteins is reviewed in [Dem98] and [And99] discusses the construction or protein helices using mixtures of natural and artificial amino acids. From our knowledge of the 3-D structure required to "grab" specific molecules or catalyze reactions, we can envision designing and manufacturing synthetic amino acids with the "gripping" or "reacting" capabilities similar to those found in natural enzymes. These "grips" may be used to hold onto single atoms, small molecules like nitrogen (N2) or methane (CH4) or even more complex molecules like amino acids, DNA bases, etc. The flexible amino acid backbone of proteins may then be envisioned as a robotic arm, except it has more degrees of freedom than commonly manufactured robotic arms and its size is measured in nanometers. By designing synthetic enzymes consisting of synthetic amino acids, we can envision grabbing molecular parts in a solution and then, as the enzyme folds, bring them into proper alignment and cause them to react. This is Protein-Directed Part Assembly. Proteins often use metal ions such as Zn++, Ca++ or Mg++ to bring together different parts of the protein. Proteins can be designed such that they may fold differently when a succession of metal ions is made available. This could be considered Ion-Directed Part Assembly. If the proteins were in microfluidic devices [Des97, Bok98], the ions could be pumped into the chambers where the proteins were located. If the proteins were within microorganisms, various external signaling molecules in the solution could activate pumps built into the cell membrane that pumped specific ions in or out of the cell.
Having the flexible architecture of proteins allows you to take advantage of assembly strategies that are not used by biological systems. For example, one can design an artificial enzyme such that when heated, it unfolds making the grippers available to the solution. The grippers then grab simple subcomponents that have been synthesized via normal chemical synthesis methods. Cooling the enzyme will allow it to refold in a precise manner bringing the parts into correct orientation with each other such that natural reactions occur between the pre-prepared parts or catalyzed reactions enabled by precise positioning of catalytic "reactors". Heating the enzyme a second time will allow it to disgorge the newly synthesized nanopart. Proteins such as this that manage chemical reactions by positioning parts and not by directly influencing the reaction may be termed "robozymes". Some people have objected to biologically based nanotechnology because they consider the flexibility of enzymes to be a handicap. Here we can see how it is possible to turn that feature into an advantage.
We are also not limited to using enzymes to form the covalent bonds required in nanoparts. The development of RNA-based ribozymes and even DNA-zymes is ongoing [Hag96, War00, Bre97] and there may be reactions for which they are better suited than proteins.
Over the next decade we can expect the development of "whole genome engineering" [Bra00]. How may this be used? One application that springs to mind is "designer" bacteria to improve human health. Currently humans obtain only a fraction of their vitamin requirements, primarily B-12 and K from bacteria in their intestines. Significant benefits could be derived from bacteria engineered to produce greater quantities of and an expanded variety of vitamins These include reductions in birth defects [Moy01] and cancer [Ame01] as well as the long-term expenses associated multi-vitamin supplements that are currently taken to avoid such problems. If one were to develop such bacteria, it would be highly attractive to include a means to promote their retention in the body when antibiotics are used to treat infections of pathogenic bacteria. So slight modifications to the structures of antibiotic targets (the ribosome, the cell wall manufacturing system, etc.) could be engineered into these "designer" bacteria to make them tolerant of current antibiotic therapies. But it is necessary to ensure that the instructions in the genes of these organisms that allow them to resist the antibiotics not be allowed to escape into the pathogenic organisms. Further it is desirable to ensure that any antibiotic resistance that might develop in the pathogenic microorganisms, something that is a growing problem [Ell99, Ric01, Ryb01, Clo01], be untransferable to engineered microorganisms. One of the most straight forward ways of creating such a "firewall" between the natural bacterial world and the "designer" bacteria is to change the genetic code of the "designer" bacteria. Such a genetic code would be able to manufacture the same proteins found in bacteria but would contain no useful coding information if it jumped the firewall into another organism. Similarly, code jumping into the engineered microorganism would be non-functional. The net result of this approach would be to slowly back us away from the "green goo" situation that currently exists in Nature.
As Table 1 (A) shows, the genetic code is highly
redundant. Only two of the amino
acids, Methionine and Tryptophan have unique mappings from a single
messenger RNA codon triplet. Once whole genome engineering is feasible,
there is no reason that the redundancy be retained. Table
1 (B) shows one example of an alternate genetic code. This code
is designed to swap acidic and alkalinic amino acids and exchanges large
amino acids with small amino acids. A side effect of this approach
is to change many hydrophilic side-chains that one expects to find on the
exterior surfaces of protein into hydrophobic residues that one expects
to find on the interior of proteins. This causes proteins assembled
from naturally coded DNA will either not fold at all or if it folds will
produce non-functional enzymes. Alternate genetic codes could certainly
be designed by computer simulations of the substitutions that would cause
the greatest disruption to known protein structures. The alternate
code makes available 43 empty slots that can be utilized to extend the
genetic code with codes for synthetic amino acids. Some amino acids
such as Cysteine and Proline contribute important properties to protein
structure that would make their removal from the genetic code difficult.
Others such as Leucine and Isoleucine have similar properties that might
allow one to substitute for another in the genetic code. If this
proves feasible, then several additional slots could be made available.
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| A = Adenine, C = Cytosine, G = Guanine, U = Uracil, | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Why would one want to extend the genetic code for synthetic amino acids? It may be difficult to design grippers for some nanoscale parts using the natural code. One could imagine multiple genetic codes designed with artificial amino acids precisely tailored towards gripping or manipulating parts with specific features or atoms of specific sizes. One might have one genetic code optimized for manipulating molecules containing silicon and germanium (for the semiconductor industry) and a completely different code optimized for manipulating molecules containing gallium and arsenic (for the optoelectronics industry). Furthermore, if one desires to make proteins with higher structural strength that may allow us to approach the 'holy grail' of mechanosynthetic nanoassembly [Dre92, Chap. 8], then the addition of other synthetic amino acids to the code grants humans (and computers) designing stronger-than-natural proteins [Dre94] many greater opportunities for assembly path design and optimization.
It is useful to note that food crops engineered on the basis of an alternate genetic code would eliminate a primary complaint of the neo-luddite "naturalists" who are afraid that cross-pollination by genetically modified crops will eventually fatally pollute the "natural" crops provided by nature (and so carefully bred and selected by humans). Crops based on a different genetic code cannot swap genes with their "natural" progenitors. They can however produce the same proteins, cellular structures and additional molecules (flavors, etc.) that are found in our food supply.
In addition, one must engineer the organism so it can import the novel amino acid from the environment, or manufacture the amino acid itself. As the chemical synthesis of amino acids is a $500+ million/yr industry [BCC98], there will certainly be incentives and interest on the part of manufacturers to fund the development of bacteria that contain the enzyme pathways to directly synthesize artificial amino acids and thereby lower manufacturing costs. Manufacturers would expose shallow pools of such bacteria to sunlight and simply harvest the manufactured products.
Can we go beyond the limit of a code that maps a triplet of 4 bases
into 63 amino acids? As Freitas points out [Fre99,
pg 44, based on Fay92], we could expand the code
by adding additional nucleotide pairs to the normal A=T(U) and C
G)
used in DNA (RNA). That would allow the expansion of the genetic code to
63 = 216 codons for 6 base code and 83 = 512 codons
for an 8 base code. However the reengineering of DNA polymerase,
RNA polymerase, the DNA repair enzymes, the ribosome and the new nucleotide
synthesis and degradation pathways required to accomplish this would not
be insignificant.13
A less difficult approach would be to simply increase the number of bases
which must be paired between the messenger and the transfer RNAs by the
ribosome. Increasing the 3-base code to a 4-base code would allow
44 = 256 mappings (allowing 255 amino acids), while increasing
it to a 5-base code would allow 45 = 1024 mappings (1023 amino
acids). Presumably this might be accomplished with only a small amount
of engineering to the ribosome and transfer RNAs. Admittedly this
does decrease the coding density in DNA but this is likely to be more than
offset by a significant increase in the phase space of proteins that can
be manufactured. An expanded discussion of the size of the synthesis
phase space for engineered genetic codes is in [Bra01d].
It is worth noting that one can find at least two natural examples of "extended" genetic codes. The first example is the use of the UGA (opal) stop codon to code for the insertion of the modified amino-acid selenocysteine (the 21st amino acid) in both prokaryotes [Zin87] and eukaryotes [Sta87, Lei88]. Interestingly, the mechanism for this extension in eukaryotes relies on a SECIS element in the upstream 3' untranslated region of the mRNA and effects all UGA codons in the mRNA while in prokaryotes the SECIS elements are immediately downstream from specific UGA codons and only impacts individual codon mappings [Tuj00]. This suggests two possible methods for extending the genetic code without a need for expanding the code from DNA base triplets to quadruplets. The first would be to add elongation factors to the ribosome that significantly change the mapping (of triplets into a specific amino acid) for an entire mRNA (the eukaryotic approach) and the second would be to modify the translational behavior of a ribosome complex based on the local molecular mRNA environment (the prokaryotic approach).
The second example of an "extended" genetic code is found in Methanosarcina barkeri which uses the UAG (amber) stop codon to code for the novel (22nd) amino acid pyrrolysine [Sri02].
The first demonstrations that protein design was feasible were led by William DeGrado of DuPont in the late 1980s [Reg88]. Progress since then may be reviewed in [DeG89, Bry95, DeG99]. To test their understanding of protein structure and function scientists have developed protein and enzyme engineering [Hah90, Han92, Get92, Cho98, Bry98, Woo00, Pas01, Bak03]. There are number of laboratories who have these disciplines as their primary focus [PDL01]. The problem of predicting the structure of folded proteins is yielding increasingly accurate results [Pil01] and projects such as Folding@Home [Shi00, Pan01] and IBM's Blue Gene Project [All01] are likely to finally solve this problem. Though early efforts at designing enzymes based on peptide mimetics were unsuccessful, methods involving catalytic antibodies and reengineered enzymes have been successful [Cor96]. Catalytic antibodies have been developed that can perform reactions that are unachievable by normal chemical methods [Hil00]. Scientists can take pre-existing enzymes and engineer both where the enzymes act [Wen94, Pom98] as well as what they act on [Sor97, Rot01], can engineer how they fold [Nau01] and control their stability [Wak94, Col97, Pet99, Leh00, Che01]. Textbooks have been written on enzyme catalysis and structure [Bra99, Fer99, Sil99, Cop00, Les01] and frequent conferences focus on protein and enzyme engineering [EEC, Jar98]. In part because of recent demonstrations of the feasibility of automated protein design [May96, Dah97, Jia97, Coo00, Bol01], scientists now feel that the de novo design of enzymes seems feasible [Far01]. In situations where enzyme engineering alone is insufficient, chemical alterations may be used to modify enzyme properties [DeS99]. The discipline of protein and enzyme engineering has become sufficiently robust that commercial firms can obtain funding for abilities such as the design of novel Zinc Finger proteins for gene regulation [Sangamo BioSciences (SGMO)] and the evolution of enzymes with superior catalytic properties [Maxygen (MAXY)].
It is worth noting the fraction of genes identified in the sequenced genomes that are involved in metabolism: Yeast: 1062 (~17% of the total), Drosophila: ~1900 (~13%), C. elegans: ~2000 (10%), Human: ~3200 (10%), Mustard weed (Arabadopsis): ~4600 (18%) [Bra01c]. These genes have been organized into families by the PRINTS [Att00], PFAM [Bat00] and PROSITE [Hof99] databases which in turn are being further integrated by databases such as InterPro [Apw00] and MetaFam [Sil01]. The New Folds data from the Protein Database shows a declining number of novel folds being entered into the database. Novel enzyme functions will continue to be discovered as we sequence novel microorganisms, or the organisms with unusual capabilities such as the manufacture of siliceous shells in diatoms or protein-CaCO3 composite shell of abalones, but our discovery of protein structural complexity seems to be approaching a significant fraction of the phase space that has been explored by natural evolution. This is to be expected because evolution has primarily been using a cut-and-paste approach to increase the complexity of higher level organisms. While these numbers may increase somewhat, as unknown genes are characterized, it is likely that we currently have in databases the code for much of the enzyme functionality that nature has evolved. We are even starting to discover cases where evolution produced two separate paths for the same enzymatic function [Gal98].
Where does this lead? First, robust databases of structural classifications such as SCOP, CATH and FFSP have been developed. Second, similarity matching methods, as predicted by Holm & Sander [Hol94, Hol96] has been realized in programs such as Dali [Hol96], CE [Shi98], ProSup [Lac00], PartsList [Qia01] and others. The implications are that continued refinements in these methods will increase the accuracy of protein modeling without the requirement for X-ray crystallography or NMR studies. Thus one can predict fewer researchers will be involved in reverse engineering the 3D structures of natural organisms and more can become available for the engineering of new structures required for nanopart assembly.
We can frame the assembly problem for the
fine-motion
controller (MW: ~30,000 Daltons), one of
Drexler'snanoparts,
by looking at the tradeoff in the complexity of synthesizing the sub-component
building blocks (SBB), with molecular weights from ~50-5000 Daltons, and
the complexity of designing and synthesizing the enzymes (with MW from
10,000-100,000 Daltons) to put SBBs together.
Table 2
shows the number of building blocks that must be combined to equal
in size a 30,000-Dalton
fine-motion
controller device and the minimum number of assembly enzymes required
in a one-pass linear assembly process.
From this table the tradeoff in the building block size and the number of assembly enzymes required can be seen. One approach could use small common building blocks but this requires the design of many novel enzymes to assemble them. Each of these enzymes would have to be designed to grip the growing assembly, direct the next small building block to the proper location of the assembly and mediate its catalytic addition. Another approach would utilize large building blocks that would be manufactured through complex (and low-yielding) chemical synthesis paths that would be assembled by many fewer newly designed enzymes. As there are many more organic chemists capable of working on the synthesis of complex molecules and many fewer individuals with experience in enzyme design, the fastest path would be to use the highest molecular weight building blocks we can imagine assembling and the fewest number of newly designed enzymes. Because complex synthesis paths will generally have low yields, one could gradually replace the chemical reactions with the lowest yields with enzymatic reactions that increase the production efficiency of the large building blocks. For the calculations below, we will assume organic chemists can readily produce SBBs with a MW of ~1000, meaning the number of novel enzymes that must be designed is 29. In practice, the SBB size chosen will depend on the rate of advancement computer-aided retrosynthesis analysis (discussed below). As automated retrosynthesis capabilities advance, the size of the building blocks can increase, limited primarily by the yield of the synthesis steps4.
Computer-aided design of chemical synthesis paths has a long history, dating back to the development of LHASA (Logic and Heuristic Applied Synthesis Analysis: description) in 1964 [Cor64, Cor69, Jud85, Jud90, Jud92]. Other efforts have included SECS [Wip77], SYNCHEM [Gel77], PASCOP [Kau81], CAMEO [Jor90], SynTree [Fig91], HOLOWin and functions in WODCA. OSET is an open-source effort to develop computer-aided organic synthesis. These programs and others have been analyzed in detail by Fick [Fic96]. One of the more interesting members of the group is Hendrickson's SynGen [Hen01, Hen95] which is based on a careful analysis of reaction descriptions [Hen92, Hen90]. The approach used by SynGen -- backward chaining from the product to the reactants using known reactions -- has proven useful in completely different fields such as the study of nucleosynthesis and the abundance of elements in stars [Koc98]. So it may be considered a general method that could be applied to the development of tools for the retroassembly of nanoparts.
SynGen's relevance to this discussion is that it requires only a couple of minutes of computer time on a Macintosh to analyze several thousand synthesis paths and select the least expensive. Other work in the field is focused on selecting synthesis paths that utilize, or produce as byproducts, the least toxic chemicals. Given these robust software tools, we will assert that the design of complex chemical synthesis sequences for the production of SBBs with a molecular weights of ~1000 Daltons is a small fraction of the time required to design enzymes where the software tools are much less robust. We would also assert that the progress that has been made in computer-aided design of molecular synthesis paths will be repeated for enzyme design. It should however, have a shorter development time (<< 20-30 years) due to the greater availability of fast, inexpensive computer systems and the existence of a growing number of individuals with knowledge bases including programming methodologies, organic chemistry and protein structure.
We will conservatively propose that the design of each assembly line enzyme or robozyme requires ~2 person-years to design and test. The reasons for this are as follows:
However, even if the funds were available to fund the design of a PNA system, a problem arises because there are probably only a few hundred people in the world today who are skilled in the discipline of protein design.6 This creates a problem because the time required to design the enzymes to produce the parts for a PNA could be as high as 29 enzymes × 2 yrs/enzyme × 3000 times as many atoms = 174,000 person-years.7 Even if all of the people with the necessary education were working on this project (a very unrealistic assumption) it would still take decades to complete. Clearly support for the education of such individuals needs to be increased. Fortunately, there are many chemists who could move into enzyme design with relatively little reeducation8. The high cost of human labor for protein design and the scarcity of protein designers suggest there is a huge incentive to develop more automated methods for protein design.
The number of enzymes or robozymes required for the production of intermediate sized nanoparts, such as the fine-motion controller, would seem to range from dozens to hundreds. The genes and regulatory sequences required to manufacture these proteins can easily be added to a bacterial genome using whole genome engineering [Bra00]. Thus we will produce microorganisms, where each organism has as its raison d'être the assembly of a complete nanopart! These nanoparts could be harvested and subsequently assembled into complete nanosystems such as a PNA using MEMS or NEMS methods (see Figure 6 for more details).
The fine-motion controller has only moderate amount of redundancy. It has sub-components that range from non-redundant (the tip and the shaft which holds the positioning rings in place), to two-fold redundancy (the shaft end-plates and the 2 large positioning rings and attached positioning arms) to 4-fold redundancy (the central positioning rings). A PNA in contrast is a highly redundant structure consisting of many molecular components that may be nothing more than simple repeats of the same atomic pattern or patterns that may be scaled up/or down in size (for example as the positioning components develop increasingly finer resolution). The circular outer casings that provide the strong structural stiffness are examples of repeats. Their diameter is ~30 nm implying a ~94 nm circumference which would translate to a ring of ~610 carbon atoms. State of the art protein design might require 5-6 years to design a protein such as a functional replacement for the photosynthetic reaction center, but 2nd generation designs may be produced in an order of magnitude less time [Far01]. We extrapolate this experience to suggest that while the initial design of an enzyme to polymerize molecular subunits into curved ring structures or a robozyme to put them together may require a large amount of work, subsequent enzymes performing similar functions are likely to require significantly less design time.
As specified by Freitas [Fre96], a nanorobot designed to supplement the functions of red blood cells (O2 delivery, CO2 removal), known as a respirocyte, contains ~18 billion structural atoms. Even though its mass is ~2250× greater than a PNA, because it has a 12-fold structural redundancy its design cost is only ~200× greater after allowing for some non-redundant components. It is useful to note that the assembly lines for many of the nanoparts that nanorobots require such as nanocomputers, fuel cells, molecular sensors, communications receivers and transmitters, etc. (for more detail see [Fre99]) need only be designed once and can be reused in a variety of nanorobots.
Table 3 summarizes the cost savings in protein
design.
Table 4 uses combined savings that are quite conservative
compared with those suggested by Table 3 to provide
an estimate of how the combined cost savings will effect the manufacuturing
costs for the assembly lines to build nanosystems at various levels of
complexity.
We can see from the table, that the design and manufacture of an assembly line for nanoparts such as the fine-motion controller starts to become a research effort that could be undertaken by a university lab or small corporation sometime between now and 2005. Large projects typically carry multi-million dollar price tags. For example, IBM is spending ~$100 million on the development of Blue Gene, the development cost of the fat substitute Olestra cost Proctor and Gamble ~$200 million and the typical figure used to develop and take a drug successfully to market is $400 million. So while the development for the PNA assembly lines is beyond the budget of all but the richest governments today, between 2005 and 2010 we can see the cost falling to that which may be considered by large corporations and eventually startups that nanoliterate VC firms would fund. It is worth noting that spread over a 5 year period, the estimated cost of the PNA in 2005 is less than 1% of the current annual funding available for the U.S. National Nanotechnology Initiative!
Given the benefits that one would predict can be provided by molecular nanotechnology and molecular assemblers, the cost of producing the infrastructure required to manufacture PNAs falls from ~$60/person now to 6 cents per person in 2005 (based on the current U.S. population). That price seems to be a rather small cost for the benefits that would result. One of the key things to remember is that once the first PNA is produced, the subsequent manufacture of nanoparts by direct atomic assembly becomes much easier. Does that immediately negate the investment in the development of the nanobiotechnology based assembly path? Probably not. Until there is a very robust suite of nanoscale parts that have been designed and assembly paths have been developed using the PNA, it is likely that the approach discussed here will remain useful. It will make much more sense to devote time and resources to developing assembly paths for parts that can only be assembled by PNAs rather than attempt to compete with the nano-biotechnological manufacturing process for a PNA. In addition, postponing the development of full self-replicating capabilities for molecular nanotechnology (PNAs that can assemble PNAs) seems to be a good strategy to minimize the near-term development of the negative uses such abilities [Dre86, Fre00b]. So the use of nanobiotechnology to assemble the nanoparts required for PNAs creates a firewall that prolongs the period during which defensive methods, such as PNAs that can disassemble PNAs, may be developed that minimize the risks posed by the development and spread of molecular nanotechnology.
This analysis leads to a number of recomendations if we are to follow the path outlined above. Government funding agencies should promote education and training in protein and enzyme design. Foundations and far sighted VCs should invest in the development of software directed towards the task of entirely automating the design, manufacture and testing of the enzymes needed for nanopart assembly-lines. Finally businesses should be attempting to understand the potential impact that inexpensive nanopart manufacture will have on their industry and be preparing themselves to take advantage of that.
![]() |
![]() |
Further Information: Biosynthesis of Cholesterol.
There are several interesting aspects of the synthesis of porphryins.
The first involves the joining of 2 linear d-aminolevulinic
acid molecules to form the C4N ring structure of porphobilinogen
by the enzyme ALA dehydratase. Then Uroporphyrinogen I synthase joins
6 porphobilinogen molecules as a linear string, using what can only be
considered a limited polymerase activity (enzymes can count!), and
releases 4 of them in the form of hydroxymethylbilane [Dev92].
Finally, uroporphyrinogen III cosynthase closes the heme ring structure
forming the spiro intermediate form and then breaks the ring structure,
flips one of the porphobilinogen subunits and relinks the ring producing
uroporphyrinogen III.
![]() |
![]() |
|
|
C34H32FeN4O4 MW: 616.55 |
Further Information: Biosynthesis of Heme.
![]() |
![]() |
Molecules like these show that enzymes are fully capable of constructing complex molecules with covalent bonds. Other molecules with complex 3-D structures include the antibiotic Erythromycin A (C37H67NO13, MW: 734.05) and the neurotoxin responsible for 'red tide' Brevetoxin B (C50H70O14, MW: 895.2) [Nic95]. Perhaps the largest common molecule synthesized is Vitamin B-12 (C63H88CoN14O14P, MW: 1355.55) [Woo79]. Even higher molecular weight natural molecules are known and have been synthesized. These include vancomycin (C66H77N9O24Cl2, MW: 1485.73) [Nag88, Bog01], palytoxin (C129H234N3O54, MW: 2691.3) [Suh94] and maitotoxin (C164H256O68S2Na2, MW: 3425.8) [Tac96, Kis98, Mur00].
Dave Woodcock, a professor at Okanagan University College maintains a page listing the features of many of the most complex natural molecules known. The National Cancer Institute's Molecular Targets Drug Discovery Program maintains catalog of natural compounds with useful properties. Many of these are non-polymeric compounds with 50-100 carbon atoms. Indiana University's Molecular Structure Center has links to similar pages under their Simple, Common, and Interesting Molecules page.
Background Information: Natural Toxins.
|
|
![]() |
Diagram of the complete process flow for the development of nanobots using nanobiotechnology. Using software designed to (a) evolve nanopart designs, e.g. Nano@Home, or (b) Nanopart Design Tools (NanoCAD), Nanoscale Part Designs are produced. These are then retroassembled into basic building blocks and specifications for the enzymes that must put those building blocks together. The building blocks are further disassembled using currently existing retrosynthesis programs producing the building block chemical synthesis pathways (not shown). As the "Computer Aided Nanopart Design" and "Computer Aided Enzyme Design" systems develop, they must be integrated with each other such that "Design for Assembly" is achieved. Further development would replace "Computer Aided Enzyme Design" with "Automated Enzyme Design" to lower costs. Once a set of enzymes is designed they are produced in Engineered Microorganisms. The resulting proteins may be harvested and used in Microfluidic Flow-Through Channels through which the previously synthesized complex building blocks flow such that assembled nanoparts result. Alternatively, a complete set of assembly enzymes plus any necessary synthesis enzymes or building-block import transporters can be engineered into a microorganism that manufactures the subcomponents and assembles the nanopart.12 Nanoparts are assembled into more complex structures, perhaps using NEMS/MEMS based robotics for assistance. Finally programmable nanoassemblers are produced that can lead to the production of real nanorobots [Fre96]. |