In a recent Perspectives article in Nature Communications, NIST’s Elizabeth Strychalski and co-authors from industry and academia offer a framework for engineering whole genomes of organisms.
Since the turn of this century, as researchers have progressed from engineering the genomes of tiny viruses to those of more complex bacteria, the necessary collaborations have grown correspondingly larger and more complex. The synthetic polio virus, 7,500 base pairs long, took three researchers to complete in 2002. The soon-to-be-completed synthetic yeast genome, at more than 12 million base pairs, has required nearly 175 collaborators. Strychalski and her co-authors estimate that engineering the billions of base pairs in mammals’ DNA will require about 500 researchers, unless research teams adopt efficiencies.
The challenges of large-scale scientific endeavors extend beyond the lab work of designing and assembling a genome to managing workflows and data and handling legal and contractual matters. The fields of physics and astronomy have resolved similar issues to build their supercolliders and space telescopes; in the field of biology, only the Human Genome Project, which decoded our DNA, has been a similarly massive effort. Nearly 3,000 researchers shared authorship on the article in Nature that revealed the “first draft” of the human genome.
The intent of the Nature Communications article, says Strychalski, is “to break down the ambitious goal of engineering a large genome into tractable pieces that take place in individual labs.” The article gives guidance for using existing or developing new “technologies, repositories, standards, and frameworks,” according to the authors.
The authors’ recommendations span the entire “design, build, test, learn” cycle of genome engineering.
The authors point out that other communities have already successfully navigated similar challenges, and so genome engineers should seek to adopt solutions from fields such as the aerospace and semiconductor industries.
“What kinds of projects are possible if you can organize at scale?” asks Strychalski.
Paper: Bartley, B.A., Beal, J., Karr, J.R. et al. Organizing genome engineering for the gigabase scale. Nat Commun 11, 689 (2020). DOI: https://doi.org/10.1038/s41467-020-14314-z