Microsoft Has a Plan to Add DNA Data Storage to Its Cloud

Story image for DNA from MIT Technology Review

Tech companies think biology may solve a looming data storage problem.

Based on early research involving the storage of movies and documents in DNA, Microsoft is developing an apparatus that uses biology to replace tape drives, researchers at the company say.

Computer architects at Microsoft Research say the company has formalized a goal of having an operational storage system based on DNA working inside a data center toward the end of this decade. The aim is a “proto-commercial system in three years storing some amount of data on DNA in one of our data centers, for at least a boutique application,” says Doug Carmean, a partner architect at Microsoft Research. He describes the eventual device as the size of a large, 1970s-era Xerox copier.

Internally, Microsoft harbors the even more ambitious goal of replacing tape drives, a common format used for archiving information. “We hope to get it branded as ‘Your Storage with DNA,’” says Carmean.

The plans signal how seriously some tech companies are taking the seemingly strange idea of saving videos, photos, or valuable documents in the same molecule our genes are made of. The reason, says Victor Zhirnov, chief scientist of the Semiconductor Research Corporation, is that efforts to shrink computer memory are hitting physical limits, but DNA can store data at incredible densities.

Subscribe to Weekend Reads
Our guide to stories in the archives that put technology in perspective.

Manage your newsletter preferencesFormatted in DNA, every movie ever made would fit inside a volume smaller than a sugar cube.

“DNA is the densest known storage medium in the universe, just based on the laws of physics. That is the reason why people are looking into this,” says Zhirnov. “And the problem we are solving is the exponential growth of stored information.”

Last July, Microsoft publicly announced it had stored 200 megabytes of data in DNA strands, including a music video, setting a record. The work, described in a paper published in March on the pre-print server Biorxiv, has been led by Carmean and Karin Strauss, both of Microsoft Research, and the University of Washington laboratory of computer scientist Luis Ceze.

Major obstacles to a practical storage system remain. Converting digital bits into DNA code (made up of chains of nucleotides labeled A, G, C, and T) remains laborious and expensive because of the chemical process used to manufacture DNA strands. In its demonstration project, Microsoft used 13,448,372 unique pieces of DNA. Experts say buying that much material on the open market would cost $800,000.

“The main issue with DNA storage is the cost,” says Yaniv Erlich, a professor at Columbia University who earlier this year reported a novel approach to DNA data storage. “So the main question is whether Microsoft solved this problem.” Based on their publication, Erlich says, “I did not see any progress towards this goal, but maybe they have something in their pipeline.”

Will DNA data storage prove commercially viable?

Share your thoughts.

According to Microsoft, the cost of DNA storage needs to fall by a factor of 10,000 before it becomes widely adopted. While many experts say that’s unlikely, Microsoft believes such advances could occur if the computer industry demands them.

Automating the process of writing digital data into DNA will also be critical. Based on the several weeks it took to carry out their experiment, Carmean estimates that the rate of moving data into DNA was only 400 bytes per second. Microsoft says that needs to increase to 100 megabytes per second.

Reading the data out is easier. That was done using a high-speed sequencing machine, including to recall specific parts of the files, analogous to random-access memory on a computer. Even a two-fold improvement in DNA reading would make that aspect of the system efficient enough for commercial use, Microsoft thinks.

Because writing and retrieving data into DNA is slow, any early use of the technology will be restricted to special situations. That could include data that needs to be archived for legal or regulatory reasons, such as police body-cam video or medical records.

Microsoft currently works with Twist Bioscience, a DNA manufacturer located in San Francisco. Twist is one of a number of newly formed companies trying to improve DNA production, a list that now includes startups DNAScript, Nuclera Nucleics, Evonetix, Molecular Assemblies, Catalog DNA, Helixworks, and a spin-off of Oxford Nanopore called Genome Foundry.

One exciting possibility being pursued by some of the startups is to replace the 40-year-old chemical process used to make DNA with one that employs enzymes, as our own bodies do. Jean Bolot, scientific director of Technicolor Research, in Los Altos, says it is funding such work at Harvard University, in the laboratory

I am a blogger with the main motive of writing articles at my choice of level. I do love to write articles and keep my website updated regularly , if you love my article then be sure to share with your friends as they would love to read my article...

What's your reaction?

Related Posts

1 of 70