
Microsoft Researching Storage Based on Biological DNA

Craig Matsumoto
9/1/2017

SAN JOSE -- @Scale -- Microsoft is researching DNA-based storage, a technology that promises to compact a data center's worth of information into a space the size of a few sugar cubes.

DNA-based storage would require less space than other media and consume less power -- important in the age of web-scale data centers. But the medium is really slow, with data retrieval rates of about 10 MBytes per week.

"Most of this, by the way, is dominated by FedEx moving test tubes around," said Luis Ceze, a Microsoft researcher and a professor at the University of Washington, who presented on DNA storage during the keynote at Facebook's @Scale conference today.

Read time can be sped up in some obvious ways -- putting everything in the same building, for example. And given how quickly DNA sequencing is advancing, Ceze believes 100Gbit/s read speeds might eventually be possible.
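To put the gap between those two figures in perspective, here is a back-of-the-envelope calculation. It is only a sketch: it assumes "10 MBytes" means 10 x 10^6 bytes and that the weekly figure is a sustained average, neither of which the talk spelled out.

```python
SECONDS_PER_WEEK = 7 * 24 * 60 * 60  # 604,800 seconds

# Assumption: "10 MBytes" means 10 * 10**6 bytes (decimal megabytes).
current_bits_per_second = 10 * 10**6 * 8 / SECONDS_PER_WEEK

# The speed Ceze says might eventually be possible: 100 Gbit/s.
target_bits_per_second = 100 * 10**9

speedup = target_bits_per_second / current_bits_per_second

print(f"today:   {current_bits_per_second:.0f} bit/s")
print(f"target:  {target_bits_per_second:.0e} bit/s")
print(f"speedup: {speedup:.2e}x")
```

Under those assumptions, today's rate works out to roughly 132 bits per second, so reaching 100Gbit/s would mean a speedup of several hundred million times.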

It helps that the data doesn't need to be retrieved perfectly (more on that in a moment).

The primary motivation for this research is to save power at web scale. But there are other factors, too. Other storage media are reaching their limits, and all of them inevitably deteriorate over time. DNA offers the promise of more efficient storage that can last hundreds of thousands of years.

(By the way, all the DNA here is synthetic. It's not as if they're injecting mice with data.)

DNA is made up of combinations of four nucleotides: adenine, cytosine, guanine and thymine. So the translation from bits into nucleotides seems straightforward -- 00 could be "A," 01 could be "C," and so on.
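That naive translation can be sketched in a few lines of Python. The article only gives the first two assignments (00 to "A," 01 to "C"); the mappings for "G" and "T" and the function names here are assumptions made for illustration, not Microsoft's actual scheme.

```python
# Two bits per nucleotide. Only 00 -> A and 01 -> C come from the article;
# the G and T assignments are assumed.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def encode(data: bytes) -> str:
    """Turn each byte into four nucleotides, most significant bits first."""
    bits = "".join(f"{byte:08b}" for byte in data)
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def decode(strand: str) -> bytes:
    """Reverse the mapping: four nucleotides back into one byte."""
    bits = "".join(BASE_TO_BITS[base] for base in strand)
    return bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))


print(encode(b"Hi"))        # CAGACGGC
print(decode("CAGACGGC"))   # b'Hi'
```

Note that nothing in this mapping stops a run of identical bits from becoming a run of identical nucleotides: four zero bytes would encode as sixteen A's in a row.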

Of course, it's not that easy. Repeating one nucleotide too many times -- such as C-C-C-C -- makes the sequence more fragile and harder to read, so Microsoft adds coding tricks to prevent such combinations. Long chains are prone to instability, so Microsoft puts only 150 nucleotides in a chain -- but that means adding codes to preserve the proper order of these chains.
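Ceze did not detail the coding tricks, so the following is only a sketch of one way both constraints could be met, not Microsoft's method. It uses a rotating code (data is converted to base-3 digits, and each digit picks one of the three nucleotides that differ from the previous one, so no nucleotide ever repeats) and prefixes each 150-nucleotide chain with an index so the chains can be put back in order. The 2-byte index size and all function names are assumptions.

```python
BASES = "ACGT"
STRAND_NT = 150                                 # chain length cited in the article
INDEX_BYTES = 2                                 # assumed size of the ordering index
PAYLOAD_BYTES = STRAND_NT // 6 - INDEX_BYTES    # 6 nucleotides per byte -> 23 bytes


def to_trits(data: bytes) -> list[int]:
    """Write each byte as six base-3 digits (3**6 = 729 covers 0..255)."""
    trits = []
    for byte in data:
        digits = []
        for _ in range(6):
            digits.append(byte % 3)
            byte //= 3
        trits.extend(reversed(digits))
    return trits


def from_trits(trits: list[int]) -> bytes:
    """Rebuild bytes from groups of six base-3 digits."""
    out = bytearray()
    for i in range(0, len(trits), 6):
        value = 0
        for t in trits[i:i + 6]:
            value = value * 3 + t
        out.append(value)
    return bytes(out)


def rotate_encode(trits: list[int], prev: str = "A") -> str:
    """Each digit selects one of the three bases that differ from the last one."""
    strand = []
    for t in trits:
        prev = [b for b in BASES if b != prev][t]
        strand.append(prev)
    return "".join(strand)


def rotate_decode(strand: str, prev: str = "A") -> list[int]:
    """Recover the digits by asking which of the three choices was taken."""
    trits = []
    for base in strand:
        trits.append([b for b in BASES if b != prev].index(base))
        prev = base
    return trits


def to_strands(data: bytes) -> list[str]:
    """Split data into indexed chunks and encode each as one short chain."""
    strands = []
    for n, start in enumerate(range(0, len(data), PAYLOAD_BYTES)):
        chunk = n.to_bytes(INDEX_BYTES, "big") + data[start:start + PAYLOAD_BYTES]
        strands.append(rotate_encode(to_trits(chunk)))
    return strands


def from_strands(strands: list[str]) -> bytes:
    """Decode chains in any order, then sort by index to restore the data."""
    chunks = [from_trits(rotate_decode(s)) for s in strands]
    chunks.sort(key=lambda c: int.from_bytes(c[:INDEX_BYTES], "big"))
    return b"".join(c[INDEX_BYTES:] for c in chunks)
```

The price of never repeating a nucleotide is density: each nucleotide now carries one base-3 digit (about 1.58 bits) instead of two bits, and the index takes a further slice out of every chain.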

And finally, DNA replication is inherently imperfect, so error-correcting codes go in there as well.
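The talk did not say which error-correcting codes are used. As a deliberately simple illustration of the idea, and not the actual scheme, the sketch below adds one XOR parity chunk to a group of equal-length chunks so that any single lost chunk can be rebuilt. Production systems would typically use stronger codes, such as Reed-Solomon; the function names here are hypothetical.

```python
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length chunks."""
    return bytes(x ^ y for x, y in zip(a, b))


def add_parity(chunks: list[bytes]) -> list[bytes]:
    """Append one parity chunk: the XOR of all the data chunks."""
    return chunks + [reduce(xor_bytes, chunks)]


def recover(chunks: list) -> list[bytes]:
    """Rebuild a single missing chunk (marked None) from the survivors."""
    survivors = [c for c in chunks if c is not None]
    missing = reduce(xor_bytes, survivors)
    return [missing if c is None else c for c in chunks]


coded = add_parity([b"abcd", b"efgh", b"ijkl"])
damaged = [coded[0], None, coded[2], coded[3]]   # one chunk was lost
print(recover(damaged)[:3])                      # [b'abcd', b'efgh', b'ijkl']
```

This works because XOR-ing every chunk, parity included, yields all zeros, so the XOR of the survivors equals whatever is missing. It handles one lost chunk per group and cannot fix a chunk that comes back corrupted rather than missing.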

Reading the data involves DNA sequencing, but you can't exactly grab a nucleotide chain with tweezers. The approach is to use polymerase -- an enzyme that synthesizes DNA and RNA molecules -- to make lots of copies of a strand of interest (that's another benefit of DNA storage: unlimited copies nearly for free). A bunch of those copies are then sequenced, and the reads are compared to find a consensus about what the chain was supposed to be.
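The consensus step can be illustrated with a per-position majority vote. This is a simplified sketch: it assumes every read has the same length and that errors are substitutions only, whereas real sequencing reads also contain insertions and deletions and need to be aligned first. The function name is hypothetical.

```python
from collections import Counter


def consensus(reads: list[str]) -> str:
    """Pick the most common nucleotide at each position across all reads."""
    return "".join(
        Counter(column).most_common(1)[0][0]
        for column in zip(*reads)
    )


# Five noisy reads of the same chain; three of them contain one wrong base.
reads = ["ACGTAC", "ACGTAC", "ACCTAC", "ACGTAG", "TCGTAC"]
print(consensus(reads))   # ACGTAC
```

Because the errors land at different positions in different copies, each one is outvoted, which is why sequencing many cheap copies can stand in for reading any single strand perfectly.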

That brings up a point: This type of storage smashes the expectations of precision that we used to have with tapes and hard disks. That's OK, though, because software itself might give up some of that precision for the sake of saving energy.

Ceze referred to an area of study called approximate computing, where a processor's "thinking" can be made less thorough in exchange for consuming less power. It's the same way our brains work, he said; full attention takes more energy. This approach of accepting good-enough rather than perfect might be practical in some cases, "because most applications do not require perfect communication and storage accuracy," he said.

— Craig Matsumoto, Editor-in-Chief, Light Reading
