Light Reading
Google's Andromeda infrastructure is designed to provide bare-metal performance and low latency to customers worldwide.

Google's Andromeda Relieves Cloud Strain

Mitch Wagner
8/26/2014
50%
50%

MOUNTAIN VIEW, Calif. -- Hot Interconnects -- Google relies on its Andromeda networking platform to deliver a global cloud infrastructure that gives customers the security and performance benefits of local private networks.

"We want bare-metal performance and low latency for the services we deliver," said Google (Nasdaq: GOOG) Distinguished Engineer Amin Vahdat, delivering a keynote at the conference here today.

SDN is key to delivering the needed performance and security, he said.

SDN at its most fundamental involves separating the control plane from the data plane, Vahdat noted. "A logically centralized hierarchical control plane beats peer-to-peer every time," he said. The data plane can run at network speed, while the control plane can run on commodity hardware, scaling as needed. The control plane requires 1% of the overhead of the entire network, Vahdat said.

But managing that infrastructure requires new tools and skills, he said.

"It turns out that running a hundred or a thousand servers is a very difficult operation. You can't hire people out of college who know how to operate a hundred or a thousand servers," Vahdat said. Tools are often designed for homogeneous environments and individual systems. Human reaction time is too slow to deliver "five nines" of uptime, maintenance outages are unacceptable, and the network becomes a bottleneck and source of outages.

Google looks to SDN and network functions virtualization (NFV) to orchestrate provisioning, high availability, and meet application performance requirements, Vahdat said. The technology must be distributed throughout the network, which is only as strong as its weakest link.

Andromeda is Google's code-name for its network virtualization platform. It's designed to provide each external user with the illusion that they're on a dedicated network with dedicated performance and its own IP address space. Applications require real-time high performance and low-latency communications to virtual machines. Users also require service chaining to tools such as load-balancing, and the ability to grow and shrink the number of servers available to applications as demand requires. (See Google, Microsoft Challenge Service Providers and Google's Andromeda Strain Is Spreading.)

Security is a huge requirement. "Large companies are constantly under attack. It's not a question of whether you're under attack but how big is the attack," Vahdat said.

Power and cooling are the major costs of a global infrastructure like Google's. "That's true of even your laptop at home if you're running it 24/7. At Google scale, that's very apparent," Vahdat said.

Google has a global infrastructure, with data centers and points of presence worldwide to provide low-latency access to services locally, rather than requiring customers to access a single point.

The company runs two networks. Its private, server-to-server network is bigger than its public network, and one of the world's largest SDN deployments. Connectivity between data centers is comparable to within data centers.

Andromeda provides significant performance improvements over a state-of-the-art baseline, as seen in Vahdat's slides:

The promise of cloud computing is just beginning.


Find out more about key developments related to the systems and technologies deployed in data centers on Light Reading's data center infrastructure channel.


"Many people think about cloud computing as being able to get on-demand access to computing. I don't have to go buy servers; I can rent them for a minute, or an hour, or a day. I can get burst capacity of as many servers as I like, whatever memory, configuration or disk, etc., that I like. I think actually yes, this is powerful, but this is really just the beginning," Vahdat said. "The really exciting parts of cloud computing are on the verge of happening."

These include a fundamentally easier operational model; higher uptime; state-of-the-art infrastructure services such as denial-of-service protection, load balancing, and storage; and new programming models for low latency and massive input-output performance.

What cloud doesn't do is take away the challenges of running an IT infrastructure. "Most cloud customers, if you poll them, say the operational overhead of running on the cloud is as hard or harder today than running on your own infrastructure," Vahdat said.

Click the photo below for a selection of Vahdat's slides -- and more.

He's Hydrated
Google Distinguished Engineer Amin Vahdat
Google Distinguished Engineer Amin Vahdat

— Mitch Wagner, Circle me on Google+Follow me on TwitterVisit my LinkedIn profileFollow me on Facebook, West Coast Bureau Chief, Light Reading. Got a tip about SDN or NFV? Send it to wagner@lightreading.com.

(8)  | 
Comment  | 
Print  | 
Newest First  |  Oldest First  |  Threaded View
pcharles09
50%
50%
pcharles09,
User Rank: Light Beer
8/31/2014 | 10:40:01 PM
Re: What's the protocol?
@jabailo,

Wouldn't the packet alterations get messy though? In the TCP case , if somethings off just a little bit. Also, there'd have to be extra overhead for error checking right?
jabailo
50%
50%
jabailo,
User Rank: Light Sabre
8/27/2014 | 10:52:17 PM
Re: What's the protocol?
Like in a private network you can make assumptions that you can't make when shipping it out across the public Internet.

For example, I'm looking at this diagram:

http://www.freesoft.org/CIE/Course/Section4/8.htm

What about all that space for "source port" and "destination port".   Inside your own network, do you need to allocate that many bits?

Seems like for every bit you can reduce in a packet you get that much greater throughput.

 
pcharles09
50%
50%
pcharles09,
User Rank: Light Beer
8/27/2014 | 5:58:28 PM
Re: What's the protocol?
@jabailo,

Ahh ok I see. That's a good point.
kq4ym
50%
50%
kq4ym,
User Rank: Light Sabre
8/27/2014 | 9:45:56 AM
Androworld
Google certainly is going to lead the way in NFV/SDN services and of course the free PR they get for all announcements is not a bad thing for them either. Security is still going to be an ongoing problem and probably others are going to see their way to NFV and the cloud just to help solve that issue.
jabailo
50%
50%
jabailo,
User Rank: Light Sabre
8/27/2014 | 12:56:38 AM
Re: What's the protocol?
Sure, I was just thinking they could reduce the packet sizes by removing some of the headers, since it all runs "in-house" on their cloud.
pcharles09
50%
50%
pcharles09,
User Rank: Light Beer
8/27/2014 | 12:29:06 AM
Re: What's the protocol?
@jabailo,

My guess is because that's what they've always used. No reason to change unless there's a problem/vulnerability.
jabailo
50%
50%
jabailo,
User Rank: Light Sabre
8/26/2014 | 10:51:07 PM
What's the protocol?
Just how raw is the data layer protocol?  Is there any reason for it to be tcp/ip? 
Atlantis-dude
50%
50%
Atlantis-dude,
User Rank: Light Sabre
8/26/2014 | 7:42:42 PM
Autopilot
Is Andromeda the same as Azure's Autopilot? And what is the baseline?
Flash Poll
From The Founder
It's clear to me that the communications industry is divided into two types of people, and only one is living in the real world.
LRTV Interviews
From 4G to 5G: Alcatel-Lucent's Dave Geary

11|25|14   |   09:09   |   (0) comments


Dave Geary, President of Wireless at Alcatel-Lucent, talks about the evolution of the 4G market, small cells, partnerships, 5G and the IoT.
LRTV Huawei Video Resource Center
Building a Secure Telefonica Network With Huawei's High-End Firewall

11|24|14   |   4:37   |   (0) comments


Andrew Davies, IP architect of the Telefonica, a leading digital communications company, discusses the Huawei security gateway solution and putting the solution into the testbed.
LRTV Huawei Video Resource Center
Huawei Partners with Spirent to Verify CE12816's 10GE Port & TRILL Networking Capabilities

11|24|14   |   2:50   |   (0) comments


Spirent Communications is the world's leading supplier for telecom testing appliances and solutions. Spirent has been in a close partnership with Huawei for a long time.
LRTV Huawei Video Resource Center
Saudi Airlines & Its ICT Transformation

11|24|14   |   2:07   |   (0) comments


In this video, Saudi Airlines discusses its network problems and how Huawei's Agile Network is its all-in-one solution.
LRTV Huawei Video Resource Center
Huawei's Agile Switch Benefiting Saudi Arabia's Yamamah Hospital

11|24|14   |   2:40   |   (0) comments


Saudi Arabia's Yamamah Hospital speaks about how Huawei's Agile Switch has improved the medical service's network infrastructure.
LRTV Huawei Video Resource Center
FanPlay & Huawei Build a Wireless Agile Smart Stadium

11|24|14   |   2:13   |   (0) comments


FanPlay is a cloud-based white label service, which is effectively a football fan engagement platform underpinned by mobile payment technology.
LRTV Huawei Video Resource Center
Building an Agile Stadium

11|24|14   |   3:54   |   (0) comments


Stadiums may be thousands of tons of concrete and steel, but they now need to be agile. Being at the stadium may not be as alluring as it once was. Sports franchises and stadium operators discuss how to get fans back.
LRTV Huawei Video Resource Center
Huawei Helps ChinaCache Tackle Challenges in the Internet Industry

11|24|14   |   3:09   |   (0) comments


ChinaCache is China's largest content distribution network supplier. Huawei's CE12800 has provided ChinaCache with very strong support in its establishment of an infrastructure network.
LRTV Huawei Video Resource Center
Cefinity on Managed Security Services & Next-Generation Firewall

11|24|14   |   7:05   |   (0) comments


Cefinity is a cloud management service provider in Southeast Asia. Ivan Zhang, CEO of the company, discusses the implementation of security service management in the cloud era.
LRTV Huawei Video Resource Center
Huawei's Agile Gateway in the Eyes of Cefinity

11|24|14   |   2:11   |   (0) comments


Cefinity is a managed service provider for enterprise networks. The company currently uses Huawei's AR series routers for the most complete range of functions. CEO Ivan Zhang speaks about the advantages of the AR series routers.
LRTV Huawei Video Resource Center
CTO of Bus-Online Talks About Huawei's Agile Gateway

11|24|14   |   2:53   |   (0) comments


Bus-Online covers around 100 million users everyday. In addition to providing mobile TV, and advertising services to the public, Bus-Online has also entered the field of mobile Internet.
LRTV Huawei Video Resource Center
Amsterdam ArenA as an Agile Campus

11|24|14   |   3:31   |   (0) comments


The Amsterdam ArenA, home of the Ajax soccer team, can be a crowded space. ArenA has partnered with Huawei to work on bringing ample bandwidth to 53,000 people at the same time.
Upcoming Live Events
December 2, 2014, New York City
December 3, 2014, New York City
December 8-10, 2014, Reykjavik, Iceland
February 10, 2015, Atlanta, GA
April 14, 2015, New York City, NY
May 6, 2015, McCormick Convention Center, Chicago, IL
May 13-14, 2015, The Westin Peachtree, Atlanta, GA
June 9-10, 2015, Chicago, IL
Infographics
Irish Telecom outlines the rise of VoIP technology, including its adoption within businesses and their perception of its quality.
Hot Topics
Bell Labs Chief Slams 'Toy' Networks
Robert Clark, 11/19/2014
$38.3M: Ain't That a Kik in the SMS
Sarah Reedy, Senior Editor, 11/20/2014
Do You Have a 2020 Vision?
Dennis Mendyk, Vice President of Research, Heavy Reading, 11/21/2014
Operators Should Block Ads to Get Their Cut, Startup Says
Sarah Reedy, Senior Editor, 11/24/2014
$35B+ Spectrum Auction Dings Verizon, Shines Dish
Dan Jones, Mobile Editor, 11/24/2014
Like Us on Facebook
Twitter Feed