& cplSiteName &

Google & Netflix Launch 'Kayenta' for Cloud-Scale Continuous Software Delivery

Mitch Wagner
4/10/2018
50%
50%

Enterprises looking to deploy application updates rapidly without breaking things are getting help from two of the biggest and most successful hyperscale cloud providers.

Google (Nasdaq: GOOG) and Netflix Inc. (Nasdaq: NFLX) are teaming up to launch "Kayenta," an open source tool for continuous software delivery at cloud scale.

Kayenta is designed to help cloud application providers get beyond the old "waterfall" software delivery method, where updates take months or years to come out, and move to a continuous update cycle, where updates are happening all the time.

Cloud-scale applications are complex and easily broken, much like this 1931 device from cartoonist Rube Goldberg.
Cloud-scale applications are complex and easily broken, much like this 1931 device from cartoonist Rube Goldberg.


Boost your knowledge of cloud-native software and innovations driving data center transformations! Join us in Austin at the fifth annual Big Communications Event May 14-16. The event is free for communications service providers -- secure your seat today!


The problem with continuous updates to software delivered over the cloud -- a.k.a. software-as-a-service, or SaaS -- is that developers find it difficult to test software to be sure it doesn't break production systems before deployment. The solution to that problem is to deploy to a small number of users at first, test for problems, and then deploy to a larger number if no problems are found. If problems are found, then roll back, adjust and iterate. This process is called "canary analysis," after the old practice of coal miners bringing canaries to work (great for the coal miners, hard on the canaries).

Using canary analysis, it's easy to detect problems if the update crashes your application, but slight degradations of service can be difficult to detect, and yet extremely harmful if deployed to users on a global scale, Andrew Phillips, Google Cloud product manager, said in an interview.

"As humans, we're bad at detecting small changes, and we're very bad at determining whether a small change is in the statistically expected range of fluctuations," Phillips said. In other words, it can be difficult to determine whether a small change in application performance is due to a code update, or whether the change is just random.

That's where Kayenta comes in. Kayenta is an open source tool that works with Spinnaker -- an open source continuous deployment tool initially developed by Netflix -- to automate rolling out software updates at small scale, test for small changes, and then either roll the update out at wider scale or roll it back for bug fix, Phillips said.

"Every organization says, on the one hand, we must move faster, but we also have to stay safe -- can't afford to break all our production applications," Phillips said. Kayenta is designed to help enterprises resolve that paradox.

"Developed jointly by Google and Netflix, Kayenta is an evolution of Netflix's internal canary system, reimagined to be completely open, extensible, and capable of handling more advanced use cases," according to a post on the Google blog Tuesday. "It gives enterprise teams the confidence to quickly push production changes by reducing error-prone, time-intensive, and cumbersome manual or ad-hoc canary analysis."

Kayenta apparently competes with at least one startup. Jyoti Bansal, who founded AppDynamics, which sold to Cisco for $3.7 billion last year, is focused on "continuous application delivery as a service" with his new startup, Harness. Harness is designed to let app developers get new features and upgrades out to users fast, while also ensuring security and application stability. (See AppDynamics Founder Launches 'Harness' for Continuous App Delivery.)

And in a related development, startup Gremlin is looking to make "chaos engineering" widely available -- taking out components of an Internet application, such as individual servers or connections -- on a controlled basis, to test whether the system recovers gracefully. (See Gremlin Looks to Bring 'Chaos Engineering' to the Masses)

Related posts:

— Mitch Wagner Follow me on Twitter Visit my LinkedIn profile Visit my blog Follow me on Facebook Editor, Enterprise Cloud, Light Reading

(5)  | 
Comment  | 
Print  | 
Newest First  |  Oldest First  |  Threaded View        ADD A COMMENT
mendyk
50%
50%
mendyk,
User Rank: Light Sabre
4/24/2018 | 2:49:55 PM
Re: Full speed
Microsoft gave up worrying about the effects of their dreaded updates on users long ago.
Michelle
50%
50%
Michelle,
User Rank: Light Sabre
4/24/2018 | 2:10:55 PM
Re: Full speed
I've had problems with Windows 8 and 10 this month. It really seems like the latest round of updates weren't tested before release. To be fair, I have had problems with Windows 10 updates for the last 2-3 months...
kq4ym
50%
50%
kq4ym,
User Rank: Light Sabre
4/23/2018 | 12:02:56 PM
Re: Full speed
This would seem to be a solution to the canary method that will " test for problems, and then deploy to a larger number if no problems are found. If problems are found, then roll back, adjust and iterate." and Google/Netflix may very well have come up with a way to speed that up on a larger scale. I've experienced big problems with Windows 10 updates this month, that would seem to beg for their solution. Bears watching to see how effective it becomes.
Mitch Wagner
50%
50%
Mitch Wagner,
User Rank: Lightning
4/15/2018 | 4:06:30 PM
Re: Full speed
No linear relation but same philosophy - incremental changing and testing. 
Michelle
50%
50%
Michelle,
User Rank: Light Sabre
4/14/2018 | 1:10:07 PM
Full speed
This is an exciting release. Breaking things at scale is generally discouraged so this is a great solution. Does this have any relation to Netflix's Chaos Monkey? Is it a later iteration or built upon the same rules?
Featured Video
From The Founder
Light Reading founder Steve Saunders talks with VMware's Shekar Ayyar, who explains why cloud architectures are becoming more distributed, what that means for workloads, and why telcos can still be significant cloud services players.
Flash Poll
Upcoming Live Events
May 14-16, 2018, Austin Convention Center
May 14, 2018, Brazos Hall, Austin, Texas
September 24-26, 2018, Westin Westminster, Denver
October 9, 2018, The Westin Times Square, New York
October 23, 2018, Georgia World Congress Centre, Atlanta, GA
November 7-8, 2018, London, United Kingdom
November 8, 2018, The Montcalm by Marble Arch, London
November 15, 2018, The Westin Times Square, New York
December 4-6, 2018, Lisbon, Portugal
All Upcoming Live Events
Hot Topics
I'm Back for the Future of Communications
Phil Harvey, US News Editor, 4/20/2018
Verizon: Lack of Interoperability, Consistency Slows Automation
Carol Wilson, Editor-at-large, 4/18/2018
AT&T Exec Dishes That He's Not So Hot on Rival-Partner Comcast
Mari Silbey, Senior Editor, Cable/Video, 4/19/2018
Facebook Hearings Were the TIP of the Data Iceberg
Dan Jones, Mobile Editor, 4/20/2018
Pay-for-Play Is a Sticking Point in Congress
Mari Silbey, Senior Editor, Cable/Video, 4/18/2018
Live Digital Audio

A CSP's digital transformation involves so much more than technology. Crucial – and often most challenging – is the cultural transformation that goes along with it. As Sigma's Chief Technology Officer, Catherine Michel has extensive experience with technology as she leads the company's entire product portfolio and strategy. But she's also no stranger to merging technology and culture, having taken a company — Tribold — from inception to acquisition (by Sigma in 2013), and she continues to advise service providers on how to drive their own transformations. This impressive female leader and vocal advocate for other women in the industry will join Women in Comms for a live radio show to discuss all things digital transformation, including the cultural transformation that goes along with it.

Like Us on Facebook
Twitter Feed