How Data Science Can Boost Network Operations

Kiran Inampudi
9/29/2017
100%
0%

Data science techniques have been around for many years and successfully applied in several areas like fraud detection, personalized recommendations, etc. Most recently, these techniques are being leveraged in service provider and telco network operations. The combination of SDN/NFV and data science is becoming a powerful new approach for making networks more reliable and secure.

What is data science?
Data science involves using automated methods to analyze massive amounts of data and to extract knowledge from them. Data science is a broad discipline that includes statistics, computer science, applied mathematics, machine learning/AI and visualization.

One of the common use cases of machine learning is in email spam filters. The algorithms are trained by processing millions of emails that have been pre-categorized as either spam or not. The result is an application that can automatically identify the vast majority of junk email and can also continuously improve and adapt as more examples become available.

Relevance to SP network operations
As SPs adopt SDN/NFV, the underlying network infrastructure has become more complex and distributed. SP operations teams have to deal with the dynamic network with unprecedented change, scale and complexity. In this dynamic SP network environment, it is challenging to predefine and determine what will or could go wrong. Relying on the human correlation processes and manual methods that have been in place for past many decades are no longer effective.

Data science has the potential to transform the way SP network operations are done, including the reduction of manual effort involved in network monitoring, troubleshooting and optimization. However, the trick is how to do it in a way that provides clear business value, embedded into the SP operations workflow, and leveraging expert knowledge combined with the data.

Below is a list (not exhaustive) of emerging use cases of data science with in the context of SP network operations. As the SPs adopt modern technologies and operations practices, more applications will surface.

Reducing alert fatigue
In the new world of SDN/NFV, the number of components that need to be monitored and managed has increased exponentially compared to legacy networks. One of the most significant problems facing SP operations teams today is the overwhelming amount of information from distributed network components that generate logs and alerts.

With minimal prioritization and a high false-positive rate, it impossible for operations teams to focus on what matters. With data science techniques, it is possible to understand the context of the alerts and suppress the ones that are not relevant, resulting in a prioritized list of alerts for SP operations team to review and take action.

Proactive network optimization
Good performance and high availability are the primary goals of SP operations teams. They need to proactively detect, identify and resolve performance crises in their network.

Data science provides a methodology for quickly processing the large quantities of monitoring data generated by the network devices, finding repeating patterns in their behavior and building accurate models of their performance. Anomaly detection methods can be used to automatically spot deviations from normal system behavior that could correspond to network failures. A simple example could be if the number of link errors on a particular network interface in the last ten minutes is three standard deviations higher than on other links in the same network; this could indicate a problem.

Advanced security
Traditional security technologies rely on rules and signatures that only use stale information to find threats. The tactics of adversaries are evolving rapidly, and the number of advanced and unknown threats targeting SP networks continues to increase.

Algorithms can be trained to learn the SP environment and adapt to the threat landscape, making decisions about whether something is malicious, and then providing context for the expert to assist with rapid investigation.

Future of SP operations
Self-driving cars provide important insight into the path that data-driven automation is likely to follow. The general principles used in self-driving cars can be extended into SP network operations domain. Collecting massive amounts of data, allowing algorithms to navigate their way through routine tasks, implementing self-learning systems that can adapt to unpredictable situations. The result is likely to be smart network management software that can perform many SP operations tasks with a high degree of reliability.

Some of the hyper-scale operators (Facebook, LinkedIn, Netflix, etc.) are already using self-healing for some basic operational tasks. In the future, SP operations needs to move towards "management by exception," wherein most common errors and performance degradations are addressed via automated self-healing.

— Kiran Inampui, Global Solutions Management Lead, GSP Services, Cisco Systems Inc. (Nasdaq: CSCO)

(3)  | 
Comment  | 
Print  | 
Newest First  |  Oldest First  |  Threaded View        ADD A COMMENT
Phil_Britt
50%
50%
Phil_Britt,
User Rank: Light Sabre
10/2/2017 | 9:04:12 AM
Re: Alert fatigue
The more data science can get involved the better. The attacks, and, therefore, the alerts, will keep coming.
Associat21165
50%
50%
Associat21165,
User Rank: Light Beer
10/1/2017 | 2:21:35 PM
Are there existing Open source or commercial products that use this approach
Great Article !  We all know about products based on SIEM where the rules need to be explicitly defined for the set of actions to be initiated based on certain triggers.  With the recent advances in data sciene and machine learning it would be of interest to know if there are any ready made solutions or service providers who have taken this approach &  reasied their level of operational maturity to the next level ?
Michelle
50%
50%
Michelle,
User Rank: Light Sabre
9/30/2017 | 11:36:09 PM
Alert fatigue
Alert fatigue is a very real thing. It's good to see this addressed.
More Blogs from Column
Networking containerized environments is tough; Arista wants to help.
The emergence of the eSIM will make it easier for customers to change operators and force the industry to have a proper conversation about the largely overlooked prepaid side of the business.
AWS is transforming to reach beyond its hardcore developer base, says analyst Zeus Kerravala.
The security industry has an incredible opportunity to move forward and close the talent gap by thinking outside of the norm and taking a chance on technology-savvy women with translatable skills.
Network operators will each take their own unique journey to becoming more cloud-native. This heterogeneous, 'lumpy' universe will be with us for quite a while.
Featured Video
Flash Poll
Upcoming Live Events
March 12-14, 2019, Denver, Colorado
April 2, 2019, New York, New York
April 8, 2019, Las Vegas, Nevada
May 6, 2019, Denver, Colorado
May 6-8, 2019, Denver, Colorado
May 21, 2019, Nice, France
September 17-19, 2019, Dallas, Texas
October 1, 2019, New Orleans, Louisiana
December 5-3, 2019, Viena, Austria
All Upcoming Live Events