Assuring the Telco Cloud
As CSPs undergo a digital transformation -- running their business from a cloud environment, selling digital services and operating like webscale Internet companies -- assuring the telco cloud environment and business processes will take high priority. With networks virtualizing, services digitalizing and IoT looming large, the business risks are much higher than envisaged. Assurance of the telco cloud network and services will be key in assuring the new telco cloud business.
The telco cloud is defined as a virtualized telecom infrastructure to run digital services and agile operations. Accuracy, speed and error-free operations of the telco cloud are critical to the success of a digital business. Clever solutions, which derive and offer customer intelligence in addition to service assurance, will play a critical role in the success of the telco cloud business.
Certain OSS concepts will need revisiting to make them relevant to the telco cloud unknowns. These are Service Quality Management, creating a faultless telco cloud and enabling the digital service provider to be an "intelligent platform."
Service Quality Management for the telco cloud
Service Quality Management (SQM) is not a new concept; however, it will be an important one in the coming years. Current SQM focuses on proactive monitoring of customer-facing services, which have not always required reliable, secure, fast and always available networks. However, with the anticipated increasing rollout of telco cloud services in 2017, the current functionality of an SQM system will be stretched to cover the higher speed and scale of a digital services environment.
NFV, the underlying technology of telco cloud, has partly evolved as a consequence of the growing appetite of consumers for faster, on-demand and reliable services. Some of the most popular digital services in the new networks will be video streaming, telemetry, mobile gaming and home automation.
In addition, NFV is associated with demanding SLAs between the service provider and its customers. With VoLTE, ViLTE (video-over-LTE) and other advanced communication services launched as digital services, high levels of corporate SLAs will be required to compete with the slick services offered by the OTT providers. The SLA situation worsens with IoT, where inter-communicating sensored devices become the new "customers" and may make high demands on reliability and availability, if they are "mission-critical" connections such as those between autonomous cars or related to remote surgery.
The importance of SQM in the telco cloud can be assigned to the following key reasons:
In its digital avatar, SQM helps CSPs to address the new service challenges posed by the telco cloud.
A faultless telco cloud
The cloud-based digital services are expected to run on highly reliable and error-free networks. Digital services require real-time dynamic adaptation and customization of the communication network, which drive expectations or objectives for network/service/device failures to be reduced to a minimum.
Moreover, in an IoT world, failed devices or connections might not only breach SLAs with massive penalties but, more importantly, they might impact life-critical or mission-critical communication. Although complex mesh topologies with high availability and inbuilt redundancy will reduce the impact of such failures, they still require a system to discover, interpret and manage the faults.
With the network disruption induced by NFV and IoT, every new piece of equipment, software and device will bring its own failure points. In this environment, traditional network/device fault management needs to be raised to the next level.
Other than the mentioned technology turn (NFV and IoT), the revamping of fault management is necessitated by the demand for higher speed of service delivery and problem resolution. Monitoring and assessing the impact of failures on the new network elements and user devices is critical, especially when services are time-critical and, in many cases, life-critical too.
This justifies the evolution of current NOC/SOCs to a zero-touch operations center, where extensive automation will speed up the reporting, fault-finding and remediation. By feeding fault data to SQM systems, CSPs can instantaneously understand the impact of faults on services and, with the use of predictive algorithms, prevent faults from occurring.
Many use cases can be served through a highly automated, predictive fault management system:
The automation of operations center processes is key to achieving success in the virtualized and digitalized telco cloud environment. CSPs are working towards realizing a fully automated, zero-touch operations center using closed-loop corrective actions, complex algorithms and machine learning. And to support the dynamic SLAs of the telco cloud, the OSS is expected to support on-demand capacity configuration and dynamic topology changes, which can happen only through automated real-time network feedback and automatic configurations.
Analytics to evolve to an 'intelligent platform'
CSPs are ready to shake off the label of being the "dumb pipe" through the use of sophisticated analysis of the massive and valuable data traversing their networks. As digital service providers, they are looking at monetizing customer behavioral data as well as connectivity as they aggressively launch new digital services to challenge the growing popularity of OTT services.
Analytics capabilities can deliver trends on performance, capacity and faults using machine-learning tools. But more than the operational benefits of analytics, they provide critical intelligence that can be used for network monetization and service personalization, by understanding the usage of the telco cloud, the services it offers, its customers and devices. As an example, CSPs can proactively identify low-congestion zones/locations ("free zones") and rapidly fill spare capacity with revenue-generating traffic from new service offers such as video streaming, mobile TV or smartphone apps, contextualized by location, time and customer need.
In addition, with CSPs extending their business to become IoT service providers, machine-learning based analytics will be popular to manipulate data and generate critical business intelligence for each of the IoT industry verticals.
Underlying technologies to make telco cloud management successful
The next-generation telco cloud promises the creation and deployment of new services in shorter time periods, down from a few months to a few days. To respond to this need, CTIOs are now developing new architectures for service assurance, of which SQM, automation and analytics form a key component. The architectures are based on open APIs, Big Data clustering and OpenStack capabilities.
Other than the introduction of these new technologies to the underlying platforms, it is important to develop a microservices architecture, which uses DevOps-enabled iterative processes to quickly respond to customer needs by developing services faster. This is how the customer expectation of using new features every week or every few days will be realized. This also helps in conducting root cause analysis faster and resolving customer issues quickly.
An integrated approach of analytics, automation and SQM requires some drastic changes in the way data is churned, visualized and actioned. For a successful launch of the telco cloud, long-term assurance of digital services and the creation of business value out of data, it is critical to re-define the features of SQM, zero-touch predictive operation centers and analytics for data monetization.
Sandeep Raina, Product Marketing Director, MYCOM OSI