Saturday, February 15, 2020

AIOps for Telcos: Is it Automation or Insight? Ericsson Chooses

Whether AIOps is about insight or automation often is unclear, especially because the stated goal of AIOps to gain insights that lead to automated action, supervised or unsupervised by humans. Ericsson, for example, talking about applied artificial intelligence in communications networks, prefers the “automation” label. 

In large part, that might reflect the particular challenges of the connectivity service provider industry, where many revenue and business problems grow from unhappy customers and poor customer experience (dropped calls, no coverage, slow internet speeds, poor connection quality). 

“Previous Ericsson research shows that almost 50 percent of consumer network perception is based on personal experiences of the network, indicating the huge importance of network quality as the key to customer satisfaction,” Ericsson says. 


Customer satisfaction, in turn, is believed to be causally related to word of mouth referrals, prospect perceptions of quality and value, customer churn, customer acquisition and therefore revenue, profits and market share. 

“Whether customers give you US$2 or $20 ARPU, if you do not provide the quality they expect, they will churn,” says Vicente Cotino Director of Network Operation Maintenance, Orange Spain. 

To be sure, network performance is not the only driver of satisfaction. “At least one-third of the total NPS (net promoter score, a measure of customer willingness to recommend a product or service to others) score is derived from network performance.” 

Of the service providers surveyed by Ericsson, 80 percent use NPS as a key metric in operations. Also, several service providers indicate that 40 percent to 60 percent of operations key performance indicators are business-related.

Service providers, in other words, must do many things right, beyond ensuring that the network works as it is supposed to work. What is hard to untangle is the percentage of benefit that AIOps might provide, and whether it is automating changes or detecting issues that is “more important.” 

Clearly the two are related. Action is not possible without insight; nor is insight useful unless changes can be made fast. 

In surveys undertaken by Ericsson, 80 percent of respondents say automation is key for the cost and customer experience. About 90 percent of operations personnel say AI is important in protecting customer experience. 

As always, methodology matters. I do not know how the questions were framed. It is not clear how the responses might have been different if poll takers were asked about whether better network and consumer behavior insight and prediction would protect customer experience. 

But Ericsson frames the issue as one of “automation.” That might simply be a reflection of our general confusion about effectiveness and efficiency. People often speak about the importance of efficiency, which might be expressed as getting work done faster, with fewer people, at less cost or with fewer mistakes and rework. 

Less common are evaluations of effectiveness, which might be expressed as “doing the right things” to produce value or desired outcomes. A traditional way of illustrating the difference is to note that an organization gains little to nothing by automating things that should not be done. 

In principle, AIOps as applied to a communications network requires automated insight to produce proactive network responses. So far, we seem to have gotten better at insight, while IT managers still debate the value of unsupervised action by AI-driven systems to correct and modify network operations. 

Put simply, “nobody trusts the system to behave autonomously” at the moment.

Friday, February 14, 2020

Is AIOps About Automation or Insight?

For more than a year, it seems, there has been some debate about whether AIOPs fundamentally is about automation or insight. It can be hard to discern, as a fully-developed AIOps solution, capable of providing insights, also often also is said to include the ability to proactively diagnose and resolve issues. 

Part of the problem is that the approach is so new. Right now, developers are creating new capabilities around the ingestion of data, correlation of incidents and resolution of issues. AIOps is “about making sense of large volumes of information,” says Ciaran Byrne, OpsRamp VP. 

There is an element of remediation and automation, though, he notes. 

At the current stage of development, nobody is willing to entrust AI-enabled systems to reconfigure networks, servers and software autonomously. Beyond that, though, the fundamental value of AIOps arguably is not the automated response but the move “from hindsight to insight to foresight,” says Joris DeWinne, StackState solutions architect. 

Today, enterprises monitor the health of components and systems,  but the problem has been too many notifications, he says. At the next level, monitoring adds logs and metrics. Next-generation monitoring has been added by firms such as ServiceNow, AWS and Azure, using Kubernetes, for example.

AIOps will represent the next level after that, where prediction starts to be feasible, he argues. 

Thursday, February 13, 2020

AI Helps, Says Deepsense.AI

Artificial intelligence can solve a number of IT management challenges, argues Andy Thurai, Deepsense.AI U.S operations head. 

Any anomaly can signal the existence of a problem, as the default state of the infrastructure ecosystem is stable, he says. So early detection of an anomaly is usually a sign of a problem that has yet to be fully understood, he says. 

Root cause identification another advantage, when most enterprises have multiple to scores of separate monitoring systems, Thurai says. 

Service tickets analytics are another advantage of applying AI to operations. When fed data on tickets submitted to a service desk, an ML-based model can predict seasonal spikes and requests. This can help the service desk owner deploy help desk personnel as needed. So we predict the number of call center agents you need to have on site, says Thurai. 

AI also can be applied to detect seasonality trends and adjust capacity accordingly. 

Machine-powered analysis delivers insights that are beyond the reach of humans. Frequent pattern mining is possible because AI canfind correlations that are impossible for humans to detect. 

AIOps solutions also eliminate noise and allow staff to concentrate on the real underlying problems.

Incident Management on a Single Pane of Glass?

Sameer Hutheesing, AVP,  Business Development, GAVS Technologies, believes it now is possible to correlate most incident reports into a single system. That might not yet be a full AIOPs capability, allowing enterprises to close their network operations centers, but it is hard to argue this is not useful. 

video
 auto discover the network, apps, users

5 Years to Comprehensive AIOPs?

It is not so clear that AIOps is going to be widely deployed in less than five years, argues Frank Yue, Kemp Technologies solutions architect. 

“AIOps is more complex than you think,” said Yue. “It is hard for AI to figure out all the relevant connections” between apps and the network. 

“Network performance monitoring is separate from app performance monitoring; security policy management is different from the separate analytics for each subsystem,” Yue said. The network, server, app, security, storage, WAN and cloud all have to be integrated. 

Then there are the vendor specific solutions that are proprietary, and many different protocols are in use, including HTTP, DNS, BGP, Java and .net. Different languages including XML, Syslog, RTTP REST, SNMP, CLI also must be accommodated. There also are different functions, including server load balancing, routers, switches, Yue said.

It is hard for AI to know, in advance, the context of relationships between systems, apps, actions, he said. So a complete AIOPs platform must be multilingual, multi-vendor, mult-technology and multi-system. 

And “nobody wants to buy 10 AI solutions,” he added. 

There is, in other words, no present way to build a complete and comprehensive monitoring capability with a single pane of glass.

Monday, February 10, 2020

BiogPanda Touts Root Cause Capabilities

BigPanda says it is “the first AIOps solution to ingest changes from disparate change feeds and tools, and correlate and analyze these changes against alerts collected from enterprise monitoring tools to rapidly isolate the root cause change that resulted in an incident or outage.”

The BigPanda platform expansion speeds up incident and outage resolution by ingesting changes from any source of change data, including change management, change log, configuration management, and others, BigPanda says. 

BigPanda’s Root Cause Changes feature uses machine learning to correlate and analyze this dataset alongside the dataset of alerts collected from monitoring tools.

The Real-time Topology Mesh provides a real-time topology model across the entire IT stack, BigPanda says.



MWC and AI Smartphones

Mobile World Congress was largely about artificial intelligence, hence largely about “AI” smartphones. Such devices are likely to pose issue...