Transcript of "Complex Carrier Network Performance Data on Vertica Yields Performance and Customer Metrics for Empirix"
Complex Carrier Network Performance Data on Vertica
Yields Performance and Customer Metrics for Empirix
Transcript of a BrieﬁngsDirect podcast on how Empirix has leveraged HP Vertica to help
customers derive value from ever-expanding data.
Listen to the podcast. Find it on iTunes. Sponsor: HP
Dana Gardner: Hello, and welcome to the next edition of the HP Discover Performance
Podcast Series. I'm Dana Gardner, Principal Analyst at Interarbor Solutions, your
moderator for this ongoing discussion of IT innovation and how it’s making an
impact on people’s lives.
Once again, we’re focusing on how IT leaders are improving their business
performance for better access, use and analysis of their data and information.
This time we’re coming to you directly from the HP Vertica Big Data Conference
in Boston. [Disclosure: HP is a sponsor of BrieﬁngsDirect podcasts.]
Our next innovation case study interview explores how network testing, monitoring, and
analytics provider Empirix required and found unique and powerful data processing capabilities.
We'll learn how Empirix chose the HP Vertica analytics platform for its analytics engine to
continuously and proactively evaluate carrier network performance and customer experience
metrics to automatically identify issues as they emerge.
To learn more about how a combination of large-scale, real-time performance, and data access
made Vertica stand out to support such demands, please join me in welcoming our guest. We're
here with Navdeep Alam, Director of Engineering, Analytics and Prediction at Empirix, based in
Billerica, Massachusetts. Welcome to the show.
Navdeep Alam: Thank you for having me.
Gardner: It strikes me that the amount of data that's being generated on these networks is
phenomenal, a rapid creation of events. This is sort of the New York of data analysis. If you can
do it there, you can do it anywhere. Tell us a bit about what Empirix does and why you have such
demanding requirements for data processing and analysis?
Alam: With Empirix what we do, as you mentioned, is actively and passively monitor networks.
When you're in a network as a service provider, you have the opportunity to see the packets
within that network, both on the control plane and on the user plane. That just means you're
looking at signaling data and also user plane data -- what's going on with the behavior; what's
going at the data layer. That’s a vast amount of data, especially with mobile, and most people
doing stuff on their devices with data.
When you're in that network and you're tapping that data, there is a tremendous amount of data,
and there's a tremendous amount of insights about not only what's going on in
the network, but what's going on with the subscribers and users of that network.
Empirix is able to collect this data from our probes in the network, as well as
being able to look at other data points that might help augment the analysis.
Through our analytics platform we're able to analyze that data, correlate it,
mediate it, and drive metrics out of that data.
That’s a service for our customers, increasing value from that data, so that they can turn around a
return on investment (ROI) and understand how they can leverage their networks better to
increase operations and so forth. They can understand their customers better and begin to
analyze, slice and dice, and visualize data of this complex network.
They can use our platform as well to do proactive and predictive analysis, so that we can create
even better ROI for our customers by telling them what potentially might go wrong and what
might be the solution to get around that to avoid a catastrophe.
Gardner: It’s interesting that not only is this data being used for understanding the
performance on the network itself, but it's giving people business development and marketing
information about how people are using it and where the new opportunities might be.
Is that something fairly new? Were you able to do that with data before, or is it the
scale and ability to get in there and create analysis in near real time that’s allowed
for such a broad-based multilevel approach to data and analysis?
Alam: This is something we've gotten into. We deﬁnitely tried to do it before with success, but
we knew that in order to really tackle mobile and the increasing demands of data, we really had
to up the ante.
Our investment with HP Vertica and how we've introduced that in our new analytics platform,
Empirix IntelliSight 1.0 that's coming out this month is about leveraging that platform, not only
for scalability and our ability to ingest and process data, but to look at data in its more natural
format, both as discrete data, and also as aggregate data. We allow our customers to view that
data ad hoc and analyze that data.
It positioned us very well. Now that we have a central point from which all this data is being
processed and analyzed, we now run analytics directly at this data, increasing our data locality
and decreasing the data latency. This deﬁnitely ups our ante to do things much faster, in near real
Gardner: Obviously, the sensors, probes, agents, and the ability to pull in the information from
the network needs to reside or be at close proximity to the network, but how are you actually
deployed? Where does the infrastructure for doing the data analysis reside? Is it in the networks
themselves, or is there a remote site? Maybe you could just lay out the architecture of how this is
Alam: We get installed on site. Obviously, the future could change, but right now we're an on-
premise solution. We're right where the data is being generated, where it’s ﬂowing, and because
of that we're able to gain access to the data in real-time.
One of the things we learned is that this is a tremendous amount of data. It doesn't make sense
for us to just hold it and assume that we will do something interesting with it afterwards.
The way we've approached our customers is to say, "What kind of value do you seen in this data?
What kind of metrics or key performance indicators (KPIs), or what do you think is valuable in
this data? We then build a framework that deﬁnes the value that they can gain from data -- what
are the metrics and what kind of structure they want to apply to this data. We're not just
calculating metrics, but we're also applying some sort of model that gives this data some
As they go through what we call the Empirix Intelligent Data Mediation and Correlation (IDMC)
system, it's really an analytics calculator. It's putting our data into the Vertica system, so that at
that point we have meaningful, actionable data that can be used to trigger alarms, to showcase
thresholds, to give customers great insight to what's going on in their network.
Growing the business
From that, they can do various things, such as solve problems proactively, reach out to the
customers to deal with those issues, or to make better investments with their technology in order
to grow their business.
Gardner: How long have you been using Vertica and how did that come to be the choice that
you made? Perhaps you could also tell us a little bit about where you see things going in terms of
other capabilities that you might need or a roadmap for you?
Alam: We've been using Vertica for a few years, at least three or four, even before I came
onboard. And we're using Vertica primarily for its ability to input and read data very quickly. We
knew that, given our solutions, we needed to load a lot of data into the system and then read a lot
of data out of it fast and to do it at the same time.
At that time, the database systems we used just couldn't meet the demands for the ever-growing
data. So we leveraged Vertica there, and it was used more as an operational data store. When I
came on board about a year-and-a-half ago, we wanted to evolve our use of Vertica to be not just
for data warehousing, but a hybrid, because we knew that in supporting a lot of different types of
data, it was very hard for us to structure all of those types of data.
We wanted to create a framework from which we can deﬁne measures and metrics and KPIs and
store it in a more ﬂat system from which we can apply various models to make sense of that data.
That really presented us a lot of challenges, not only in scalability, but our ability to work and
play with data in various ways. Ultimately, we wanted to allow customers to play with this data
at will and to get response in seconds, not hours or minutes.
It required us to look at how we could leverage Vertica as an intelligent data-storage system from
which we could process data, store it, and then get answers out of that data very, very quickly.
Again, we were looking for responses in a second or so.
Now that we've put all of our data in the data basket, so to speak, with Vertica, we wanted to take
it to the next level. We have all this data, both looking at the whole data value chain from
discrete data to aggregate data all in one place, with conforming dimensions, where the one truth
of that data exists in one system.
We want to take it to the next step. Can we increase our analytical capabilities with the data? Can
we ﬁnd that signal from the noise now that we have all this data? Can we proactively ﬁnd the
patterns in the data, what's contributing to that problem, surface that to our customers, and
reduce the noise that they are presented with.?
Instead of showing them that 50 things are wrong, can I show them that 50 things are wrong,
but this one or two issues are actually impacting your network or your subscribers the most? Can
we proactively tell them what might be the cause or the reason towards that and how to solve it?
The faster we can load this data, the faster we can retrieve the value out of this data and ﬁnd that
needle in the haystack. That’s where the future resides for us.
Gardner: Clearly, you're creating value and selling insight to the network to your customers, but
I know other organizations have also looked at data as a source of revenue in itself. The analysis
could be something that you could market. Is there an opportunity with the insight you have in
various networks, maybe in some aggregate fashion, to create analysis of behavior, network use,
or patterns that would then become a revenue source for you, something that people would
subscribe to perhaps?
Alam: That's a possibility. Right now, our business has been all about empowering our
customers and giving them the ability to leverage that data for their end use. You can imagine, as
a service provider, having great insight into their customers and the over-the-top applications that
are being leveraged on their network. Could then they use our analytics and the metadata that
we're generating about their network to empower their business systems and their operations to
make smarter decisions? Can they change their marketing strategy or even their APIs about how
they service customers on their network to take advantage of the data that we are providing
The opportunity to grow other business opportunities from this data is tremendous, and it's going
to be exciting to see what our customers end up doing with their data.
Gardner: Are there any metrics of success that are particularly important for you. You've
mentioned, of course, scale and volume, but things like concurrency, the ability to do queries
from different places by different people, at the same time is important. Help me understand
what some of the other important elements of a good, strong data-analysis platform would be for
Alam: Concurrency is deﬁnitely important. For us it's about predictability or linear scalability.
We know that when we do reach those types of scenarios to support, let’s say, 10 concurrent
users or a 100 concurrent users, or to support a greater segmentation of data, because we have
gone from 10 terabytes to 30 terabytes, we don't have to change a line of code. We don't have to
change how or what we are doing with our data. Linear scalability, especially on commodity
hardware, gives us the ability to take our solution and expand it at will, in order to deal with any
type of bottlenecks.
Obviously, over time, we'll tune it so that we get better performance out of the hardware or
virtual hardware that we use. But we know that when we do hit these bottlenecks, and we will,
there is a way around that and it doesn't require us to recompile or rebuild something. We just
have to add more nodes, whether it’s virtual or hardware.
Gardner: Well, great. I am afraid we'll have to leave it there. We've been learning about how
network testing, monitoring, and analytics provider Empirix found unique and powerful data-
processing capabilities. And we've seen how they deployed the HP Vertica Analytics Platform to
provide better analytics to their customers in the network provider space.
So a big thank you to our guest, Navdeep Alam, the Director of Engineering, Analytics, and
Prediction at Empirix. Thank you, Navdeep.
Alam: Thank you.
Gardner: And thanks also to our audience for joining us for this special HP Discover
Performance Podcast coming to you from the HP Vertica Big Data Conference in Boston.
I'm Dana Gardner, Principal Analyst at Interarbor Solutions, your host for this ongoing series of
HP sponsored discussions. Thanks again for listening, and come back next time.
Listen to the podcast. Find it on iTunes. Sponsor: HP
Transcript of a BrieﬁngsDirect podcast on how Empirix has leveraged HP Vertica to help
customers derive value from ever-expanding data. Copyright Interarbor Solutions, LLC,
2005-2013. All rights reserved.
You may also be interested in:
• Advanced IT monitoring Delivers Predictive Diagnostics Focus to United Airlines
• HP Vertica Architecture Gives Massive Performance Boost to Toughest BI Queries for
• HP-Fueled Application Delivery Transformation Pays Ongoing Dividends for McKesson
• Podcast recap: HP Experts analyze and explain the HAVEn big data news from HP
• HP's Project HAVEn rationalizes HP's portfolio while giving businesses a path to total
• Insurance leader AIG drives business transformation and IT service performance through
center of excellence model
• HP BSM software newly harnesses big-data analysis to better predict, prevent, and
respond to IT issues