Normal vs. abnormal problems
In the video world, not everything is 100% good all the time.
It was impossible to prioritize problems with MOM (Microsoft Operations Manager). In MOM, each event requires attention.
It wasn't possible to correlate or associate events to draw better conclusions. Understanding what was going on was a manual process.
How many of that error occurred on that channel yesterday? MOM can't address that sort of question.
MOM is focused on ops and is event driven. It is designed for errors that occur in a vacuum, such as running out of disk space.
Analyzing app event logs means looking at rate of error, distribution of error, and frequency: more statistical analysis than "x breaks, y is the response".
Video is an app we all deal with, and people take video for granted. MPEG, as a group of standards, is really not what you would think of as a well-defined standard. It's more a set of tools and suggestions on how you can use those to construct various things.
Analyzing the log files produced while processing this video.
We needed to solve for quality, and MOM can't do that.
Quality is hard to track for:
In MSN MOM, each event requires attention.
The capability to analyze events in a unified fashion or ask simple questions was missing.
There was no way to correlate data together or associate events to reach conclusions.
Technical integration problems: I had my hands tied. I had to manually read the event logs.
MOM couldn't answer the questions I was asking.
MOM is designed for errors that occur in a vacuum.
Technically interacting with video: MPEG as a group of standards is not well defined. It is more like a set of tools and suggestions on how to construct various things.
We're trying to analyze for quality. MOM had no concept of, or capability for, approaching the problem that way.
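To make the "how many of that error occurred on that channel yesterday?" question concrete, here is a minimal sketch of the kind of counting MOM could not do. It assumes the events have already been parsed into (day, channel, error) tuples. The channel names, error names, and function names are all hypothetical and are not taken from the real logs.

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical parsed events: (day, channel, error).
# None of these names come from the real event logs.
EVENTS = [
    (date(2024, 3, 4), "channel_7", "continuity_error"),
    (date(2024, 3, 4), "channel_7", "continuity_error"),
    (date(2024, 3, 4), "channel_9", "continuity_error"),
    (date(2024, 3, 5), "channel_7", "continuity_error"),
    (date(2024, 3, 4), "channel_7", "buffer_underrun"),
]


def tally(events):
    """Count occurrences of each (day, channel, error) combination."""
    return Counter(events)


def errors_yesterday(events, channel, error, today):
    """How many of that error occurred on that channel yesterday?"""
    yesterday = today - timedelta(days=1)
    return tally(events)[(yesterday, channel, error)]


if __name__ == "__main__":
    print(errors_yesterday(EVENTS, "channel_7", "continuity_error",
                           date(2024, 3, 5)))
```

The same tally can be sliced by channel or by error alone to get the rate and distribution views, which is the statistical angle rather than the one-event-one-response angle.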
Top Errors
Top Errors Channels
Error Services Yesterday vs Today
They can drill down into the events when they happen and see what the events were.
This is the primary dashboard that both operations and engineering use as a launch point.
Field service is also using Splunk (more later).
We do look-ups: EPG call sign, relevant IP addresses, MPEG program numbers, ingress stream.
Drill-down dashboards: 2 channels from the same provider, immediately a provider issue. Service free buffering. Specific host error, encoding gear misconfigured.
Overall system health.
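As a rough illustration of how look-ups feed the drill-down reasoning, the sketch below enriches raw events with call sign, MPEG program number, and provider, then flags any provider with errors on two or more channels. This is plain Python, not Splunk configuration. The table contents, field names, and function names are assumptions made up for the example.

```python
from collections import defaultdict

# Hypothetical lookup table keyed by stream IP address.
# Call signs, program numbers, and providers are invented for illustration.
STREAM_LOOKUP = {
    "239.0.0.1": {"call_sign": "AAAA", "mpeg_program": 1, "provider": "provider_a"},
    "239.0.0.2": {"call_sign": "BBBB", "mpeg_program": 2, "provider": "provider_a"},
    "239.0.0.3": {"call_sign": "CCCC", "mpeg_program": 3, "provider": "provider_b"},
}


def enrich(event, lookup=STREAM_LOOKUP):
    """Return a copy of the event with lookup fields added when the IP matches."""
    return {**event, **lookup.get(event.get("stream_ip"), {})}


def suspect_providers(events, lookup=STREAM_LOOKUP):
    """List providers that show errors on two or more distinct channels."""
    channels = defaultdict(set)
    for event in events:
        enriched = enrich(event, lookup)
        if "provider" in enriched:
            channels[enriched["provider"]].add(enriched["call_sign"])
    return sorted(p for p, signs in channels.items() if len(signs) >= 2)
```

Events whose IP is not in the table pass through unchanged, so a missing look-up entry does not hide the original error.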
Dserver: instant channel change (pre-buffers very quickly). Reliable UDP.
Set-top boxes can keep track of the video stream. If a packet is missing, the set-top box can request the packet from the Dserver. We can deliver a solution to the problem just in time.
The Dserver creates logs that tell us how many and which packets it sends to customers.
Initially we didn't do anything with them. We didn't know what was in them.
"Hey guys, we can figure out how to analyze session retry reports. And look, because we index them, we can tell you which of your customers are having problems with delivery of the video stream, based on the packets they request. We can figure out if certain set-top boxes are having issues and require attention."
I heard about it and said, I have Splunk. We figured out how to encode
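The retry analysis can be sketched roughly as follows. The real Dserver log format is not described in these notes, so the sketch assumes each record has already been reduced to a (box_id, packets_requested) pair. The box IDs, the threshold, and the function names are hypothetical.

```python
from collections import defaultdict


def retries_per_box(records):
    """Sum the packets each set-top box asked the Dserver to resend."""
    totals = defaultdict(int)
    for box_id, packets_requested in records:
        totals[box_id] += packets_requested
    return dict(totals)


def boxes_needing_attention(records, threshold):
    """Return the boxes whose total retry requests exceed the threshold."""
    totals = retries_per_box(records)
    return sorted(box for box, count in totals.items() if count > threshold)


if __name__ == "__main__":
    # Hypothetical records: (box_id, packets_requested).
    sample = [("box_1", 3), ("box_2", 40), ("box_1", 2), ("box_2", 75)]
    print(boxes_needing_attention(sample, threshold=100))
```

The threshold is a placeholder. In practice it would be chosen by looking at what a normal box requests, since some packet loss is expected.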