The Thesis
What is NetworkManagement ?         Alarms  Service Status                       Control workflow  Trouble-shoot    Config...
What is NetworkManagement ?      Problems?      Alarm Monitoring      Service Management       - Monitor       - Configure...
Main Thesis Use domain-specific languages to  specify alarm and service models   Explicit knowledge   Text-based repres...
Research Structure                         Service Models                                                                 ...
Problems and                                            Contributions                        Alarm Models                 ...
Attacking the Problems                    Challenges                    Solutions                      Computer ScienceSer...
Publication Overview Journals                          Conferences/Workshops      IEEE IT Professional                 ...
Contents             Problems? – Input from Service              Providers The Alarm Problem             The Service Ma...
Coming Changes20Operators                             11
Research Efforts20Operators                               12
Alarm Chain                                                      ?Managed System                            Management Sys...
The Alarm ProblemMost network elements […] does nothave the notion of an alarm state.Devices emit event notifications when...
The Alarm Problem Too many                                         ?    > 1 / Sec    Which ones are relevant?    Sever...
Interpreting an Alarm*A0628/546 /08-07-01/10 H 38/ N=0407/TYP=ICT/CAT=SI  /EVENT=DAL/NCEN=AMS1  /AM=SMTA7/AGEO=S1-TR03-B06...
Confusing Alarm SeverityOriginal Severity             Priority set byfrom Device                   Operator               ...
Hard-to ManageSeverity Distribution                  Hollifield, B., Habibi, E.:                  The Alarm Management Han...
Alarm Type Distribution90%                          …3500      26                                  20
Alarm Monitoring                Domain-Specific ModelsModeling Alarms – Enable Automation and Increase Quality            ...
Research Structure                         Service Models                         Service Type                            ...
Alarms Today                                                           ? We have:    Alarm interface standards    Envel...
Alarm Model               BASS              Alarm Types              Predicates              Constraints              - In...
BASS       25
Bass Prototype and                  Validation                       .alarm                                           Corr...
Semantic Constraints173 warnings in approved and released alarm interface                                                 ...
Information Constraints to   Automate Correlation   Automatic identification of root-cause candidates                     ...
Alarm Monitoring              Data-Mining and Self-LearningAssigning Correct Severity Levels by Learning from Experts     ...
Research Structure                         Service Models                         Service Type                            ...
Learning Alarm PrioritiesDatabasesFrom RealServiceProvider                               Suggest        Neural Network    ...
Result                                  Distribution of Errors                                           Neural           ...
The ServiceManagement Problems                      33
Service Management ”Services are not currently managed well in any suite of  applications and require a tremendous amount...
Complex Structures“Service Models”                       Configuration               Class Name              Attribute    ...
Research Structure                         Service Models                         Service Type                            ...
My Two Tracks for Service     Management                 Configuration Changes   Service Type                             ...
Simplifed Structures          Configuration           Models               Models           Monitoring    Remove room for ...
SALmon ExampleBroadbandForum TR-126Triple PlayQoE Requirements                                    40
SALmon Test                           •   The TR-126                               model could be                         ...
My Two Tracks for Service     Management                 Configuration Changes   Service Type    IETF YANG                ...
Service Configuration and         Activation IETF Defined YANG as data-modeling language for  managing devices    “Repla...
SALmon and YANG               SALmon             IETF YANG            CommentModel          Object Oriented    Tree       ...
Conclusions For Research    Closer cooperation with equipment and service providers    Network management is in need of...
Future Work                                                                      Alarm Models SALmon features represented...
Errata Paper C :    Says trivial approach is correct in 17 % of the cases    Should be 11 % Section 2 :    Wrong “T”,...
Thank You !Klacke Wikström                                  Jörgen ÖfjellHåkan Millroth                                   ...
50
Stefan Wallin Ph D Presentation : Rethinking Network Management
Stefan Wallin Ph D Presentation : Rethinking Network Management
Stefan Wallin Ph D Presentation : Rethinking Network Management
Stefan Wallin Ph D Presentation : Rethinking Network Management
Upcoming SlideShare
Loading in …5
×

Stefan Wallin Ph D Presentation : Rethinking Network Management

347 views
277 views

Published on

Stefan Wallins Ph D Presentation 23 Feb 2013.
http://pure.ltu.se/portal/sv/publications/rethinking-network-management-solutions%28524ec0f6-7cb3-45bd-b350-72a21f0b7c6e%29.html

Published in: Education, Business, Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
347
On SlideShare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
18
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Stefan Wallin Ph D Presentation : Rethinking Network Management

  1. 1. The Thesis
  2. 2. What is NetworkManagement ? Alarms Service Status Control workflow Trouble-shoot Configure Service with trouble-tickets Configure Device 2
  3. 3. What is NetworkManagement ? Problems? Alarm Monitoring Service Management - Monitor - Configure 3
  4. 4. Main Thesis Use domain-specific languages to specify alarm and service models  Explicit knowledge  Text-based representation Use data-mining and self-learning to capture “hard-to-model” things  Tacit knowledge 4
  5. 5. Research Structure Service Models Alarm ModelsConfiguration Changes Service Type Status Calculation Causality Alarm Type Service Type Component Alarm Type Device Type Constraints Constraints 5
  6. 6. Problems and Contributions Alarm Models  Defined a Domain-Specific Alarm Type Language BASS for specifying Causality alarm models Alarm Type  Model Quality Constraints  Automatic Correlation  Data-Mining and Self-Learning to assign alarm severity levels Service Models  Domain-Specific Languages forConfiguration Changes Service Service Management Status Calculation Type Service  Defined SALmon for monitoring Type Component  Test of IETF YANG for Service Configuration Device Type Constraints 6
  7. 7. Attacking the Problems Challenges Solutions Computer ScienceService Providers Validations Solutions specialists fromEquipment Vendors me • LTU • Data Ductus • Tail-f • YALTS Journals Conferences 7
  8. 8. Publication Overview Journals  Conferences/Workshops  IEEE IT Professional  IFIP ManWeek  Springer Journal of Network and  IEEE IM Systems Management  IEEE NOMS  John Wiley & Sons International Journal of  Usenix LISA Network Management  IEEE AINA TeNAS  Inderscience International Journal of  IEEE SOSE Business Intelligence and Data-Mining  Springer Telecommunications Systems 8
  9. 9. Contents  Problems? – Input from Service Providers The Alarm Problem  The Service Management Problem Alarm Solutions  Service Management Solutions  BASS  Monitoring with SALmon  Alarm prioritization  Configuring with IETF YANG  Conclusions and Future Work  Acknowledgements 9
  10. 10. Coming Changes20Operators 11
  11. 11. Research Efforts20Operators 12
  12. 12. Alarm Chain ?Managed System Management System Estimated Resource Alarm Estimated Alarms Resource States Notifications Alarms States Alarm Type Resource Severity Raise / Clear Text 14
  13. 13. The Alarm ProblemMost network elements […] does nothave the notion of an alarm state.Devices emit event notifications wheneveran implementor thought this is a goodidea [around] 40% percent of the alarms are considered to be redundant as many alarms appear at the same time for one ’fault’. Many alarms are also repeated [...]. One alarm had for example appeared 65000 times in today’s browser. Correlation is hardly used even if it supported by the systems, [current correlation level is] 1-2 % maybe. 15
  14. 14. The Alarm Problem Too many ?  > 1 / Sec  Which ones are relevant?  Several alarms for the same fault Wrong severity levels Interpreting meaning and impact 16
  15. 15. Interpreting an Alarm*A0628/546 /08-07-01/10 H 38/ N=0407/TYP=ICT/CAT=SI /EVENT=DAL/NCEN=AMS1 /AM=SMTA7/AGEO=S1-TR03-B06-A085-R000 /TEXAL=IND RECEPTION/COMPL.INF: /AF=URMA7/ICTQ7 AGCA=S1-TR03-B06-A085-R117/DAT=08-07-01/HRS=10-38-14 /AMET=07-020-01 /AFLR=175-011/PLS/CRC=NACT /NSAE=186/NSGE=186/NIND=14/INDI=956/NSDT=0 17
  16. 16. Confusing Alarm SeverityOriginal Severity Priority set byfrom Device Operator 18
  17. 17. Hard-to ManageSeverity Distribution Hollifield, B., Habibi, E.: The Alarm Management Handbook 19
  18. 18. Alarm Type Distribution90% …3500 26 20
  19. 19. Alarm Monitoring Domain-Specific ModelsModeling Alarms – Enable Automation and Increase Quality 21
  20. 20. Research Structure Service Models Service Type Alarm ModelsConfiguration Changes Status Calculation Alarm Type Causality Service Type Component Alarm Type Device Type Constraints Constraints 22
  21. 21. Alarms Today ? We have:  Alarm interface standards  Envelope, the parameters  Alarm documentation  Informal documents for humans What we do not have:  Formal alarm definitions that can be used for automation  The contents of the envelope  “Alarm Model” 23
  22. 22. Alarm Model BASS Alarm Types Predicates Constraints - Information - Semantic 24
  23. 23. BASS 25
  24. 24. Bass Prototype and Validation .alarm CorrelatedAlarm Doc Alarm DBfrom Feedback Correlation BASS Rules Alarms fromReal Vendor Real Operator Uncorrelated Documentation Graphs Information Constraints Semantic Constraints 26
  25. 25. Semantic Constraints173 warnings in approved and released alarm interface 27
  26. 26. Information Constraints to Automate Correlation Automatic identification of root-cause candidates 28
  27. 27. Alarm Monitoring Data-Mining and Self-LearningAssigning Correct Severity Levels by Learning from Experts 29
  28. 28. Research Structure Service Models Service Type Alarm ModelsConfiguration Changes Status Calculation Alarm Type Causality Service Type Component Alarm Type Device Type Constraints Constraints 30
  29. 29. Learning Alarm PrioritiesDatabasesFrom RealServiceProvider Suggest Neural Network Priority Alarm Trouble Ticket Alarm  Prio System System Training Priority 31
  30. 30. Result Distribution of Errors Neural Original network severity • Neural network correct in 53 %Percentage of Alarms • Original severity correct in 11 % Magnitude of Error 32 Too high Too low
  31. 31. The ServiceManagement Problems 33
  32. 32. Service Management ”Services are not currently managed well in any suite of applications and require a tremendous amount of work to maintain” ”Service models are becoming more and more important” ”Focus on service management - bringing this up to 40% from [the] current level of 5-10%” ”Managing services must be the focus of the future development, while pushing network management into a supporting role” 34
  33. 33. Complex Structures“Service Models” Configuration Class Name Attribute Attribute Operation Operation Software Class Name ImplementationAttributeAttributeOperationOperation Monitoring Interpretations and Tedious Mappings 35
  34. 34. Research Structure Service Models Service Type Alarm ModelsConfiguration Changes Status Calculation Alarm Type Causality Service Type Component Alarm Type Device Type Constraints Constraints 37
  35. 35. My Two Tracks for Service Management Configuration Changes Service Type Status Calculation IETF YANG SALmon Service Type Component Device Type 1 Model the Services 2 Express the transformations 38
  36. 36. Simplifed Structures Configuration Models Models Monitoring Remove room for interpretations and automate mappings 39
  37. 37. SALmon ExampleBroadbandForum TR-126Triple PlayQoE Requirements 40
  38. 38. SALmon Test • The TR-126 model could be executedSLA and Service • Compact complete modelmonitor UI • Easy to change in one place 41
  39. 39. My Two Tracks for Service Management Configuration Changes Service Type IETF YANG Status Calculation SALmon Service Type Component Released Device Type 2010 1 Model the Services 2 Express the transformations 42
  40. 40. Service Configuration and Activation IETF Defined YANG as data-modeling language for managing devices  “Replacing SNMP MIBs” Thesis:  YANG can be used to model services, not only devices  Service Configuration as a YANG – YANG transform Work:  Service Modeling projects at service providers  Service Activation product, Tail-f NCS 43
  41. 41. SALmon and YANG SALmon IETF YANG CommentModel Object Oriented Tree Tree structures moreStructure suited for renderingPurpose Operational Data Configuration Data and Operational Data Time-SeriesCalculations Functional - - YANG to YANG mapping in Java for imperative configuration - XPATH possible to express aggregationConstraints - XPATH 44
  42. 42. Conclusions For Research  Closer cooperation with equipment and service providers  Network management is in need of computer science For Network Equipment Providers  Provide models (in a form) that can be used for automation  Interface quality For Service Providers Overcome current practice of incomplete illustrations and free-form documents  Model the offered services  Knowledge management 46
  43. 43. Future Work Alarm Models SALmon features represented in YANG Alarm Type  Language extensions or as models Causality  Time-series Alarm Type  Functional calculations Constraints  XPATH Database representation Service Models Imperative activation as part of Configuration Changes Service Status Calculation the model ? Type Service More knowledge management by using Type Component data-mining and self-learning Device Type Constraints 47
  44. 44. Errata Paper C :  Says trivial approach is correct in 17 % of the cases  Should be 11 % Section 2 :  Wrong “T”, should be: 48
  45. 45. Thank You !Klacke Wikström Jörgen ÖfjellHåkan Millroth Johan EhnmarkMartin Björklund Christer Åhlund Andreas JonssonSeb Strollo Johan Nordlander Ulrik ForsgrenJohan Bevemyr Viktor Leijon Magnus KarlssonJoakim Grebenö Robert Brännström Leif LandénChris Williams Karl Andersson Daniel Granlund Dan Johansson Nicklas BystedtMikael Börjesson EU Funded Equipment Vendors and Sidath Handurukande Service Providers Magneto Project Test Data 49
  46. 46. 50

×