SlideShare a Scribd company logo
1 of 8
Download to read offline
Operational Requirements for Enhanced BGP Error
Handling in BGP-4
draft-shakir-idr-ops-reqs-for-bgp-error-handling




                                                   Rob Shakir <rob.shakir@cw.com>
                                                               IETF 80 – Prague, CZ.
Problem Statement.

•  NOTIFICATION	
  based	
  on	
  errors	
  in	
  BGP-­‐4	
  UPDATE	
  messages	
  cause	
  
   dispropor?onate	
  failures	
  in	
  Service	
  Provider	
  Networks.	
  
                                  IPv4
                                 L3VPN                                                            ASN 1
	
                                                                                                        ASN 2
                                                   IPv6
       RR                                         L3VPN
                                                                                                             ASN 3
                                                             AS BR               IP "TRANSIT"                           ...
                        PE
                                                   VPLS
                                                                                                                        ...
       RR
                                                                                                                  ...
                                         Global
                                          IPv4
                                                                                                 ASN N
                             Global
                              IPv6




iBGP	
                                                    eBGP	
  
•  Mul%ple	
  AFIs	
  (services)	
  affected.	
  
         	
                                               •  Paths	
  to	
  all	
  NLRI	
  affected	
  despite	
  error	
  
•  Discrete	
  rou%ng	
  topologies	
  affected	
             in	
  single	
  UPDATE.	
  
   (e.g.	
  different	
  L3VPNs)	
  
Avoiding sending NOTIFICATION.

•  Operator’s	
  deployments	
  mean	
  compromises	
  to	
  protocol	
  correctness	
  
   resul?ng	
  in	
  invalid	
  rou?ng	
  may	
  be	
  acceptable.	
  
       –  Par%cularly	
  with	
  mul%ple	
  AFI	
  –	
  some	
  carrying	
  many	
  discrete	
  topologies.	
  
	
                                                                                          VRF 1
	
                                                                                               CPE A

	
                                                                                               CPE B

                        RR                                 PE

                                                                                           VRF 2

                                !""#$%&'#()*+*"*,-.'/                                            CPE C
                                    -0#'-0,".#1!&2


•  Requirement	
  is	
  to	
  avoid	
  sending	
  NOTIFICATION	
  where	
  possible.	
  
       –  Do	
  not	
  send	
  for	
  erroneous	
  UPDATEs	
  (and	
  hence	
  avoid	
  teardown).	
  
       –  Session	
  failure	
  affects	
  all	
  NLRI,	
  where	
  nega%ve	
  impact	
  affects	
  a	
  subset.	
  
       –  Required	
  for	
  both	
  eBGP	
  and	
  iBGP.	
  
Recover RIB Consistency.

•  Inconsistent	
  RIB	
  (by	
  trea?ng	
  UPDATE	
  as	
  withdraw)	
  compromises	
  protocol	
  
   correctness.	
  
       –  The	
  resul%ng	
  RIB	
  inconsistency	
  may	
  have	
  resulted	
  in	
  forwarding	
  loops	
  or	
  black-­‐holes.	
  
       –  BGP	
  speaker	
  is	
  aware	
  of	
  this	
  case,	
  if	
  using	
  “treat-­‐as-­‐withdraw”.	
  


•  Whilst	
  such	
  inconsistencies	
  are	
  acceptable,	
  they	
  are	
  clearly	
  sub-­‐op?mal.	
  
       –  Mechanism	
  required	
  to	
  recover	
  consistency	
  of	
  the	
  RIB,	
  and	
  remove	
  invalid	
  rou%ng.	
  
	
  
•  Whole	
  RIB	
  or	
  specific	
  RIB	
  subset?	
  
       –  ROUTE	
  REFRESH	
  is	
  inefficient	
  where	
  a	
  BGP	
  speaker	
  knows	
  the	
  NLRI	
  transmiWed	
  in	
  the	
  
          invalid	
  UPDATE.	
  
       –  Requirement	
  for	
  mechanism(s)	
  to	
  request	
  specific	
  RIB	
  subsets	
  –	
  reduce	
  control-­‐plane	
  
          load.	
  
       –  Allow	
  for	
  such	
  requests	
  to	
  be	
  automa%cally	
  or	
  manually	
  generated.	
  
	
  
Session Reset whilst Maintaining RIB/FIB.

•  Currently	
  NOTIFICATION	
  and	
  session	
  reset	
  is	
  the	
  reac?on	
  to	
  an	
  error.	
  
      –  Deals	
  with	
  reseYng	
  state	
  that	
  may	
  have	
  resulted	
  in	
  erroneous	
  UPDATE.	
  
      –  Major	
  opera%onal	
  issue	
  is	
  the	
  forwarding	
  disrup%on	
  caused.	
  


                                          FORWARDING PLANE UNAFFECTED.


                                                                          SESSION
                                                                           RESET
                              RTR A                                                        RTR B

                                            SESSION
                                            RE-OPEN




•  Benefits	
  of	
  reseSng	
  all	
  session	
  state	
  whilst	
  allowing	
  forwarding	
  to	
  
   con?nue.	
  
      –  Iden%cal	
  recovery	
  mechanism	
  as	
  is	
  implemented	
  currently,	
  with	
  lower	
  impact	
  to	
  
         opera%on	
  of	
  the	
  network.	
  
Monitoring.

•  Addi?onal	
  complexity	
  in	
  the	
  protocol	
  requires	
  further	
  opera?onal	
  
   visibility.	
  
       –  Let	
  our	
  NOCs	
  know	
  about	
  BGP-­‐4	
  errors,	
  and	
  respond.	
  
       –  Previously	
  NOTIFICATION/tear-­‐down	
  was	
  very	
  visible	
  due	
  to	
  forwarding	
  outages.	
  


•  Enhance	
  monitoring	
  toolset.	
  
       –  Capability	
  to	
  transmit	
  error	
  informa%on	
  between	
  BGP	
  neighbours.	
  
       –  Further	
  visibility	
  to	
  determine	
  where	
  errors	
  have	
  occurred,	
  and	
  what	
  they	
  are.	
  




	
  
	
  
Caveats of Requirements.

•  React	
  to	
  errors	
  (and	
  recover)	
  within	
  available	
  control-­‐plane	
  resource.	
  
       –  Ensure	
  that	
  we	
  do	
  not	
  reach	
  looped	
  scenarios	
  where	
  automa%c	
  recovery	
  is	
  available.	
  


•  Exponen?al	
  (?)	
  Back-­‐Off	
  for	
  RIB	
  recovery	
  requests.	
  
       –  Don’t	
  overload	
  neighbour	
  and/or	
  local	
  BGP	
  speaker	
  with	
  recovery	
  requests.	
  


•  Avoid	
  constant	
  session	
  restarts.	
  
       –  Iden%fy	
  a	
  point	
  at	
  which	
  a	
  session	
  is	
  “bad”	
  if	
  using	
  automa%c	
  mechanisms	
  to	
  recover.	
  
	
  
Draft Progression.

•  DraV	
  has	
  been	
  presented	
  and	
  discussed	
  at	
  a	
  number	
  of	
  opera?onal	
  
   forums.	
  
      –  NANOG,	
  UKNOF,	
  LINX.	
  
      –  Well	
  supported	
  as	
  a	
  set	
  of	
  requirements	
  for	
  operators	
  (see	
  GROW	
  and	
  IDR	
  mailing	
  lists).	
  


•  Would	
  like	
  WG	
  adop?on.	
  
      –  Provides	
  a	
  framework	
  to	
  which	
  IDR/GROW	
  work	
  items	
  can	
  be	
  %ed.	
  
      –  Intends	
  to	
  avoid	
  “par%al	
  solu%ons”	
  that	
  do	
  not	
  meet	
  the	
  toolset	
  required	
  by	
  operators.	
  


•  Thoughts	
  as	
  to	
  which	
  WG	
  is	
  most	
  suitable?	
  
	
  

More Related Content

What's hot (12)

RIPE 80: Buffers and Protocols
RIPE 80: Buffers and ProtocolsRIPE 80: Buffers and Protocols
RIPE 80: Buffers and Protocols
 
Bgp
BgpBgp
Bgp
 
Eigrp
EigrpEigrp
Eigrp
 
Rip
RipRip
Rip
 
Spaun vam
Spaun vamSpaun vam
Spaun vam
 
Solution for code congestion
Solution for code congestionSolution for code congestion
Solution for code congestion
 
Ppp
PppPpp
Ppp
 
Mpls technology
Mpls technologyMpls technology
Mpls technology
 
Scot baxtor cdma
Scot baxtor cdmaScot baxtor cdma
Scot baxtor cdma
 
Somerdata AROW Data Diode
Somerdata AROW Data DiodeSomerdata AROW Data Diode
Somerdata AROW Data Diode
 
Voip basics
Voip basicsVoip basics
Voip basics
 
CMAF Live Ingest Uplink Protocol
CMAF Live Ingest Uplink ProtocolCMAF Live Ingest Uplink Protocol
CMAF Live Ingest Uplink Protocol
 

Viewers also liked

LINX65 - Handling BGP Attribute Errors (Rob Shakir)
LINX65 - Handling BGP Attribute Errors (Rob Shakir)LINX65 - Handling BGP Attribute Errors (Rob Shakir)
LINX65 - Handling BGP Attribute Errors (Rob Shakir)Rob Shakir
 
Projekt labhr
Projekt labhrProjekt labhr
Projekt labhrLAB HR
 
UKNOF16 - Enhancing BGP
UKNOF16 - Enhancing BGPUKNOF16 - Enhancing BGP
UKNOF16 - Enhancing BGPRob Shakir
 
Ryan's Rappers - Walk Thru the Years
Ryan's Rappers - Walk Thru the YearsRyan's Rappers - Walk Thru the Years
Ryan's Rappers - Walk Thru the Yearszgperry
 
Chapter six
Chapter sixChapter six
Chapter sixgpappas4
 
Fractures.
Fractures.Fractures.
Fractures.gpappas4
 
Biology of the brain
Biology of the brainBiology of the brain
Biology of the braingpappas4
 
IETF87 - STATUS BoF: Performance Engineered LSPs
IETF87 - STATUS BoF: Performance Engineered LSPsIETF87 - STATUS BoF: Performance Engineered LSPs
IETF87 - STATUS BoF: Performance Engineered LSPsRob Shakir
 
LabHr - Katalog narzędzi diagnostycznych
LabHr - Katalog narzędzi diagnostycznychLabHr - Katalog narzędzi diagnostycznych
LabHr - Katalog narzędzi diagnostycznychLAB HR
 
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...Rob Shakir
 
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)Rob Shakir
 
100GE in the Lab - LINX 71
100GE in the Lab - LINX 71100GE in the Lab - LINX 71
100GE in the Lab - LINX 71Rob Shakir
 
BGP OPERATIONAL Message
BGP OPERATIONAL MessageBGP OPERATIONAL Message
BGP OPERATIONAL MessageRob Shakir
 
Utilizing Parnerships to Communicate Pool Safely Messaging
Utilizing Parnerships to Communicate Pool Safely MessagingUtilizing Parnerships to Communicate Pool Safely Messaging
Utilizing Parnerships to Communicate Pool Safely MessagingSuncoastMeetings
 
accelerator day chinchin 소개자료
accelerator day chinchin 소개자료accelerator day chinchin 소개자료
accelerator day chinchin 소개자료VentureSquare
 
Magnetic hall effect based sensors final
Magnetic hall effect based sensors finalMagnetic hall effect based sensors final
Magnetic hall effect based sensors finalsjykmuch
 

Viewers also liked (20)

LINX65 - Handling BGP Attribute Errors (Rob Shakir)
LINX65 - Handling BGP Attribute Errors (Rob Shakir)LINX65 - Handling BGP Attribute Errors (Rob Shakir)
LINX65 - Handling BGP Attribute Errors (Rob Shakir)
 
Guide To Passing PMI\'s PMP Exam
Guide To Passing PMI\'s PMP ExamGuide To Passing PMI\'s PMP Exam
Guide To Passing PMI\'s PMP Exam
 
Projekt labhr
Projekt labhrProjekt labhr
Projekt labhr
 
UKNOF16 - Enhancing BGP
UKNOF16 - Enhancing BGPUKNOF16 - Enhancing BGP
UKNOF16 - Enhancing BGP
 
Ryan's Rappers - Walk Thru the Years
Ryan's Rappers - Walk Thru the YearsRyan's Rappers - Walk Thru the Years
Ryan's Rappers - Walk Thru the Years
 
Chapter six
Chapter sixChapter six
Chapter six
 
Fractures.
Fractures.Fractures.
Fractures.
 
Biology of the brain
Biology of the brainBiology of the brain
Biology of the brain
 
IETF87 - STATUS BoF: Performance Engineered LSPs
IETF87 - STATUS BoF: Performance Engineered LSPsIETF87 - STATUS BoF: Performance Engineered LSPs
IETF87 - STATUS BoF: Performance Engineered LSPs
 
LabHr - Katalog narzędzi diagnostycznych
LabHr - Katalog narzędzi diagnostycznychLabHr - Katalog narzędzi diagnostycznych
LabHr - Katalog narzędzi diagnostycznych
 
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...
Reinforcing the Kitchen Sink - Aligning BGP-4 Error Handling with Modern Netw...
 
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)
BGP Error Handling - Developing an Operator-Led Approach in the IETF (UKNOF 18)
 
100GE in the Lab - LINX 71
100GE in the Lab - LINX 71100GE in the Lab - LINX 71
100GE in the Lab - LINX 71
 
BGP OPERATIONAL Message
BGP OPERATIONAL MessageBGP OPERATIONAL Message
BGP OPERATIONAL Message
 
Utilizing Parnerships to Communicate Pool Safely Messaging
Utilizing Parnerships to Communicate Pool Safely MessagingUtilizing Parnerships to Communicate Pool Safely Messaging
Utilizing Parnerships to Communicate Pool Safely Messaging
 
accelerator day chinchin 소개자료
accelerator day chinchin 소개자료accelerator day chinchin 소개자료
accelerator day chinchin 소개자료
 
Shahid
ShahidShahid
Shahid
 
THE HALL EFFECT
THE HALL EFFECTTHE HALL EFFECT
THE HALL EFFECT
 
hall effect
hall effecthall effect
hall effect
 
Magnetic hall effect based sensors final
Magnetic hall effect based sensors finalMagnetic hall effect based sensors final
Magnetic hall effect based sensors final
 

Similar to Operational Requirements for Enhanced BGP Error Handling

Study for FIBRE-BR Backbone Network Architecture
Study for FIBRE-BR Backbone Network ArchitectureStudy for FIBRE-BR Backbone Network Architecture
Study for FIBRE-BR Backbone Network ArchitectureFIBRE Testbed
 
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other Observations
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other ObservationsAusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other Observations
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other ObservationsMark Smith
 
Eigrp
EigrpEigrp
Eigrpfirey
 
IPv6 Segment Routing : an end-to-end solution ?
IPv6 Segment Routing : an end-to-end solution ?IPv6 Segment Routing : an end-to-end solution ?
IPv6 Segment Routing : an end-to-end solution ?Olivier Bonaventure
 
BGP Traffic Engineering with SDN Controller
BGP Traffic Engineering with SDN ControllerBGP Traffic Engineering with SDN Controller
BGP Traffic Engineering with SDN ControllerAPNIC
 
MikroTik Multicast Routing [www.imxpert.co]
MikroTik Multicast Routing [www.imxpert.co]MikroTik Multicast Routing [www.imxpert.co]
MikroTik Multicast Routing [www.imxpert.co]Faisal Reza
 
3.3 gpp NR USER Plane introduction
3.3 gpp NR USER Plane introduction3.3 gpp NR USER Plane introduction
3.3 gpp NR USER Plane introductionSaurabh Verma
 
Operational Experience of MAP-E
Operational Experience of MAP-EOperational Experience of MAP-E
Operational Experience of MAP-EAkira Nakagawa
 
IPv6 in 3G Core Networks
IPv6 in 3G Core NetworksIPv6 in 3G Core Networks
IPv6 in 3G Core NetworksJohn Loughney
 
Mpls Presentation Ine
Mpls Presentation IneMpls Presentation Ine
Mpls Presentation IneAlp isik
 
Performance Evaluation of GTP-U and SRv6 Stateless Translation
Performance Evaluation of GTP-U and SRv6 Stateless TranslationPerformance Evaluation of GTP-U and SRv6 Stateless Translation
Performance Evaluation of GTP-U and SRv6 Stateless TranslationChunghan Lee
 
PLNOG 8: Rafał Szarecki - Telco Group Network
PLNOG 8: Rafał Szarecki - Telco Group Network PLNOG 8: Rafał Szarecki - Telco Group Network
PLNOG 8: Rafał Szarecki - Telco Group Network PROIDEA
 
Mpls vpn.rip
Mpls vpn.ripMpls vpn.rip
Mpls vpn.ripfarhanica
 
IX Best Practices by Tay Chee Yong
IX Best Practices by Tay Chee YongIX Best Practices by Tay Chee Yong
IX Best Practices by Tay Chee YongMyNOG
 
Interautonomous System PLS VPN Advanced Concepts
Interautonomous System PLS VPN Advanced ConceptsInterautonomous System PLS VPN Advanced Concepts
Interautonomous System PLS VPN Advanced ConceptsBrozaa
 

Similar to Operational Requirements for Enhanced BGP Error Handling (20)

Study for FIBRE-BR Backbone Network Architecture
Study for FIBRE-BR Backbone Network ArchitectureStudy for FIBRE-BR Backbone Network Architecture
Study for FIBRE-BR Backbone Network Architecture
 
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other Observations
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other ObservationsAusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other Observations
AusNOG 2011 - Residential IPv6 CPE - What Not to Do and Other Observations
 
Eigrp
EigrpEigrp
Eigrp
 
IPv6 Segment Routing : an end-to-end solution ?
IPv6 Segment Routing : an end-to-end solution ?IPv6 Segment Routing : an end-to-end solution ?
IPv6 Segment Routing : an end-to-end solution ?
 
Fast Convergence Techniques
Fast Convergence TechniquesFast Convergence Techniques
Fast Convergence Techniques
 
BGP Traffic Engineering with SDN Controller
BGP Traffic Engineering with SDN ControllerBGP Traffic Engineering with SDN Controller
BGP Traffic Engineering with SDN Controller
 
MikroTik Multicast Routing [www.imxpert.co]
MikroTik Multicast Routing [www.imxpert.co]MikroTik Multicast Routing [www.imxpert.co]
MikroTik Multicast Routing [www.imxpert.co]
 
3.3 gpp NR USER Plane introduction
3.3 gpp NR USER Plane introduction3.3 gpp NR USER Plane introduction
3.3 gpp NR USER Plane introduction
 
Operational Experience of MAP-E
Operational Experience of MAP-EOperational Experience of MAP-E
Operational Experience of MAP-E
 
Rip
RipRip
Rip
 
IPv6 in 3G Core Networks
IPv6 in 3G Core NetworksIPv6 in 3G Core Networks
IPv6 in 3G Core Networks
 
Mpls Presentation Ine
Mpls Presentation IneMpls Presentation Ine
Mpls Presentation Ine
 
Performance Evaluation of GTP-U and SRv6 Stateless Translation
Performance Evaluation of GTP-U and SRv6 Stateless TranslationPerformance Evaluation of GTP-U and SRv6 Stateless Translation
Performance Evaluation of GTP-U and SRv6 Stateless Translation
 
PLNOG 8: Rafał Szarecki - Telco Group Network
PLNOG 8: Rafał Szarecki - Telco Group Network PLNOG 8: Rafał Szarecki - Telco Group Network
PLNOG 8: Rafał Szarecki - Telco Group Network
 
Mpls vpn.rip
Mpls vpn.ripMpls vpn.rip
Mpls vpn.rip
 
IX Best Practices by Tay Chee Yong
IX Best Practices by Tay Chee YongIX Best Practices by Tay Chee Yong
IX Best Practices by Tay Chee Yong
 
A series presentation
A series presentationA series presentation
A series presentation
 
I Pv6 Enabling Menog 0.4
I Pv6 Enabling Menog 0.4I Pv6 Enabling Menog 0.4
I Pv6 Enabling Menog 0.4
 
Interautonomous System PLS VPN Advanced Concepts
Interautonomous System PLS VPN Advanced ConceptsInterautonomous System PLS VPN Advanced Concepts
Interautonomous System PLS VPN Advanced Concepts
 
EMEA Airheads- Aruba Instant AP- VPN Troubleshooting
EMEA Airheads- Aruba Instant AP-  VPN TroubleshootingEMEA Airheads- Aruba Instant AP-  VPN Troubleshooting
EMEA Airheads- Aruba Instant AP- VPN Troubleshooting
 

Operational Requirements for Enhanced BGP Error Handling

  • 1. Operational Requirements for Enhanced BGP Error Handling in BGP-4 draft-shakir-idr-ops-reqs-for-bgp-error-handling Rob Shakir <rob.shakir@cw.com> IETF 80 – Prague, CZ.
  • 2. Problem Statement. •  NOTIFICATION  based  on  errors  in  BGP-­‐4  UPDATE  messages  cause   dispropor?onate  failures  in  Service  Provider  Networks.   IPv4 L3VPN ASN 1   ASN 2 IPv6 RR L3VPN ASN 3 AS BR IP "TRANSIT" ... PE VPLS ... RR ... Global IPv4 ASN N Global IPv6 iBGP   eBGP   •  Mul%ple  AFIs  (services)  affected.     •  Paths  to  all  NLRI  affected  despite  error   •  Discrete  rou%ng  topologies  affected   in  single  UPDATE.   (e.g.  different  L3VPNs)  
  • 3. Avoiding sending NOTIFICATION. •  Operator’s  deployments  mean  compromises  to  protocol  correctness   resul?ng  in  invalid  rou?ng  may  be  acceptable.   –  Par%cularly  with  mul%ple  AFI  –  some  carrying  many  discrete  topologies.     VRF 1   CPE A   CPE B RR PE VRF 2 !""#$%&'#()*+*"*,-.'/ CPE C -0#'-0,".#1!&2 •  Requirement  is  to  avoid  sending  NOTIFICATION  where  possible.   –  Do  not  send  for  erroneous  UPDATEs  (and  hence  avoid  teardown).   –  Session  failure  affects  all  NLRI,  where  nega%ve  impact  affects  a  subset.   –  Required  for  both  eBGP  and  iBGP.  
  • 4. Recover RIB Consistency. •  Inconsistent  RIB  (by  trea?ng  UPDATE  as  withdraw)  compromises  protocol   correctness.   –  The  resul%ng  RIB  inconsistency  may  have  resulted  in  forwarding  loops  or  black-­‐holes.   –  BGP  speaker  is  aware  of  this  case,  if  using  “treat-­‐as-­‐withdraw”.   •  Whilst  such  inconsistencies  are  acceptable,  they  are  clearly  sub-­‐op?mal.   –  Mechanism  required  to  recover  consistency  of  the  RIB,  and  remove  invalid  rou%ng.     •  Whole  RIB  or  specific  RIB  subset?   –  ROUTE  REFRESH  is  inefficient  where  a  BGP  speaker  knows  the  NLRI  transmiWed  in  the   invalid  UPDATE.   –  Requirement  for  mechanism(s)  to  request  specific  RIB  subsets  –  reduce  control-­‐plane   load.   –  Allow  for  such  requests  to  be  automa%cally  or  manually  generated.    
  • 5. Session Reset whilst Maintaining RIB/FIB. •  Currently  NOTIFICATION  and  session  reset  is  the  reac?on  to  an  error.   –  Deals  with  reseYng  state  that  may  have  resulted  in  erroneous  UPDATE.   –  Major  opera%onal  issue  is  the  forwarding  disrup%on  caused.   FORWARDING PLANE UNAFFECTED. SESSION RESET RTR A RTR B SESSION RE-OPEN •  Benefits  of  reseSng  all  session  state  whilst  allowing  forwarding  to   con?nue.   –  Iden%cal  recovery  mechanism  as  is  implemented  currently,  with  lower  impact  to   opera%on  of  the  network.  
  • 6. Monitoring. •  Addi?onal  complexity  in  the  protocol  requires  further  opera?onal   visibility.   –  Let  our  NOCs  know  about  BGP-­‐4  errors,  and  respond.   –  Previously  NOTIFICATION/tear-­‐down  was  very  visible  due  to  forwarding  outages.   •  Enhance  monitoring  toolset.   –  Capability  to  transmit  error  informa%on  between  BGP  neighbours.   –  Further  visibility  to  determine  where  errors  have  occurred,  and  what  they  are.      
  • 7. Caveats of Requirements. •  React  to  errors  (and  recover)  within  available  control-­‐plane  resource.   –  Ensure  that  we  do  not  reach  looped  scenarios  where  automa%c  recovery  is  available.   •  Exponen?al  (?)  Back-­‐Off  for  RIB  recovery  requests.   –  Don’t  overload  neighbour  and/or  local  BGP  speaker  with  recovery  requests.   •  Avoid  constant  session  restarts.   –  Iden%fy  a  point  at  which  a  session  is  “bad”  if  using  automa%c  mechanisms  to  recover.    
  • 8. Draft Progression. •  DraV  has  been  presented  and  discussed  at  a  number  of  opera?onal   forums.   –  NANOG,  UKNOF,  LINX.   –  Well  supported  as  a  set  of  requirements  for  operators  (see  GROW  and  IDR  mailing  lists).   •  Would  like  WG  adop?on.   –  Provides  a  framework  to  which  IDR/GROW  work  items  can  be  %ed.   –  Intends  to  avoid  “par%al  solu%ons”  that  do  not  meet  the  toolset  required  by  operators.   •  Thoughts  as  to  which  WG  is  most  suitable?