BGP ERROR HANDLING.
DEVELOPING AN OPERATOR-LED APPROACH IN THE IETF.

Rob Shakir, Cable&Wireless Worldwide.

UKNOF 18 – 20/01/2011 - LONDON

A Typical SP Network?

[Diagram: PE routers around a core of P routers, with CUSTOMER, TRANSIT and PEER networks attached at the edge. Legend: BGP, IGP.]

IGP
  Signals customer/internal prefixes between PEs.
EGP
  Propagates internal prefixes to neighbouring ASes.

A (Modern) Typical SP Network?

[Diagram: PE routers around a core of P routers, plus a route reflector (RR), with CUSTOMER, TRANSIT and PEER networks attached at the edge. Legend: BGP, IGP.]

IGP
  Minimal infrastructure routing information.
BGP
  Propagates internal routing and service data.

BGP Failures I.

JAN. 09: ERRORS IN AS4_PATH
  Erroneous data in the AS4_PATH optional transitive attribute causing BGP session failure (JunOS bug).

FEB. 09: VERY LONG AS_PATH
  Very long AS_PATHs in the global BGP table cause session failure.
  Not the first time this had been seen.

BGP Failures II.

AUG. 10: RIPE NCC RIS EXPERIMENTAL
  A RIPE NCC RIS/Duke University experiment results in BGP sessions being reset – disrupting global table (IOS XR bug).

??: iBGP FAILURES
  Multiple occurrences within xSP networks.
  Likely to cause higher financial impact (L3VPN margin).

Why do we see these events?

[Diagram: RTR A sends an UPDATE to RTR B; RTR B detects an error ("Error!") and a NOTIFICATION is sent between RTR B and RTR A.]

Cause/Impact.

LIMITED TOOLSET IN STANDARDS.
  Must either DISCARD attributes or respond with NOTIFICATION.

SERVICE IMPACT.
  Transit/Peering failure - although error source may be remote.
  iBGP failure – high impact sessions? Route reflectors?

Results in loss of RIB!
Would you tolerate this in your IGP based on one erroneous LSP?

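
The loss of RIB described on this slide can be illustrated with a toy model. This is a minimal sketch, not router code: the `Session` class, `receive_update` and the `malformed` flag are hypothetical names invented for illustration. The only behaviour modelled is that one erroneous UPDATE leads to a NOTIFICATION, which closes the session and discards every route learned over it.

```python
class Session:
    """Toy model of one BGP session and the routes learned over it."""

    def __init__(self, peer):
        self.peer = peer
        self.established = True
        self.rib_in = {}  # prefix -> attributes learned from this peer

    def receive_update(self, prefix, attrs, malformed=False):
        """Apply an UPDATE; an erroneous one resets the whole session."""
        if malformed:
            # Assumed model of "respond with NOTIFICATION": the session
            # closes and all routes from this peer are removed.
            self.established = False
            lost = len(self.rib_in)
            self.rib_in.clear()
            return f"NOTIFICATION sent, {lost} routes lost"
        self.rib_in[prefix] = attrs
        return "accepted"


session = Session("RTR A")
for i in range(1000):
    # 1000 distinct, valid prefixes (illustrative addresses only).
    session.receive_update(f"10.{i // 256}.{i % 256}.0/24", {"as_path": [64500]})

# A single bad UPDATE for one prefix removes all 1000 good routes.
result = session.receive_update("172.16.0.0/12", {}, malformed=True)
print(result)
```

The point of the sketch is the asymmetry: the error concerns one UPDATE, but the cost is every route on the session.
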
Intent of Work.

DEFINE HOW BGP IS USED.
  Document the way xSPs use BGP.
  Ensure that critical nature of the protocol is understood.

PROVIDE REQUIREMENTS
  Determine how OPERATORS think that BGP should fail – and what we’ll compromise on.

TIE TOGETHER IETF WORK ITEMS.
  Ensure that tools resulting from existing drafts form a useful framework to make BGP robust.
  
Approach Overview.

01  DON’T SEND NOTIFICATION.
02  RECOVER RIB CONSISTENCY.
03  RESTART BGP HITLESSLY.
04  MONITORING

Avoid sending NOTIFICATION.

[Diagram: RTR A sends UPDATE 172.16.0.0/12 to RTR B; RTR B detects an error ("Error!") and marks 172.16.0.0/12 as WITHDRAWN. A NOTIFICATION is also shown between the two routers.]

WHAT DO WE COMPROMISE ON?
  “treat-as-withdraw” mechanism can result in routing inconsistency (possible loops!).

EXISTING WORK ITEMS IN IETF?
  draft-chen (eBGP errors) – includes Opt Trans.
  Needs to be extended to cover iBGP.
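
The "treat-as-withdraw" idea and the compromise it carries can be sketched as follows. This is an illustrative model only: `handle_update`, the `attrs_valid` flag and the dictionary-based RIB are assumptions made for the example, not the mechanism as specified in any draft. It shows the session surviving an erroneous UPDATE, and the resulting mismatch between what RTR A advertised and what RTR B holds.

```python
def handle_update(rib_in, prefixes, attrs_valid, attrs=None):
    """Apply one UPDATE to rib_in (dict: prefix -> attributes).

    If the attributes fail validation, the prefixes carried in the UPDATE
    are removed as if they had been withdrawn, and the session stays up.
    """
    if attrs_valid:
        for prefix in prefixes:
            rib_in[prefix] = attrs
        return "installed"
    for prefix in prefixes:
        rib_in.pop(prefix, None)  # treat-as-withdraw: drop only these prefixes
    return "treated-as-withdraw"


# What RTR A believes it has advertised (hypothetical example data).
rtr_a_advertised = {"10.0.0.0/8", "172.16.0.0/12"}
rib_in = {prefix: {"next_hop": "192.0.2.1"} for prefix in rtr_a_advertised}

# RTR B receives an erroneous UPDATE for 172.16.0.0/12 from RTR A.
outcome = handle_update(rib_in, ["172.16.0.0/12"], attrs_valid=False)

# RTR A still thinks the prefix is advertised; RTR B no longer has it.
inconsistent = rtr_a_advertised - set(rib_in)
print(outcome, sorted(rib_in), sorted(inconsistent))
```

The `inconsistent` set is the compromise named on the slide: the other routes survive, but the two speakers no longer agree on 172.16.0.0/12.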
  
Recover RIB Consistency.

[Diagram: RTR B is missing 172.16.0.0/12 from RTR A; RTR B sends REQUEST 172.16.0.0/12 to RTR A, and RTR A sends UPDATE 172.16.0.0/12.]

HOW CAN THIS BE ACHIEVED?
  Mechanisms to re-request missing NLRI.
  One prefix at once, or whole RIB.

EXISTING WORK ITEMS?
  “One-Time Prefix ORF”.
  Enhanced ROUTE REFRESH.
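
The two re-request granularities on this slide (one prefix at once, or whole RIB) can be sketched with a small model. The `rerequest` function and the dictionary RIBs are hypothetical and do not reproduce the message formats of One-Time Prefix ORF or Enhanced ROUTE REFRESH; the sketch only contrasts how much RTR A has to resend in each case.

```python
def rerequest(adj_rib_out, prefix=None):
    """Return the routes RTR A would resend in response to a request.

    prefix=None models a whole-RIB refresh; otherwise only the named
    prefix is resent (if RTR A still has it).
    """
    if prefix is None:
        return dict(adj_rib_out)
    if prefix in adj_rib_out:
        return {prefix: adj_rib_out[prefix]}
    return {}


# Hypothetical example data: RTR A's advertised routes.
rtr_a_out = {
    "10.0.0.0/8": {"next_hop": "192.0.2.1"},
    "172.16.0.0/12": {"next_hop": "192.0.2.1"},
    "192.168.0.0/16": {"next_hop": "192.0.2.1"},
}
# RTR B is missing 172.16.0.0/12 after an earlier error.
rtr_b_in = {p: a for p, a in rtr_a_out.items() if p != "172.16.0.0/12"}

single = rerequest(rtr_a_out, "172.16.0.0/12")  # one prefix at once
whole = rerequest(rtr_a_out)                    # whole RIB

rtr_b_in.update(single)
print(len(single), len(whole), rtr_b_in == rtr_a_out)
```

The single-prefix request is cheaper but requires RTR B to know which NLRI it lost; the whole-RIB request needs no such knowledge at the cost of resending everything.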
  
Reduce Impact of Session Reset.

SESSION RESETS, CAN WE AVOID THEM?
  NOTIFICATION has utility for resetting state.
  Consider that sometimes it is unavoidable.

[Diagram: SESSION RESET between RTR A and RTR B, followed by SESSION RE-OPEN. FORWARDING PLANE UNAFFECTED.]

EXISTING WORK ITEMS IN IETF?
  (Expired) “SOFT-NOTIFICATION”.
  Further work required to revive!
  
Introduce Further Monitoring.

EXISTING ERRORS ARE VERY VISIBLE.
  NOCs can see session failures very easily – both via session monitoring and forwarding outage!

FURTHER COMPLEXITY MEANS LESS MANAGEABLE
  Mechanisms are required to make error handling visible to both BGP speakers.

EXISTING WORK ITEMS IN IETF?
  (In-band) ADVISORY and DIAGNOSTIC.
  (Out-of-Band) BGP Monitoring Protocol.
  
Complexities of Approach.

[Flow diagram with the boxes: Error! / treat-as-withdraw / Know the NLRI? / Re-request (ORF) / Re-request the whole RIB / Hitless Session Reset / OOPS!]

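
One possible reading of the flow on this slide can be sketched as a small decision function. The ordering below is an assumption, since the arrows of the original diagram are not recoverable from the text: the function name `recovery_actions` and both of its parameters are hypothetical, and the escalation order (targeted recovery first, hitless session reset as a fallback) is only inferred from the approach overview.

```python
def recovery_actions(nlri_known, rerequest_succeeded):
    """Return an ordered list of recovery steps for one erroneous UPDATE.

    Assumed escalation: use the most targeted tool first, and fall back
    to a hitless session reset only if re-requesting does not help.
    """
    actions = []
    if nlri_known:
        # The affected prefixes can be identified, so act on them alone.
        actions.append("treat-as-withdraw")
        actions.append("re-request prefix (ORF)")
    else:
        # Without the NLRI there is nothing specific to withdraw.
        actions.append("re-request whole RIB")
    if not rerequest_succeeded:
        actions.append("hitless session reset")
    return actions


best_case = recovery_actions(nlri_known=True, rerequest_succeeded=True)
worst_case = recovery_actions(nlri_known=False, rerequest_succeeded=False)
print(best_case)
print(worst_case)
```

Under these assumptions, the best case touches only the affected prefix, while the worst case still avoids a forwarding outage as long as the reset is hitless.
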
Why am I standing here?

                         UKNOF

  As Operators, we deal with the fall-out of protocol issues!


SO…
 an agreed, operator-recommended approach is required.
Questions, comments, review…


   ALL MUCH APPRECIATED!

rob.shakir@cw.com // +44(0)207 100 7532 // RJS-RIPE
