[lacnog] Problemas de ruteo con nuevo atributo
Arturo Servin
aservin en lacnic.net
Vie Ago 27 16:57:56 BRT 2010
Creo que este mensaje de RIPE NCC es importante para la comunidad de ruteo. No se si a alguno le haya impactado en sus operaciones.
Slds
asn
Date: Fri, 27 Aug 2010 11:42:17 -0700 (PDT)
From: Lucy Lynch <llynch en civil-tongue.net>
Subject: Re: Did your BGP crash today?
To: Grzegorz Janoszka <Grzegorz en Janoszka.pl>
Cc: nanog en nanog.org
Message-ID: <alpine.BSF.2.00.1008271141540.79214 en hiroshima.bogus.com>
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed
FYI:
----------------------------------------------------------------------
Dear Colleagues,
On Friday 27 August, from 08:41 to 09:08 UTC, the RIPE NCC Routing
Information Service (RIS) announced a route with an experimental BGP
attribute. During this announcement, some Internet Service Providers
reported problems with their networking infrastructure.
Investigation
--------------
Immediately after discovering this, we stopped the announcement and
started investigating the problem. Our investigation has shown that the
problem was likely to have been caused by certain router types
incorrectly modifying the experimental attribute and then further
announcing the malformed route to their peers. The announcements sent
out by the RIS were correct and complied to all standards.
The experimental attribute was part of an experiment conducted in
collaboration with a group from Duke University. This involved
announcing a large (3000 bytes) optional transitive attribute, using a
modified version of Quagga. The attribute used type code 99. The data
consisted of zeros. We used the prefix 93.175.144.0/24 for this and
announced from AS 12654 on AMS-IX, NL-IX and GN-IX to all our peers.
Reports from affected ISPs showed that the length of the attribute in
the attribute header, as seen by their routers, was not correct. The
header stated 233 bytes and the actual data in their samples was 237
bytes. This caused some routers to drop the session with the peer that
announced the route.
We have built a test set-up which is running identical software and
configurations to the live set-up. From this set-up, and the BGP packet
dumps as made by the RIS, we have determined that the length of the data
in the attribute as sent out by the RIS was indeed 3000 bytes and that
all lengths recorded in the headers of the BGP updates were correct.
Beyond the RIS systems, we can only do limited diagnosis. One possible
explanation is that the affected routers did not correctly use the
extended length flag on the attribute. This flag is set when the length
of the attribute exceeds 255 bytes i.e. when two octets are needed to
store the length.
It may be that the routers may not add the higher octet of the length to
the total length, which would lead, in our test set-up, to a total
packet length of 236 bytes. If, in addition, the routers also
incorrectly trim the attribute length, the problem could occur as
observed. It is worth noting that the difference between the reported
233 and 237 bytes is the size of the flags, type code and length in the
attribute.
We will be further investigating this problem and will report any
findings. We regret any inconvenience caused.
Kind regards,
Erik Romijn
Information Services
RIPE NCC
_______________________________________________
tech-l mailing list
tech-l en ams-ix.net
http://melix.ams-ix.net/mailman/listinfo/tech-l
- Lucy
------------ próxima parte ------------
Se ha borrado un adjunto en formato HTML...
URL: <https://mail.lacnic.net/pipermail/lacnog/attachments/20100828/752d5845/attachment.html>
Más información sobre la lista de distribución LACNOG