<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>Renesys Blog</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/" />
    <link rel="self" type="application/atom+xml" href="http://www.renesys.com/blog/atom.xml" />
    <id>tag:www.renesys.com,2008-10-18:/blog//1</id>
    <updated>2009-06-22T11:25:28Z</updated>
    
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type 4.25</generator>

<entry>
    <title>The Proxy Fight for Iranian Democracy</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/06/the-proxy-fight-for-iranian-de.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.160</id>

    <published>2009-06-22T11:30:00Z</published>
    <updated>2009-06-22T11:25:28Z</updated>

    <summary>If you put 65 million people in a locked room, they&apos;re going to find all the exits pretty quickly, and maybe make a few of their own. In the case of Iran&apos;s crippled-but-still-connected Internet, that means finding a continuous supply...</summary>
    <author>
        <name>James Cowie</name>
        
    </author>
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Security" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="iran" label="iran" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="webproxies" label="web proxies" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>If you put 65 million people in a locked room, they're going to find all the exits pretty quickly, and maybe make a few of their own.  In the case of Iran's <a href="http://www.renesys.com/blog/2009/06/iran-and-the-internet-uneasy-s.shtml">crippled-but-still-connected Internet,</a> that means finding a continuous supply of <a href="http://en.wikipedia.org/wiki/Squid_cache"><strong>proxy servers</strong></a> that allow continued access to unfiltered international web content like Twitter, Gmail, and the BBC. </p>]]>
        <![CDATA[
<p>A proxy server is a simple bit of software that you run on your computer.   It effectively lets you share your computer with anonymous strangers as a "repeater" for content that they aren't allowed to fetch themselves.  For example, an Iranian web browser might be manually configured to use your computer (identified by an IP address and a port number) as a Web proxy.  When your anonymous friend reads twitter.com, or posts a tweet, the request goes via your computer, instead of to Twitter's web server directly.  Except for a little delay, and the fact that your friend gets to see what the uncensored Internet looks like from New York or London or S&atilde;o Paolo instead of Tabriz or Qom, surfing through a proxy is pretty much like surfing without one. </p>

<p>As you might imagine, open web proxies are valuable commodities in places where it's forbidden, possibly dangerous, to surf the Internet.  Iran's opposition movement has been vigorously trading lists of open proxies over the past week.   And as you might further imagine, the Iranian government censors have worked overtime to identify these proxies and add them to the daily blacklists.</p>

As an experiment, we geolocated a list of about 2,000 web proxies (unique IP addresses and port numbers) that were shared on Twitter and other web sites over the course of the last week, to see if we could discern patterns in the places that are hosting them.   Most of these are no longer reachable from inside Iran, of course, precisely because they were made public.  The following map shows the distribution of those proxies worldwide.</p><p>
 
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.renesys.com/blog/assets_c/2009/06/proxycount-5.shtml" onclick="window.open('http://www.renesys.com/blog/assets_c/2009/06/proxycount-5.shtml','popup','width=812,height=643,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://www.renesys.com/blog/assets_c/2009/06/proxycount-thumb-600x475-5.png" width="600" height="475" alt="proxycount.png" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></a></span></p>


<p>The USA and Western Europe were well-represented, but so were China, India, Russia, Romania, Bulgaria, Vietnam, ... <strong>87 countries in all,</strong> a pretty impressive breadth of representation, considering the relatively small size of this sample.  (You can also see about a dozen <strong>Iranian </strong> IP addresses represented in the set.  Not surprisingly, all but one of these belong to networks originated by DCI, the <a href="http://www.renesys.com/blog/2009/06/strange-changes-in-iranian-int.shtml#more">government-run service provider</a> who operates the modern-day Internet equivalent of the Alam&#x016B;t Castle.)</p>

<p>Here's a geographic visualization of the proxies, drawn in Google Earth.  In the first one, we've drawn Iran in green, with some of their domestic network sketched in white, and their major international connections drawn in red.   Each of the colored arcs represents a single open web proxy; they are "fountaining" out of a cable landing or Internet traffic exchange point that makes approximate sense for their Iranian Internet routing.   For example, all of the web proxies in Europe are drawn from the Marseilles termination of the <a href="http://www.seamewe4.com/">Sea-Me-We-4 cable.</a>   The web proxies in Turkey are drawn in light blue, radiating from Ankara, where the <a href="http://en.wikipedia.org/wiki/Iran-Turkey_pipeline">Iran-Turkey gas pipeline</a> passes through on its way from Bazargan.   Those unusual Iranian proxies emerge from Tehran, and so forth. </p>
<p><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.renesys.com/blog/assets_c/2009/06/third_2-8.shtml" onclick="window.open('http://www.renesys.com/blog/assets_c/2009/06/third_2-8.shtml','popup','width=1329,height=760,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://www.renesys.com/blog/assets_c/2009/06/third_2-thumb-600x343-8.png" width="600" height="343" alt="third_2.png" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></a></span></p>
If we rotate the globe, you can see how the countries of Asia are doing their part to keep the bits flowing in Iran.  India, China, South Korea, Taiwan, Vietnam, and Japan are all visible sources of web proxy activity.</p>
<p><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.renesys.com/blog/assets_c/2009/06/third_3-11.shtml" onclick="window.open('http://www.renesys.com/blog/assets_c/2009/06/third_3-11.shtml','popup','width=1333,height=760,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://www.renesys.com/blog/assets_c/2009/06/third_3-thumb-600x342-11.png" width="600" height="342" alt="third_3.png" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></a></span></p>
<hr><p> I'd like to be able to say that these maps are a measure of the strength of the democratic impulse and volunteer spirit in all the countries of the world. But that might be a stretch.  You see, looked at another way, an open proxy is a security hole, something you might find in a machine that's been compromised, or at the very least, badly administered.  Security purists think of them as the "unlocked gun cabinet" of the Internet &mdash; a resource for anyone who wants to abuse a website, commit fraud, cover their tracks.</p>
<p>  Some of the proxies in this dataset are undoubtedly fresh, created by people who want to keep the Internet alive for the Iranian people.  But many of these proxies have probably been around for months or years, mapped out by those that map out such things. </p>
<p>We did see <a href="http://proxysetupforiran.blogspot.com/">a few</a> <a href="http://blog.austinheap.com/2009/06/15/how-to-setup-a-proxy-for-iran-citizens/">organizers</a> <a href="http://extrafuture.com/2009/06/15/how-to-set-up-an-anonymous-proxy-for-iranians-using-squid-on-mac-os-x/"> try to explain </a><a href="http://belsec.skynetblogs.be/post/7072317/setup-a-proxy-for-iran--overpower-their-censo">the concept of an ACL</a> (Access Control List) to all the new proud parents of open proxies.  If you are diligent, it is possible to restrict the anonymous users of your new proxy to just the Iranians, or even just the Iranian non-government networks, if you have a good enough list of the IP address blocks (network prefixes) in question.  But I expect that the complexity of configuring anything tighter than an "open access" proxy is going to prove too high a barrier to entry for most people who might volunteer to run one.</p>
<p> For one thing, we know how hard this is.  Renesys has pretty good lists of per-country networks and their transit patterns, based on our analysis of the global routing tables, and trust me, they take some work to maintain.  And even given good maps of Iran's address space to work from, ACLs are notoriously hard to test, if you don't have Iranian friends who can try your server from inside the protest zone and report back to you with problems.   Most people aren't going to bother, and that's probably okay.   Freedom is messy.   There'll be time for security later. </p>
<p><span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.renesys.com/blog/assets_c/2009/06/world_proxies-14.shtml" onclick="window.open('http://www.renesys.com/blog/assets_c/2009/06/world_proxies-14.shtml','popup','width=766,height=655,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://www.renesys.com/blog/assets_c/2009/06/world_proxies-thumb-600x513-14.png" width="600" height="513" alt="world_proxies.png" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></a></span></p>
<hr>
<p>Perhaps the strangest thing of all, given how diverse and active and vocal the proxy server farmers have been, is that by and large, <em>it isn't working. </em>   The rate with which new proxies are being created has slumped over the last few days.   It's getting harder and harder to propagate new proxies to the people who need them, as the government consolidates its hold on the filtering mechanisms. Any new proxy addresses that are posted to Twitter, or emailed, will be blocked very quickly.  </p>
<p>  People we talk to inside Iran say that almost no proxies are usable any more.  Freegate, a Chinese anti-censorship application that makes use of networks of open proxies, has proven popular in Iran.  But this week, it, too, has been experiencing problems.   Many popular applications, like Yahoo! Messenger,  have stopped working.  The authorities are said to be using power interruptions as a cyberweapon, causing brief outages during rallies that cause computers to reboot, just as people are trying to upload images and video.  The net result, as <a href="http://asert.arbornetworks.com/2009/06/a-deeper-look-at-the-iranian-firewall/">Arbor's excellent analysis shows</a>, has been a drastic reduction in inbound traffic on filtered ports since the election.</p>
<p>If there's a lesson here for the rest of the world, perhaps it's this: Install a few proxy instances on machines you control.  Learn how to lock them down properly.  Swap them with your friends overseas who live in places where the Internet is fragile.  Set up your tunnels and test them.  And <em>don't wait until the tanks are in the streets to figure this out,</em> because by that point, you may have already lost the proxy war.</p>]]>
    </content>
</entry>

<entry>
    <title>Iran and the Internet: Uneasy Standoff</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/06/iran-and-the-internet-uneasy-s.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.159</id>

    <published>2009-06-16T21:21:25Z</published>
    <updated>2009-06-17T00:55:11Z</updated>

    <summary> We&apos;ve received enough interest about our previous notes on Iranian Internet connectivity that I wanted to give a brief update, and some reflections....</summary>
    <author>
        <name>James Cowie</name>
        
    </author>
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Politics" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Society" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="internet" label="internet" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="iran" label="iran" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[
<p>We've received enough interest about our <a href="http://www.renesys.com/blog/2009/06/strange-changes-in-iranian-int.shtml">previous notes on Iranian Internet connectivity</a> that I wanted to give a brief update, and some reflections.</p>]]>
        <![CDATA[
<p>In short: Iran is still on the Internet.  As the crisis deepens, people are literally <a href="http://www.reddit.com/r/worldnews/comments/8sz12/comprehensive_breakdown_of_the_current_situation/">risking their lives</a> by continuing to use the Internet for coordination and communication.   Iran's <a href="http://www.science-arts.org/internet/node38.html">physical</a> <a href="http://taeint.net/en/network/middle/">connectivity</a> to the Internet is so centralized, and so fragile, that it's within the power of the government to simply "turn it off" if they so desired.  And yet, they have not done so.</p>  

<p>Except for a brief period of outage over the weekend, the routes into Iran from the rest of the world have been basically intact, if a bit congested and unstable.   Most of that congestion and instability is probably the result of six billion people who are freshly interested in Iranian politics, all reading (and in some cases, yes, <a href="http://www.circleid.com/posts/20060615_iranian_protesters_urged_not_to_ddos_government_websites/">attacking</a>) Iranian websites.   We aren't making things easier for the people inside Iran, who need that same bandwidth to get out their images and observations and <a href="http://twitter.com/#search?q=%23IranElection">tweets.</a>  </p>

<p>To show that nothing much has changed structurally, consider the fact that the same lineup of six international carriers are still carrying the megabits back and forth to the government's monopoly points-of-presence at the Iranian border.  (See the following chart, which shows how the relative percentages of routes to Iranian networks carried by those providers has changed over the last few days.)   </p>

<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><a href="http://www.renesys.com/blog/assets_c/2009/06/iran-providers-2.shtml" onclick="window.open('http://www.renesys.com/blog/assets_c/2009/06/iran-providers-2.shtml','popup','width=1500,height=450,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false"><img src="http://www.renesys.com/blog/assets_c/2009/06/iran-providers-thumb-800x240-2.png" width="640" height="192" alt="iran-providers.png" class="mt-image-center" style="text-align: center; display: block; margin: 0 auto 20px;" /></a></span>

<p>What happens inside Iran to those bits is anyone's guess (censorship, site blocking, traffic interception, and harassment, from all accounts).   But the pipes are <strong>open</strong> and the traffic is <strong>flowing.</strong>    In a few cases (which I will not detail, for obvious reasons), there are actually direct paths to international carriers, in defiance of government monopoly, that are now getting good use.</p>

<p>I've talked to a lot of people in the last few days about this puzzle.   In a crisis that verges on revolution, the first thing a government typically does is take control of the media that can be controlled, and shut down the media that cannot.</p>

<p><b>Why is it different this time?</b>   There seem to be three basic theories.</p>

<p>
<ul>
	<li><b>The cynics.</b>   Perhaps the government has left the Internet intact so that they can use it to surveil and round up dissidents.  Perhaps they even put bandwidth constraints in place to make it easier to cope with the volumes of traffic that need to be captured and filtered.</li>
       <li><b>The optimists.</b>  Perhaps the government has realized that a modern economy relies on the Internet to such an extent that it cannot be turned off, for fear of disrupting financial transactions and business communications.   Iran's Internet ecosystem is relatively rich, and the impact on their economy of a sustained Internet shutdown would be significant.  Why make it harder for companies to do business in Iran at a time when oil revenues are cratering and foreign investment is looking for reasons to take a walk?
       <li><b>The realists.</b>  Perhaps the government is too busy with other things to worry about the Internet.   Governments aren't well-suited to run the Internet, and they don't completely understand how it works.   The Internet has never been "turned off" before, and it would take creativity and thoughtful action to figure out who to ask in order to get it done.  So it simply hasn't happened, and probably won't.   Good thing, too, because they might not be able to turn it on again.  
</ul></p>

<p>You can pick the theory you like.    I guess I'm a realist, but I'd like to be an optimist.   If you wait long enough, something good can come out of something bad.   If fertility rates hadn't tripled during the long Iran-Iraq war in the 80s,  Iran wouldn't be faced with this demographic bulge of restless 20-somethings who have grown up with the Internet and expect it to keep working. </p>   

<p>You can fight people, but you can't fight human nature, and you can't fight demographics.  Iran is going to follow the same long-term trend as the rest of the developing world: building a civil society based on free expression, global communication, free mobility of human capital, and cross-border investment.   That's what the Internet symbolizes, and yes, if you ask these folks, it's worth fighting for.</p>]]>
    </content>
</entry>

<entry>
    <title>Strange Changes in Iranian Transit</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/06/strange-changes-in-iranian-int.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.158</id>

    <published>2009-06-14T12:33:22Z</published>
    <updated>2009-06-14T15:16:13Z</updated>

    <summary> Many media sources have reported outages in Iranian mobile networks and Internet services in the wake of Friday&apos;s controversial elections. We took a look at the state of Iranian Internet transit, as seen in the aggregated global routing tables,...</summary>
    <author>
        <name>James Cowie</name>
        
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Politics" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="election" label="election" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="instability" label="instability" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="iran" label="iran" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[
Many media sources have reported <a href="http://www.cbsnews.com/stories/2009/06/14/world/main5087285.shtml">outages</a> in Iranian mobile networks and Internet services in the wake of Friday's controversial elections.   We took a look at the state of Iranian Internet transit, as seen in the aggregated global routing tables, and found that the story is not as clear-cut as has been reported.

]]>
        <![CDATA[
<p>There's no question that something large happened in the Iranian telecom space, and that the timing aligns with the close of voting and the emerging controversy.   Iran typically has a fairly high baseline level of sporadic route instability, due to the country's highly centralized incumbent transit through DCI (Data Communications Iran, AS12880) and DCI's somewhat peripheral connectivity to the main east-west conduits for data.   Even so, we started seeing spikes of route instability (changes in the paths to Iranian IP space) starting around 08:05 UTC on Saturday (just after noon in Tehran) that were significantly larger than normally expected.  These bursts affected as many as 400 prefixes (blocks of IP addresses) &mdash; the majority of Iran's Internet presence. </p> 

<p>
<a href="/blog-support/2009/06/ir-outages.jpg" 
onclick="window.open('/blog-support/2009/06/ir-outages.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="/blog-support/2009/06/ir-outages.jpg" width="690">
</a>
</p>

<p>At 17:48 UTC, instability turned into outage, as more than 180 Iranian networks were withdrawn from the global routing tables, indicating that there were no remaining paths into DCI for that portion of Iranian traffic.   Contrary to media reports, however, the outages were fairly short-lived.   Within a few minutes, half of the outaged population were restored to alternative transit; over the course of an hour, outage levels returned to their normal baseline.    Route instability continued to be fairly high, and that pattern has continued through the night and into Sunday.</p>

<p>What can we say for sure?  Not much, except that Iran remains well-connected to the Internet from a routing perspective.   If I had to guess, I'd say that there are probably a lot more people around the world pulling local content from Iran's providers right now, and that surge of demand is probably contributing to increased congestion and (perhaps) some of the route instability we see.   It wouldn't be unusual for there to be some inbound cyber-mischief as well, from supporters of one or the other side, but so far we only have rumors on that front.</p>

<p>It is interesting to note that the changes in routing that took place were very specific in their impact on DCI's various transit providers, who keep the country connected to the world.  There are six of them:  Turk Telecom (TTNet, AS9121), FLAG (AS15412), Singapore Telecom (AS7473), PCCW (AS3491), Telia (AS1299), and Telecom Italia Sparkle (AS6762).   As the following plot shows, five of them lost Iran's transit, and one of them (Turkish Telecom) was a big gainer.  (Red arrows indicate loss of transit preference from the outside world; green indicates a gain in transit via the given provider.)<a href="/blog-support/2009/06/ir.png" 
onclick="window.open('/blog-support/2009/06/ir.png','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style=" padding:20px;" align="right" src="/blog-support/2009/06/ir.png" width="345">
</a></p>

<p> A transit shift of this magnitude may indicate that something (administrative, or physical) has affected Iran's connection to the submarine cables running east and west &mdash; not a total outage, but some kind of significant impairment.   Turkey has their own, interesting arrangements with Iran for transit, and those are still in good shape (perhaps somewhat congested, having presumably doubled or tripled in transit volume).   It wasn't unusual to see 300ms traceroutes from North America and Europe in this timeframe to many Iranian sites.

</p>

<p>Of course, you have to remember that globally visible routes are the signposts for <strong>inbound</strong> traffic to and through DCI to the local providers; from the outside, there's no telling what the Internet experience of the average person inside Iran is like today.   It sounds as if a lot of content is being <a href="http://www.examiner.com/x-11620-Atlanta-Lesbian-Relationship-Examiner~y2009m6d13-Iran-riots-over-election-results">blocked</a> within the country.   For now, it's a good sign that information continues to flow, and Iran is still connected to the world at large.   <a href="http://www.nytimes.com/2009/06/15/opinion/15iht-edcohen.html">Let's hope they stay connected.</a></p>]]>
    </content>
</entry>

<entry>
    <title>How a Resilient Society Defends Cyberspace</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/05/how-a-resilient-society-defends-cyberspace.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.157</id>

    <published>2009-05-30T02:45:48Z</published>
    <updated>2009-06-14T16:03:42Z</updated>

    <summary>Seventy-five years ago today, on May 29th, 1934, Egyptian private radio stations fell silent, as the government shut them down in favor of a state monopoly on broadcast communication. Egyptian radio &quot;hackers&quot; (as we would style them today) had, over...</summary>
    <author>
        <name>James Cowie</name>
        
    </author>
    
        <category term="Governance" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Politics" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Security" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Society" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>Seventy-five years ago today, on May 29th, 1934, Egyptian private radio stations fell silent, as the government shut them down in favor of a state monopoly on broadcast communication. Egyptian radio "hackers" (as we would style them today) had, over the course of about fifteen years, developed a burgeoning network of unofficial radio stations. They offered listeners an unfiltered, continuous mix of news, gossip, and live entertainment from low-powered transmitters located in private houses and businesses throughout Cairo.</p>

<p>It couldn't last. After two days of official radio silence, on May 31st, official state-sponsored radio stations (run by the Marconi company under special contract) began transmitting a clean slate of government-sanctioned programming, and the brief era of grass-roots Egyptian radio was over.</p>]]>
        <![CDATA[<p>Was there something in the air in the early summer of 1934? Two weeks later, the United States Congress passed the <a href="http://www.thedcoffice.com/34act/index.htm">Communications Act of 1934</a>, which established the Federal Communications Commission, and gave it regulatory authority over not only the US airwaves, but interstate telephone services and telegraphy -- in short, all forms of over-the-air and wireline communications among private citizens. In the United States, as in Egypt, the government's oversight of public and private communications has survived, basically intact, for the last seventy-five years. Telephony replaced telegraphy, and television supplanted radio, and the Internet threatened to absorb all of them in a wave of convergence. But throughout all of the technological change, the US government's role as regulator and defender of the public interest seemed to have survived in recognizable form.<br />
<span class="mt-enclosure mt-enclosure-image" style="display: inline;"><img alt="earth.png" src="http://www.renesys.com/blog/2009/05/29/images/earth.png" width="326" height="294" class="mt-image-right" style="float: right; margin: 0 0 20px 20px;" /></span></p>

<p>That's why it's refreshing to see at least some evidence, in today's release of the long-awaited <a href="http://www.whitehouse.gov/asset.aspx?AssetId=1732">Cyberspace Policy Review</a>, that the Obama administration is taking a long, careful look at how to update the Government's role in "protecting the Internet." The report nods to regulatory precedent, but makes it clear that new regulations are not the first order of business. Instead, the themes of the day are those recognizable to any digital native: bottom-up coordination, transparency, sharing, establishment of trust, resiliency in the face of unknown threats. Leadership comes from above, but it takes the form of continuous measurement and accountability and consensus-building.</p>

<p><br />
Sounds kind of "kumbaya," right? Well, yes. But I suspect there's actually some substance behind the rhetorical styling. </p>

<p>For one thing, the President has consistently, though subtly, identified "resiliency" as a key virtue in the various complex systems that make up the critical infrastructure behind American society. Having just merged the Homeland Security Council and National Security Council, the President is building an integrated National Security Staff that will include both a senior cybersecurity advisor (yet to be named), and a more general "resiliency policy" advisor, who will focus on survivability and flexible response. The emphasis on resilience, on building complex systems that are less brittle and more intelligently flexible and responsive in the face of attacks, are the kinds of wonky concepts that light up the eyes of computer scientists and mathematicians who study complex systems. They aren't there to provide sound bites for the Sunday talk shows (and, indeed, concepts like "resiliency" tend to get a lot less play in the popular press than firmer fare, like "eliminating vulnerabilities" and "shutting out attackers" and even, God help us, "turning off the Internet in case of attack").</p>

<p>The very fact that the report has been some time in arriving, and that it does not contain a laundry list of quick fixes, suggests that the President intends to be appropriately deliberate in building a picture of what, exactly, the government can do to improve its own cybersecurity posture. Of course, it also signals how thickly political the whole cybersecurity issue has become, as various factions battle it out for control and presidential access.</p>

<p>I think we can take some cheer in the fact that the President isn't rushing to action, that the balance between safety and civil liberties appears front and center in the terms of debate, and that there's a general awareness of how much economic and strategic value is created for the American people by the wild global Internet, even as it creates some novel (but managable) risks for continuation of government and critical infrastructure. </p>

<p>1934, thankfully, is a long time gone.</p>]]>
    </content>
</entry>

<entry>
    <title>Reaching Google via Asia?</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/05/google-ntt.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.156</id>

    <published>2009-05-15T01:42:17Z</published>
    <updated>2009-06-14T16:03:11Z</updated>

    <summary> Across the Internet, yesterday, Google users twittered, blogged and emailed that Google search and mail were not usable. And, yesterday afternoon, on Google&apos;s official blog, Urs Hoelzle reported that Google &quot;direct[ed] some [...] web traffic through Asia&quot;....</summary>
    <author>
        <name>Martin A. Brown</name>
        <uri>renesys.com</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[
<p>
Across the Internet, yesterday, Google users twittered, blogged and emailed that Google search and mail were not usable.    And, yesterday afternoon, on 
<a href="http://googleblog.blogspot.com/2009/05/this-is-your-pilot-speaking-now-about.html">Google's official blog</a>, Urs Hoelzle reported that Google "direct[ed] some [...] web traffic through Asia".
</p>
]]>
        <![CDATA[
<p>
Renesys observes the topology of the Internet by collecting routing data from a wide variety of vantage points (peers).  Different peers may choose different routes to the same destination (e.g. Google) depending on their location in the Internet.  Analogously, a driver starting from Los Angeles will choose a different highway to reach New York City than a driver starting in Montreal, Canada.  On the Internet, each peer may choose a different best path or route to a destination.  By recording the choices each peer makes, we can gain some insight into the relative importance of a given route to a destination.
</p>

<p>
The Internet is carved up into IP blocks (called prefixes) which make management of the network easier.  Each provider learns about the prefixes of other providers by passing the knowledge around (like a game of 'telephone').  Google directly advertises or provides service to approximately 190 different prefixes, so the Internet learns how to reach Google based on the relationships Google has.
</p>

<p>
When trying to reach Google, then, any inbound traffic to them needs to pass across other providers with whom Google has a BGP routing relationship.  We'll look at a very obvious shift in the number of other Internet providers who started reaching Google differently during yesterday's event.
</p>

<p>
Even though Google is very well-connected to a diverse set of other networks, two of these relationships predominate.  It is through these relationships that much of the Internet reaches Google.  If we look at Google's primary providers, Level 3 (ASN 3356) and AT&T (ASN 7018), we see that these are very important relationships to Google, since in the case of Level 3, just under 80% of peers choose to reach Google via Level 3 for just over half of all of Google's prefixes.
</p>

<p>
During yesterday's routing instability, Google's prefixes shifted to different relationships.  When automatic, this sort of change is a key feature of redundancy.  Sometimes such changes are motivated from a business change.  Over the course of yesterday's instability, virtually all of our peers, at one time or another, chose to reach many of Google prefixes through NTT (ASN 2914), what had been a far less important relationship.
</p>

<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/MSColumn2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog-support/2009/05/google.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/MSColumn2D.swf"
             flashVars="&dataURL=/blog-support/2009/05/google.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>

<p>
Note that the bar chart above is showing the total percentage of Google's approximately 190 prefixes and the total percentage of peers (different vantage points on the Internet) choosing to reach Google through NTT.  At no single time did the entire Internet choose to reach Google through NTT, but over the course of the event, almost everybody tried to reach Google on this path.
</p>

<p>
Before yesterday's event, NTT was not so important an inbound path to Google.  In fact, NTT carried very few of Google's prefixes (only 16) and was the preferred path to Google for only 10% of our peers.  During several different bursts of announcements, this relationship became a much more important inbound path to Google.
</p>

<p>
In one particular series of routing changes, which started at around 14:45 UTC, several major providers, e.g. Deutsche Telekom (ASN 3320), Teleglobe (ASN 6453), Telia (ASN 1299), and even, peculiarly, AT&T (ASN 7018) started choosing to reach Google through NTT.
</p>

<p>
While yesterday's instability was disruptive to Google users all over the Internet, we are happy to report that it's back to business as usual with one of the most robust services on the Internet.
</p>

]]>
    </content>
</entry>

<entry>
    <title>AfNOG Takes Byte Out of Internet</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/05/byte-me.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.155</id>

    <published>2009-05-05T20:47:11Z</published>
    <updated>2009-06-14T16:02:50Z</updated>

    <summary> A couple of months ago, we discussed how a small Czech provider ended up causing global Internet mayhem by tickling a Cisco bug via a rather ridiculous routing announcement. While it&apos;s easy to fault the instigator of this meltdown,...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
A couple of months ago,
we discussed how a small 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">Czech provider</a>
ended up causing global Internet mayhem by tickling a Cisco bug via a rather ridiculous routing announcement.
While it's easy to fault the instigator of this meltdown,
ultimate responsibility belongs with the vendors of poorly tested code.
If we've learned anything in decades of software engineering,
it is that you can't assume anything about user input.
If you don't check that input for validity,
you are not just being careless, 
you are creating a time bomb that will eventually go off.
Another such bomb went off on Sunday, 3 May 2009,
taking out a large swath of the Internet.
We recount the sorry tale here.
</p>]]>
        <![CDATA[<p>
Sundays are usually pretty quiet from an engineering perspective on the Internet,
so we were surprised by a sudden and prolonged influx of routing updates at our collection sites.
Thankfully, 
we have some very nifty event correlation software,
written by Renesys &uuml;ber-hacker Alin Popescu,
to which I turned to see what was going on.
Alin's code sorts through all the routing activity on the Internet in real time and correlates activity via the attributes they have in common.
We can then say things like "this spike in outages came from networks in Kyrgyzstan as transited via Golden Telecom in Russia", 
potentially providing some insight into the root cause of the problem.
It's really very slick.
However, 
the <em>only</em> problem with pinpointing Sunday's event was that it occurred all over the place.
We saw sharp spikes in outages and instabilities in Indonesia, Bulgaria, Germany, Romania, Brazil, Russia, the United States and many other places.
Upwards of a couple thousand prefixes (networks) were impacted during a period of about 6 hours.
Given the wide reach of the event,
we immediately thought of the Czech situation and wondered if this was another live Internet QA test.
Turns out we were right.
</p>

<p>
To understand the problem,
let's first provide some background.
As readers of this blog will know,
every organization responsible for Internet routing has two things under their control:
block(s) of IP addresses (prefixes) and an identifying number, or 
<a href="http://en.wikipedia.org/wiki/Autonomous_System_Number">ASN</a>.
Like IP addresses themselves,
ASNs are also running out, at least the old style ones.
The original Internet designers allocated only 2 bytes for storing ASNs, 
which means at most 65,536 of them.
Since then,
ASNs have been largely allocated consecutively starting at 1,
with <a href="http://www.phosmail.se/iis/">Phosworks Drift & Services</a>
getting the most recent allocation of ASN 49232.
Reserved (private) ASNs are at the high end of the range, starting at 64512.
So this means that at present we only have about 15 thousand 2-byte ASNs remaining unassigned.
</p>

<p>
If there weren't an alternative,
the 2-byte limitation for ASNs would mean that before too much longer,
there would be no new members of the Internet routing club:
no new ISPs and no companies with multiple providers and direct control of their routing.
Fortunately, there is an alternative available today,
and that is 4-byte ASNs.
<a href="http://www.ripe.net/">RIPE</a> has been assigning them by default 
since the start of 2009.
But unlike IPv6, the proposed alternative to dwindling IPv4 addresses,
4-byte ASNs can seamlessly and safely co-exist with 2-byte ASNs.
No complete overhaul of the Internet is required.
</p>

<p>
Returning to our story,
if 4-byte ASNs are available and interoperate with 2-byte ASNs, 
there shouldn't be any problems with announcing them anyway we want today.
Yeah right.
The <a href="http://www.afnog.org">African Network Operators' Group</a> (AfNOG)
will soon be meeting in Cairo, Egypt, and plans to be multi-homed.
They acquired the 4-byte ASN 327686, which equals 5*(2<sup>16</sup>) + 6 and hence is also written as 5.6,
and planned to use it as stated 
<a href="http://www.merit.edu/mail.archives/nanog/msg17698.html">here</a>:
<blockquote>
One of the aims of this setup is to demonstrate that 32-bit ASNs
do work and people should not steer away from them, especially
since the pool of 16-bit ASNs is shrinking fast.
A showcase network goes a long way in this regard.
</blockquote>
</p>

<p>
Now, AfNOG does have an IPv4 prefix, 196.200.208.0/20, and a traditional 2-byte ASN, namely 37095, at their disposal.
On 30 April at 17:21:49 UTC, 
we started seeing the 2-byte ASN announcing this prefix, 
but with ridiculously long AS paths like 
<ul>
 <li> 24863 37095 37095 37095 37095 37095 37095 37095 37095 37095 37095 37095</li>
</ul>
AS 24863 is LINKdotNET, a local Egyptian provider.
So far, so good.
Then on 3 May at 12:00:30 UTC, 
the above paths get replaced with 4-byte AS prepends, namely, 
<p></p>
<ul>
 <li> 24863 327686 327686 327686 327686 327686 327686 327686 327686 327686 327686 37095 </li>
</ul>
and the global impact is felt immediately, as shown in the following graph.
</p>
<p></p>

<p>
<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/ScrollLine2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog-support/2009/05/afnog.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/ScrollLine2D.swf"
             flashVars="&dataURL=/blog-support/2009/05/afnog.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>
</p>

<p>
So what happened?
Well, not everyone can afford a Cisco or Juniper router.
Some folks make do with cheap Linux boxes running the free routing software called
<a href="http://www.quagga.net/">Quagga</a>.
And you guessed it!
Older versions of Quagga have a buffer overflow problem,
dropping the associated BGP sessions like a brick when they saw this announcement.
The latest Quagga release, 0.99.11, came out on 2 October 2008. 
The prior version had the following gem of a comment.
<pre>
  /* ASN takes 5 chars at least, plus seperator, see below.
   * If there is one differing segment type, we need an additional
   * 2 chars for segment delimiters, and the final '\0'.
   * Hopefully this is large enough to avoid hitting the realloc
   * code below for most common sequences.
   *
   * With 32bit ASNs, this range will increase, but only worth changing
   * once there are significant numbers of ASN >= 100000
   */
</pre>
Or until someone prepends a 4-byte ASN a few times.  Talk about lame.
So this latest experiment in real-time Internet QA identified everyone running old versions of Quagga, 
thanks to the time bomb left behind by one very lazy programmer.
The latest version of Quagga appears to handle this condition a little more carefully,
but I can't help but wonder about any software version that starts with a zero.
</p>

<p></p>
<p>
Looking for the positive,
it was nice to see some diversity on the Internet with respect to routing.
Imagine if this had been a Cisco bug?
Still we found it surprising to see the number and variety of organizations running Quagga in a production environment.
The following graph gives the per-country distribution of prefixes for every country with at least 10 outages due to this bug.
</p>

<p>
<body bgcolor="#ffffff">
 <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
         codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
         width="690" height="400"
         id="Line" >
    <param name="movie" value="/fusioncharts/charts/MSColumn2D.swf" />
    <param name="FlashVars" value="&dataURL=/blog-support/2009/05/outages_per_country_absolute.xml" />
    <param name="quality" value="high" />
    <embed src="/fusioncharts/charts/MSColumn2D.swf"
           flashVars="&dataURL=/blog-support/2009/05/outages_per_country_absolute.xml"
           quality="high"
           width="690" height="400"
           name="graphs"
           type="application/x-shockwave-flash"
           pluginspage="http://www.macromedia.com/go/getflashplayer" />
 </object>
</body>
</p>

<p>
The lessons are obvious.
If you don't check user input and don't allocate sufficient memory,
bad things will ultimately happen.
Every possible permutation that you can think of, and perhaps some that you can't, 
will eventually be tried by someone,
either accidentally or on purpose.
So far, we're mainly seeing the former, 
which is good news.
Ultimately though, bugs need to be found in lab, 
not in the production Internet.
</p>]]>
    </content>
</entry>

<entry>
    <title>The Blind Routing the Blind</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/05/keeping-score.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.154</id>

    <published>2009-05-04T11:00:00Z</published>
    <updated>2009-06-14T16:02:25Z</updated>

    <summary> In our last blog entry, we talked about measuring the state of routing anarchy that exists on the Internet on a per-country basis. We looked at every routed network (prefix) by country of origin and tried to answer the...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
In our 
<a href="http://www.renesys.com/blog/2009/03/compliance-scoring-by-country.shtml">last blog entry</a>,
we talked about measuring the state of routing anarchy that exists on the Internet on a per-country basis.
We looked at every routed network (prefix) by country of origin and tried to answer the question: do folks
<em><a href="http://www.youtube.com/watch?v=lzymBKGV8rw">do what they say and say what they do</a></em>,
as articulated via routing registries?
Although many manage to administer their routes with care,
the overall results are quite varied.
And without some way of verifying routes via some authoritative source, 
we are left only with the current system of believing everything we're told and hoping for the best.
The dangers of such a system are 
<a href="http://www.renesys.com/blog/2008/02/pakistan-hijacks-youtube-1.shtml">demonstrated dramatically</a> from time to time.
</p>

<p>
Although they certainly could,
countries typically don't exercise any control over the routing hygiene of the companies operating within their borders.
Countries might tax those companies, 
<a href="http://map.opennet.net/">filter their traffic</a> for objectionable content, 
mandate the types of software or equipment they can use and even spy on them, 
but if a company wants to screw up routing on the global Internet, 
well that's their business.
As we've noted in 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">the past</a>,
no driver's license is required on the Information Superhighway,
as there are essentially no rules, regulations or enforcement.
So in this blog entry,
we'll apply our scoring idea to those who can easily effect change,
namely, those organizations who are ultimately responsible for how traffic flows on the Internet.
<p>]]>
        <![CDATA[<p>
At first, this exercise might seem pretty straightforward: pick your favorite company or organization, compare their stated routing policies to reality, and give them a score based on the difference.
But on the Internet, nothing is ever so simple,
especially when it involves any sort of attribution.
Simply identifying a company's Internet presence is complicated by mergers, buyouts, spin-offs and other aspects of modern capitalism.
How can you hope to sort it all out?
Well, there is only one thing you can really count on:
anyone managing routing on the Internet must have one or more blocks of IP addresses (prefixes) and at least one "identifying" number, an
<a href="http://en.wikipedia.org/wiki/Autonomous_System_Number">Autonomous System Number (ASN)</a>.
At some point in time, these things must have been doled out by some authority of sorts.
That's about it.
The person who requested these values (essentially a bunch of integers) might be long dead as might be the original company.
And the associated administrative records might no longer be accurate or even maintained, 
but some IP addresses and an ASN are all you need to route traffic on the Internet.
Once you've acquired these, 
you are free to proudly assert your presence to humanity and thereby consume a little bit of memory in every router on the planet.
</p>

<p>
In fairness,
the overwhelming number of organizations we see on the Internet have only one or a handful of prefixes.
These organizations are easy to identify, 
as they have little to begin with and hence little to have gone stale.
But large companies and old timers to the Internet are rarely so fortunate.
Let's take <a href="http://en.wikipedia.org/wiki/AT%26T">AT&amp;T</a> as one example.
In the US,
AT&amp;T was <em>the phone company</em> for a very long time, 
until they were broken up by the courts and a newly formed AT&amp;T became just a much smaller part of the original whole.
This greatly diminished AT&amp;T was ultimately acquired by 
<a href="http://en.wikipedia.org/wiki/SBC_Communications">SBC Communications</a>,
as was Ameritech, Bell South and others,
and eventually the entire collection was renamed to simply "AT&amp;T".
</p>

<p>
So what does AT&amp;T look like on the Internet today?  
Old hands will think of it as ASN 7018,
but we can actually associate almost 100 <em>different</em> ASNs with this single organization, 
about half of which are announcing prefixes.
These are registered under a variety of names such as AT&amp;T Network Systems, 
AT&amp;T Internet Services, AT&amp;T Global Network Services and almost 40 others.
We even still see names containing SBC (and no mention of AT&amp;T)
and the
<a href="http://your.rogers.com/aboutrogers/historyofrogers/overview.asp">now extinct</a>
<a href="http://en.wikipedia.org/wiki/Rogers_Wireless">Rogers AT&amp;T Wireless 
(AS 20453)</a>.
However, AS 20453 continues to announce Rogers' prefixes (over 70 of them) on the Internet, 
but AT&amp;T is not one of their providers.
So while Rogers ended their partnership with AT&amp;T,
no one updated the associated ASN registration.
Talk about confusing.
</p>

<p>
The situation looks grim
until you realize that there are actually three sources of (potentially imperfect) information that can be combined to help ferret out the truth:
<ul>
  <li>registration information for ASNs and prefixes</li>
  <li>current routing information</li>
  <li>open source intelligence, e.g., news, press releases, Wikipedia, etc</li>
</ul>
I won't go into the details here, 
but we employ all three sources appropriately in an attempt to categorically define each organization on the Internet.
While registration information is often incomplete and outdated,
strings like AT&T and Verizon are unlikely to appear outside of those organizations,
whereas others like Internet or Inc. have essentially no informational content.
Other terms can be associated because of known business relationship or brand names, 
such as 
<a href="http://en.wikipedia.org/wiki/Hotmail">Microsoft and Hotmail</a> or 
<a href="http://en.wikipedia.org/wiki/Sprint_Nextel">Sprint and Nextel</a>.
But since registration information can't be trusted,
the associations that are uncovered need to be checked via routing data.
This is how we discovered the Rogers AT&amp;T Wireless problem.
It just didn't make sense that an "AT&amp;T entity" was not routing through AT&amp;T and a quick Internet search confirmed that Rogers AT&amp;T Wireless did not belong to the AT&amp;T organization.
</p>

<p></p><p>
Intelligently analyzing these different inputs,
we can finally list out all the organizations on the Internet:
their ASNs (if any) and associated prefixes.
And once we have that,
we can give them a score based on their routing hygiene,
exactly as we did in our 
<a href="http://www.renesys.com/blog/2009/03/compliance-scoring-by-country.shtml">blog entry</a> for individual countries.
Despite the fact that there are only around 31,000 unique ASNs seen on the Internet at present, 
we manage to find around 75,000 organizations,
as not everyone with a registered prefix also has an ASN. 
(In these cases, their provider typically handles the routing of their prefix.)
</p>

<p>
We found over 13,000 organizations who managed a perfect score of 100, 
although all but 218 of them had fewer than 10 prefixes.
While a small number of prefixes makes their administration easier,
it is not a guarantee of success.
One organization with 2 prefixes scored an embarrassing 9 out of a 100 on our scale.
For those with at least 10 prefixes to manage,
the following scatter plot gives the distribution.
As we saw with countries,
the more prefixes you have,
the harder it is to keep everything straight and attain a good score.
And while many manage to do a very good job, 
the overall results are quite varied, 
especially when you recall that doing essentially nothing will land you around 25 on our scale,
and you have to do less than nothing to rate below that.
Still almost 5,000 organizations manage to score below 25.
</p>

<p>
<a href="http://www.renesys.com/blog-support/2009/05/org_comp.jpg" 
onclick="window.open('http://www.renesys.com/blog-support/2009/05/org_comp.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog-support/2009/05/org_comp.jpg" width="690">
</a>
</p>

<p>
The overall winner in this exercise was 
<a href="http://www.telecom.kz/">Kazakh Telecom</a>
with a perfect score of 100 for over 150 prefixes.
<a href="http://www.airtel.in/">Airtel Broadband and Telephone Services (ABTS)</a> came within a quarter of a point of a perfect score for their couple of hundred prefixes.
At the other end of the spectrum,
China Enterprise Communications Ltd. (CEC),
in which 
<a href="http://www.tatacommunications.com/">Tata Communications</a>
recently acquired a 
<a href="http://www.telegeography.com/cu/article.php?article_id=23637">50% stake</a>,
barely managed a score of 20.
CEC is around one point behind our second to last place finisher,
the <a href="http://www.aarp.org">AARP</a>,
a US non-profit organization for people over 50.
Overall, it's clear that there is a lot of room for improvement for most organizations,
but if the Internet is ever going to have any measure of accountability, 
it is going to have to start with those who control the IP space and orchestrate global routing.
Without attribution,
there can be no accountability and no hope of improving upon the current situation.
</p>

<p>
<b>Update:</b>
The earlier version of this blog inadvertently misclassified a number of registered networks due to a software bug in a script used for displaying the data.
The scoring algorithm remains unchanged.
</p>]]>
    </content>
</entry>

<entry>
    <title>Route Hygiene: The Dirt on the Internet</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/03/compliance-scoring-by-country.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.153</id>

    <published>2009-03-25T20:00:00Z</published>
    <updated>2009-05-13T23:36:16Z</updated>

    <summary> Since Renesys maintains large quantities of data on the Internet going back many years, we sometimes get the question: If you guys are watching the entire &apos;net, why don&apos;t you just warn people when things break? My response is...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
Since Renesys maintains large quantities of data on the Internet going back many years, 
we sometimes get the question:  
<em>If you guys are watching the entire 'net, 
why don't you just warn people when things break?</em>
My response is generally along the lines of:
<em>Sure we can do that.
Simply tell us the correct state of the Internet at each moment in time 
and we'll alert you to any operational differences we observe.</em>
This is generally met with silence.
</p>

<p>
Renesys can tell you a lot about the <em>current</em> state of the Internet,
but absolutely no one can tell you the <em>correct</em> state.
And that is because no one is in charge,
and so there is no central authoritative source of information. 
Think of the Internet as a highway system where anyone can buy a car and simply start driving:
no need to register the car, attach a license plate, buy insurance or get a 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">driver's license</a>.
You don't even have to show an id or be sober.  
Just pay some fees, buy some equipment, hook up and go.
The barrier to entry really is that low.
</p>

<p>
Obviously, this arrangement can cause some problems.
When
<a href="http://www.renesys.com/blog/2008/02/pakistan-hijacks-youtube-1.shtml">Pakistan hijacked YouTube</a> last year by announcing YouTube IP space, 
out of the hundreds of thousands of routing announcements seen on Internet,
how was anyone to know this particular one was incorrect?
Okay sure, you couldn't get your videos, 
but maybe YouTube had just opened a data center in Karachi and the problem was internal to them?
Without some way of checking the authenticity of routes,
the routers that direct traffic on the Internet simply believe what they are told.
And if the best route to YouTube appears to be via Pakistan, 
then they are all going to use it, no questions asked.
This is not a new problem, and this blog explores an old and largely failed attempt to address it.
We then compare the differences between countries with respect to their routing hygiene.
</p> ]]>
        <![CDATA[<p>
<a href="http://www.renesys.com/blog-support/2009/03/YouTube.jpg" 
onclick="window.open('http://www.renesys.com/blog-support/2009/03/YouTube.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog-support/2009/03/YouTube.jpg" width="690">
</a>
</p>

<p>
The intended solution to this problem was developed way back in the early 1990s
when the concept of a 
<a href="http://en.wikipedia.org/wiki/Internet_Routing_Registry">routing registry</a>
was first proposed.
The idea was to construct a common database of routing information maintained by the appropriate parties,
which could then be used to debug operational problems and filter out incorrect routes.
So for example,
YouTube might have registered its blocks of IP addresses (prefixes) with a registry saying that these prefixes belonged to them.
They might further declare that their prefixes should only be seen originating from the YouTube Autonomous System (AS 36561),
which in turn has the following list of providers.
And so forth.
The typical language for expressing your routing policy, 
known as <a href="http://en.wikipedia.org/wiki/RPSL">RPSL</a>,
is very rich, if somewhat arcane, 
and you can fully describe your Internet presence using it.
This allows other 'net citizens to figure out if what they observe about you on the Internet matches your stated policy.
</p>

<p>
The only problem with registries is that, like most things on the Internet,
no one is required to use them or, if they do use them, is required to keep them up-to-date.
In addition, there are dozens to choose from.  Which one do you pick?
Some are regional, others are run by individual countries, nonprofits, or even Internet service providers.  
Of the 39 registries that Renesys monitors, 
the nonprofit <a href="http://www.merit.edu/">Merit Networks Inc</a> maintains the registry with the largest amount of routing information 
(the <a href="http://www.ra.net/">RADb</a>).  
In fact, it has more than double
the second largest such service, run by <a href="http://www.ripe.net">RIPE NCC</a>.  Others of note are run by
Level 3, Savvis, NTT, and <a href="https://www.arin.net/">ARIN</a>.
So as with most things on the Internet,
there are potentially many sources of information and there is no telling how much of it
is incomplete, outdated or otherwise erroneous.
And while registry data is widely viewed as unreliable at best,
we set out to answer the question more definitively.
For a moment in time (20 March 2009 at 00:00:00 UTC),
we compared the true operational state of the Internet to the explicitly listed "correct" state as articulated in any one of the aforementioned 39 registries.
</p> 

<p>
To conduct this comparison, 
we first had to come up with a scoring mechanism.
We used prefixes as the basic building blocks for our scoring approach
because these can then be rolled up into more meaningful higher level scores,
such as by country or by provider.
For each prefix, 
we looked at only two things:
how is the prefix originated and who are the upstream providers for the originator?
We compared the registry entries (if any) to what we were actually seeing on the Internet
and gave each prefix a score based on the variance.
The best score is 100 and each prefix starts there &mdash; innocent until proven guilty.
Then points are deducted for each difference found between registry data and reality.
The exact algorithm is not terribly important and competing alternatives are easy to suggest,
but what is important is that we tried to set the bar very low,
making a good score easy to obtain.
For example, correct originations are worth much more than correct upstream providers. 
If a prefix has a correct origination and only one upstream,
it will still get a score of 80 even if the upstream is incorrectly registered or missing altogether.
For two incorrect or missing upstreams,
the score drops by only a few points.
Thus, correctly originated prefixes tend to have scores in excess of 70.
And if you register nothing at all for a prefix,
its score will be around 25 or slightly less,
depending on the number of providers.
To get lower scores, 
you have to do <em>less than nothing</em>, 
namely, make lots of mistakes in your registration,
enough to outweigh anything you managed to handle correctly.
</p>

<p>
Once each prefix is scored in this way, 
we compute the average scores for all prefixes in certain classes,
such as those that geolocate to a particular region or are ultimately transited by a given provider. 
We can then compare countries, providers and organizations for their level of routing hygiene, 
i.e., how close does their presence on the Internet match their stated policy?
My expectation was that there would be a few folks who got things completely right, 
but by and large, most would either not have registered their routes or not have done so correctly.
In other words,
I expected most of the world to map to a score of around 25,
with a few outliers around 100 and very little in between.
That turns out not to be the case.
The average score for all of planet Earth was fairly respectable 65.6 for around 280,000
routed prefixes.
And there was quite a wide range of scores and surprising differences between countries.
</p>

<p>
<a href="http://www.renesys.com/blog-support/2009/03/compliance.jpg" 
onclick="window.open('http://www.renesys.com/blog-support/2009/03/compliance.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog-support/2009/03/compliance.jpg" width="690">
</a>
</p>

<p>
The above scatter plot displays our per-country scoring results.
Each point on the graph represents one of the 228 countries or territories that we see originating prefixes on the Internet.
Since it is much easier for countries with few prefixes to get a high score, 
we plot the prefix count versus the total score for each country.
Thus, we'd expect countries lower on the graph (fewer prefixes) to have higher scores,
but that is generally not the case.
Five tiny countries manage a perfect score of 100, namely,
<a href="http://en.wikipedia.org/wiki/San_Marino">San Marino</a>,
<a href="http://en.wikipedia.org/wiki/Djibouti">Djibouti</a>,
<a href="http://en.wikipedia.org/wiki/Faroe_Islands">Faroe Islands</a>,
<a href="http://en.wikipedia.org/wiki/%C3%85land_Islands">&Aring;land Islands</a>, and
<a href="http://en.wikipedia.org/wiki/Cape_Verde">Cape Verde</a>.
The bottom three, all with scores below 25, are also quite small:
<a href="http://en.wikipedia.org/wiki/Saint_Helena">Saint Helena</a>,
<a href="http://en.wikipedia.org/wiki/Equatorial_Guinea">Equatorial Guinea</a>, and
<a href="http://en.wikipedia.org/wiki/Senegal">Senegal</a>.
</p>

<p>
It is important to compare countries with a similar number of prefixes,
i.e., those that appear in the same horizontal band of the graph.
For example,
<a href="http://en.wikipedia.org/wiki/Ukraine">Ukraine</a> is in first place in their band
(1,000 &mdash; 10,000 prefixes),
while 
<a href="http://en.wikipedia.org/wiki/Chile">Chile</a> is last.
The US is in its own band as the only country with over 100,000 prefixes,
but comes in with a below-average score of 50.5.
Of the major powers,
Russia is way ahead of the others with an 88.4,
while China manages only a miserable 45.4.
And finally, 
we note that Korea (84.9) nudges out India (84.0) to win their band.
</p>

<p>
If you ignore prefix count entirely,
the US ranks 191<sup>st</sup> out of 228 countries.
If instead you consider only countries with at least 1000 prefixes,
i.e., those with a rich Internet infrastructure,
the US ranks 34<sup>th</sup> out of 39 countries,
just behind Mexico and just ahead of Colombia.
The top 39 such Internet-rich countries are listed below by decreasing score.
<ol>
 <li>Ukraine</li>
 <li>Thailand</li>
 <li>Poland</li>
 <li>Austria</li>
 <li>Switzerland</li>
 <li>Romania</li>
 <li>Russia</li>
 <li>Pakistan</li>
 <li>Sweden</li>
 <li>Germany</li>
 <li>Korea</li>
 <li>Japan</li>
 <li>India</li>
 <li>Indonesia</li>
 <li>Bulgaria</li>
 <li>Italy</li>
 <li>Egypt</li>
 <li>United Kingdom</li>
 <li>Israel</li>
 <li>Netherlands</li>
 <li>Australia</li>
 <li>South Africa</li>
 <li>Philippines</li>
 <li>Hong Kong</li>
 <li>Singapore</li>
 <li>Canada</li>
 <li>Taiwan</li>
 <li>Spain</li>
 <li>New Zealand</li>
 <li>France</li>
 <li>Turkey</li>
 <li>Brazil</li>
 <li>Mexico</li>
 <li><b>United States</b></li>
 <li>Colombia</li>
 <li>Ecuador</li>
 <li>China</li>
 <li>Argentina</li>
 <li>Chile</li>
</ol>
</p>

<p>
In my view,
if the Internet is ever going to avoid degenerating into nothing more than a den of thieves and predators,
we have to introduce some accountability somewhere.
Knowing the assigned IP addresses of each organization and their authorized providers is a good first step, and it doesn't seem to be asking all that much.
And while a lot of the world falls short of accurately providing even this minimal information to some "authority",
I couldn't help but notice that over 7,300 autonomous systems managed to get a perfect score of 100 in our system, although 63% of them had only 1 prefix to worry about.
Of those with a perfect score,
Daewoo Information Systems (AS 4961) manages the most prefixes at just under 100.
The remaining 24,000 or so ASes could easily do a better job,
and if they did, 
the registry information could be used as was originally intended,
to both solve and avoid problems.
</p>

<p>
<u>Notes:</u>
<ul>
  <li>Renesys computes these scores and others on a daily basis for the purposes of analysis, trending and product offerings.</li>
  <li>For the most part, individual countries listed here have little control over prefixes registered in their country.  That is, the scores are more a reflection of the businesses found in each country, rather than their governments.</li>
  <li>Richard A. Steenbergen 
       <a href="http://nanog.org/meetings/nanog44/presentations/Tuesday/RAS_irrdata_N44.pdf">presented work</a> on registry scoring of select providers at
       <a href="http://nanog.org/meetings/nanog44/">NANOG 44</a>.</li>
  <li>While writing this blog, 
        I was alerted to similar work being done by the <a href="http://www.nist.gov/index.html">US National Institute of Standards and Technology</a> and described <a href="http://www.antd.nist.gov/bgp_security/publications/ARIN_NetHandle_OriginAS_Analysis.pdf">here</a>.</li>
</ul>
</p>

<p>
<b>Update:</b>
The earlier version of this blog inadvertently misclassified a number of registered networks due to a software bug in a script used for displaying the data.
The scoring algorithm remains unchanged.
</p>]]>
    </content>
</entry>

<entry>
    <title>Longer is not always better</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/02/longer-is-not-better.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.152</id>

    <published>2009-02-21T19:30:30Z</published>
    <updated>2009-03-27T19:59:03Z</updated>

    <summary> This post is a follow-up to our blog last week about a small Czech provider briefly causing global Internet mayhem via a single errant routing announcement. In this incident, SuproNet (AS 47868) announced its one prefix, 94.125.216.0/21, to its...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
This post is a follow-up to 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">our blog last week</a>
about a small Czech provider briefly causing global Internet mayhem
via a single errant routing announcement.
In this incident,
SuproNet (AS 47868) announced its one prefix, 94.125.216.0/21, to its backup provider, Sloane Park Property Trust (AS 29113), with an extremely long AS path.
We've gotten more feedback about this entry than any other in recent memory, 
so we thought we'd try to answer some of the questions that were posed both here and elsewhere, 
as well as provide some clarification about exactly what went on.
The questions we try to address include:
<ul>
	<li>How could anyone be this dumb?</li>
	<li>Why did this cascade throughout the planet?</li>
	<li>Can you provide more details about the impact and its spread?</li>
	<li>How do we prevent this from happening again?</li>
</ul>
</p>]]>
        <![CDATA[
<p><b>How could anyone be this dumb?</b></p>

<p>
I'll admit that this was my first thought.
And since this incident interrupted my lunch,
I was only too happy to join the mob.
In hindsight, 
my reaction was due to the fact that my router experience is largely limited to Cisco gear and their router software, known as IOS.
For example,
suppose SuproNet was using a Cisco and wanted to prepend their ASN (47868)
an additional four times to announcements to a particular provider.
They could use something like the following, 
where the string of x's refers to the IP address of the provider's router.
Notice that I had to explicitly list 47868 four times.
</p>

<p>
<pre>
     neighbor xx.xx.xx.xx route-map longerisbetter out
     route-map longerisbetter permit 10
       set as-path prepend 47868 47868 47868 47868
</pre>
</p>

<p>
The is a common way of prepending in Cisco IOS,
so I naturally thought, 
who would be so dumb as to type (or cut-and-paste or whatever) their own ASN hundreds
of times into a configuration for a router?
<em>Who?</em>
The only problem with this line of reasoning is that SuproNet wasn't using a Cisco.
They were apparently using a router from
<a href="http://www.mikrotik.com/aboutus.php">MikroTik</a>,
a router vendor from Latvia,
as first reported in this 
<a href="http://www.lupa.cz/clanky/maly-cesky-isp-zpusobil-svetovy-kolaps/">Czech blog</a>.
MikroTik obviously targets the Czech market since they have a local language 
<a href="http://www.mikrotik.cz">web page and domain</a>.
</p>

<p>
So how do you prepend on a MikroTik?
According to their
<a href="http://www.mikrotik.com/testdocs/ros/3.0/ip/route_content.php">on-line manual</a>,
you set the following variable in an appropriate configuration mode.
</p>

<p>
<pre>
bgp-prepend (integer: 0..16) - number which indicates how many times to prepend AS_NAME to AS_PATH
</pre>
</p>

<p>
So if SuproNet was thinking "Cisco IOS", 
they might have typed "bgp-prepend 47868" to prepend 47868 once.
However, this would be a mistake as this router is expecting a count, not an ASN.
So at this point,
it would be reasonable to expect the MikroTik to report something like "value out of range".
Let's assume they didn't do any range checking on the input value and let's assume
they devoted one byte (8 bits) to store this value.
One byte can represent all integers from 0 to 255.
So what happens when you try to stuff something larger, like 47868, into one byte?
You get 47868 modulo 256 (i.e., the remainder after dividing of 47868 by 256),
which equals 252.
As <a href="http://www.merit.edu/mail.archives/nanog/msg15730.html">Mikael Abrahamsson</a> first noticed, 
this was the exact number of prepends of 47868 he was seeing.
So I went back and looked at the copious number of announcements we saw of SuproNet's prefix and guess what?
Every single one had 252 prepends of 47868,
leading me to conclude that this was the exact number sent out by SuproNet.
Originally I was thinking the number of prepends probably varied based how these long paths were being truncated and that it was this random truncation that was causing part of the problem.
</p>

<p>
And using this clue,
Ivan Pepelnjak was able to spell out exactly what happened in 
<a href="http://blog.ioshints.info/2009/02/oversized-as-paths-cisco-ios-bug.html">his blog</a>.  
As it turns out, 
the reason for all those routing resets and general instability was due to a previously unknown Cisco bug involving AS paths close to 255 in length.
If you try to prepend to a long path that you receive and by doing so, 
create a path longer than 255, you are toast.
So the maps we gave in our 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">our last blog</a> were more of an indication of Cisco market share (at least among prependers), 
rather than the propensity of outdated routers.
Kudos to Ivan for figuring this out.
</p>

<p>
In summary, 
we have a situation where a single careless operator in the Czech Republic tickled one bug (i.e., lack of bounds checking in the MikroTik router) that in turn tickled another bug 
(i.e., a problem with long AS paths on a Cisco).
And the result was global Internet instability due to prevalence of Cisco gear in the market.
But in fairness to MikroTik
we note that 
<a href="http://www.merit.edu/mail.archives/nanog/msg15773.html">Mathias Sundman</a> 
observes that bounds checking does now exist in version 3.20 of their router software.
</p>

<p><b>Why did this cascade throughout the planet?</b></p>

<p>
Short answer:  There is a bug in Cisco IOS with regard to long AS paths and lots of folks use Cisco gear.
Longer answer:  Most ISPs apparently do not filter out announcements with long AS paths.  As we noted in 
<a href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml">our previous blog</a> on this topic,
we are all fairly close to one another on the Internet and there is really no reason to be seeing excessively long AS paths.
Such paths only indicate a problem or a clueless operator or both, 
and can be safely discarded.
The fact that they were not dropped allowed them to tickle this bug on many Cisco boxes along the way.
</p>

<p>
Since we are all just a few AS hops away from each other, 
the problem only occurred because the paths originated from SuproNet were so close to 255.
This allowed them to reach the core of the Internet and continue onto other edge networks 
<em>before</em> exceeding the 255 path length boundary.
It was only when they did that all hell broke loose,
far away from the original source of the problem.
As Andree Toonk provides on 
<a href="http://bgpmon.net/maxASpath.php">this page</a>,
there are apparently others who have made the same mistake on MikroTik routers.
AS 20912, Panservice of Italy, is doing it as of this writing, but 20912 modulo 256 is 
<em>only</em> 176 and these announcements are apparently not causing a problem.
</p>

<p><b>Can you provide more details about the impact and its spread?</b></p>

<p>
This was an easy one, 
as Renesys monitors every prefix (network) seen on the Internet and computes their stability over time.
We also geo-locate them as accurately as possible.
Thus we can see events like this propagate through the planet and 
<a href="http://earth.google.com/">Google Earth</a> provides an
excellent way of performing the visualization.
We used it to show every <em>newly unstable</em> prefix in during the hour before and the hour of the incident.
Here are a few composite images taken from Google Earth of a few regions and an indication of all the unstable prefixes seen during the 2-hour period.
We start with the US where the impact was the greatest.
</p>

<p>
<a href="http://www.renesys.com/blog/2009/02/us.jpg" 
onclick="window.open('http://www.renesys.com/blog/2009/02/us.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2009/02/us.jpg" width="690">
</a>
</p>

<p>
Next up is the heart of South America,
where Cisco obviously needs to send some sales folks.
(Before someone points out the population density of South America relative to the US,
we noted in our last blog that South America was the least impacted continent <em>on a percentage basis</em>.)
</p>

<p>
<a href="http://www.renesys.com/blog/2009/02/sa.jpg" 
onclick="window.open('http://www.renesys.com/blog/2009/02/sa.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2009/02/sa.jpg" width="690">
</a>
</p>

<p>
Finally, we take a look at Europe, 
where all the trouble started.
</p>

<p>
<a href="http://www.renesys.com/blog/2009/02/europe.jpg" 
onclick="window.open('http://www.renesys.com/blog/2009/02/europe.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2009/02/europe.jpg" width="690">
</a>
</p>

<p><b>How do we prevent this from happening again?</b></p>

<p>
This one is really about assigning blame and there is plenty to go around.
But before we get too caught up with that,
keep in mind that this was really the 
<a href="http://en.wikipedia.org/wiki/Perfect_storm">perfect storm</a>.
As of today,
Renesys has observed 31,188 unique non-private ASNs on the Internet over the last few weeks.
If you compute modulo 256 of each of them,
you get 731 with associated values &ge; 250 or 2.3% of the total.
There is nothing special about 250. 
However, the likelihood of a problem decreases significantly as the values get lower, and
250 seems like a reasonable cutoff, given typical path lengths in the Internet.
And there are still only 1,919 ASNs whose modulo 256 value is &ge; 240,
or 6.2% of the total.
Thus for this event to have occurred at all,
besides the bugs in the router software of two vendors,
only a few percent of the ASes on the Internet could have possibly initiated the meltdown,
but only if they had a careless operator and an obscure Latvian router with outdated software.
How likely was that?
</p>

<p>
As for the blame,
network operators (SuproNet) should obviously read their router documentation and test any proposed changes in a lab environment to see if they get the results they expect.
Router vendors should check bounds on input parameters (MikroTik) and 
on boundary conditions (Cisco).
ISPs should filter out obvious useless garbage,
like ridiculously long AS paths and unrouteable (private) IP addresses.
They obviously don't,
given the scope of the event.
And who designed this BGP routing protocol anyway?
What were <em>they</em> thinking?
</p>

<p>
Seriously, the reason for the success of the Internet is because it is not under the control of any one government or company.
Because of this fact,
it is both cheap and ubiquitous.
But because there is no centralized control or authority,
we are largely at the mercy of the weakest link.
Sure there is plenty we can do to prevent things like this from happening again,
but there will always be the next perfect storm.  
Who could have guessed something like this could have happened?
You won't be able to guess the next one either.
The happy ending to this story is that the community quickly rallied and worked together to both identify and mitigate the problem.
No meetings were held, no bailouts were requested and not a single lawyer was needed to draft an agreement.
The Internet was back to normal in short order.
</p>]]>
    </content>
</entry>

<entry>
    <title>To Catch a Thief</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/02/stealing-the-internet-back-1.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.150</id>

    <published>2009-02-19T19:45:00Z</published>
    <updated>2009-03-06T12:37:37Z</updated>

    <summary> Last August at DEFCON, Alex Pilosov and Tony Kapela presented a talk entitled Stealing the Internet: An Internet Scale Man-In-The-Middle Attack, which illustrated a technique for misdirecting specific Internet traffic via carefully constructed BGP routing messages. Using this approach,...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
Last August at 
<a href="http://www.defcon.org/html/defcon-16/dc-16-post.html">DEFCON</a>,
Alex Pilosov and Tony Kapela presented a talk entitled
<a href="https://www.defcon.org/images/defcon-16/dc16-presentations/defcon-16-pilosov-kapela.pdf">Stealing the Internet: An Internet Scale Man-In-The-Middle Attack</a>,
which illustrated a technique for misdirecting specific Internet traffic via carefully constructed BGP routing messages.
Using this approach, 
an attacker can redirect the incoming traffic of <em>any </em> victim through his own site for further inspection or alteration before ultimately passing it on to the victim.
Furthermore, the attack can be carried out in a way that is largely transparent to the victim.
Since this talk,
Renesys staff have been repeatedly asked "So are people using this technique today?"
That is, are people currently "stealing the Internet", and if so, who is attacking whom?
Given the volume of routing data that Renesys has at our disposal and the number of tools we have to slice and dice it, 
we thought this would be a relatively straightforward question to answer.  
<em>We were wrong.</em>
</p>

<p>
Although we ultimately succeeded in answering the question and in developing a general Man-In-The-Middle (MITM) detection algorithm for the global Internet,
we ended up writing a lot of code over the course of several months and burning through endless CPU cycles looking for attack evidence.
Our results were presented this week at
<a href="http://www.blackhat.com/html/bh-dc-09/bh-dc-09-main.html">Black Hat</a>
and the complete presentation can be found
<a href="http://www.renesys.com/tech/presentations/pdf/blackhat-09.pdf">here</a>.
In this blog, we'll hit on some of the highlights from the presentation.
</p>]]>
        <![CDATA[<p><b>Stealing the Internet</b></p>
<p>
As in the infamous example of 
<a href="http://www.renesys.com/blog/2008/02/pakistan-hijacks-youtube-1.shtml">Pakistan hijacking YouTube</a>,
the MITM attack relies on the attacker announcing a more-specific of a prefix announced by the victim.
Since the route to the most specific prefix wins, 
the attacker gets all traffic intended for that more-specific, 
rather than the rightful owner (victim).
The new trick is to then pass the misdirected traffic to the victim in a way that is transparent to him.
This is accomplished by creating a viable path from the attacker to the victim, 
one that is blind to the hijack and hence serves as an avenue for returning the hijacked traffic.
This is accomplished by manipulating the Autonomous System (AS) paths that get passed around with each prefix announcement.
Relative to the more-specific prefix used in the hijack, 
the attacker simply includes the ASes found on the viable path between himself and the victim, which serves to blind these ASes to his bogus announcement.
This works because in order to prevent the endless circulation of routes,
routers will not accept any routes that they think they've already seen.
So while most of the Internet will see the attacker's hijack and act accordingly, 
those ASes on the path between the attacker and his victim will not,
providing a pathway to ultimately return the stolen traffic on to the victim.
That is a pretty fast summary, 
but all the details can be found in the aforementioned talks.
</p>

<p><b>Looking for Clues</b></p>
<p>
So the question is: 
Is someone snooping on your incoming traffic?  
How about on the traffic you send to your business partners?
It turns out the first question is relatively easy to answer, 
while the second is extremely difficult.
The reason the former is easy is because you presumably know your own network.  
Since the attack relies on injecting a more-specific of one of your prefixes into the global routing table, 
this is something you can have a route monitoring service watch for you.
You cannot observe this yourself, 
since you are blind to the attack.
However, most of the Internet is not blinded,
so if a more-specific of one of your prefixes appears and you didn't authorize it,
then someone else is messing with you.
</p>

<p>
But knowing if this is going on in general in the Internet is quite another matter, 
since no one can tell you exactly how every prefix should be appearing at any given point in time.
There are over 270,000 of them and new more-specifics are constantly being announced.
Which ones are legitimate and which ones are hijacks?
At a glance, no one can say, since there is no central authoritative source to consult.
So, it might seem that this question is impossible to answer, 
but a closer look gives us the clues we need.
</p>

<p>
Conveniently, the MITM attack was demonstrated at the DEFCON conference, 
so we had at least one known example to study.  
In this case, 
the victim was Sparkplug Las Vegas, Inc. (AS 20195),
who had SWITCH Communications Group (AS 23005) as their sole provider.
The hijacker was Pilosoft (AS 26627).
The clues we need to automatically detect such hijacks can be found in the associated AS paths.
Looking at the tail end of the relevant AS paths moments before the hijack took place,
we see that they all have the following form.
Namely, AS 20195 must go through its one provider, AS 23005, but from there the paths
branch out to any of the several providers of AS 23005.
<ul>
  <li>19151 23005 20195</li>
  <li>22822 23005 20195</li>
  <li>  3356 23005 20195</li>
</ul>
</p>

<p>
Now let's look at the tails of a few relevant paths at the start of the hijack.  
Notice any difference?
<ul>
  <li>3356 3561 26627 4436 22822 23005 20195</li>
  <li>3549 3561 26627 4436 22822 23005 20195</li>
  <li>1239 3561 26627 4436 22822 23005 20195</li>
</ul>
These paths all end with a long <em>globally unvarying</em> segment that was purposely injected by the hijacker to blind ASes on the path from him to the victim.
The artificially constructed segment of the path is 4436 22822 23005 20195,
which is peculiar for several reasons.
First, AS 23005 had 4 providers at the time of the attack
and AS 22822 (Limelight) had 8 providers and 147 peers.
Yet from all of these possible routes,
the entire world appears to have selected the same path through them to the victim.
Second, none of the ASNs in the artificial segment of the path were seen to have actually announced anything themselves &mdash; they were silent ASNs (with respect to this prefix).  They were seen only in this longer path, 
suggesting that they were added by a third party in order to blind them.
</p>

<p></p>
<p>
And there is even more.
Renesys computes the business relationships between all the ordered pairs of adjacent ASes we observe.
These fall into one of a handful of possible categories:
a customer to a provider relationship, a provider to a customer, two peers, and a few other, 
less common and more complex arrangements.
If we draw the first artificial path given above with peers side by side and customers below their providers, we get the following depiction.
<br>
<br>
<a href="http://www.renesys.com/blog/2009/02/path1.jpg" 
onclick="window.open('http://www.renesys.com/blog/2009/02/path1.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img src="http://www.renesys.com/blog/2009/02/path1.jpg" width="690">
</a>
<br>
<br>
And this path makes no sense, 
as it violates the 
<a href="http://www.renesys.com/blog/2009/02/gao.pdf">valley-free property</a>
of Internet routing.
An AS in the "valley" (e.g., 26627) is transiting traffic for free, i.e., throwing money out.
And while mistakes do occur, 
one business does not typically provide some other unrelated business with free Internet service.
Similarly, multiple peering links (e.g., 4436_22822 and 3356_3561) in one path means that someone is providing free transit and thus is another red flag.
</p>

<p><b>Catching the Thief</b></p>
<p>
To alert on MITM attacks on the entire global Internet,
we start by examining all the new more-specifics seen in a day.
There will be hundreds of these to consider, 
and most days all of them will be perfectly legitimate.
We then carefully examine the associated AS paths,
looking for monkey business of the type described above.
Full details can be found in our
<a href="http://www.renesys.com/tech/presentations/pdf/blackhat-09.pdf">presentation</a>.
But the short answer is that we take great care not to miss any hijacked more-specifics. 
We assume our attacker does everything possible to cover his tracks,
so we must be equally cautious in avoiding false negatives. 
</p>

<p>
We ran our algorithm on all of our historical global BGP data from 1 July 2008 until 31 January 2009,
sifting through tens of thousands of more-specifics and their associated millions of AS paths.
After all of this, 
we found only three cases that passed all of our tests and looked like actual MITM attacks.
One of these was the actual MITM demonstration conducted at DEFCON.
Of the other two, 
upon visual inspection,
one was almost certainly simple traffic engineering using BGP manipulation.
The other was confirmed by email as a fail-over test gone awry.
</p>

<p><b>Conclusions</b></p>
<p>
The good news is that we can say with a great deal of certainty that the MITM attack is not yet appearing in the wild.
In addition, 
we now have a comprehensive global approach for detecting it when it does ultimately make an appearance,
one that rarely produces a false positive and studiously avoids false negatives.
Only "lucky" corner cases on the edge of the Internet where the attacker is very close to the victim could escape all efforts at detection, 
but these could easily be thwarted by the victim himself. 
</p>

<p>
The bad news is that we've built a global Internet that allows for such an attack in the first place.
And even if you defend your <em>incoming traffic</em> by monitoring for suspicious BGP announcements related to your networks,
you've done nothing to defend the data you send to others.
To guard against interception of your outgoing traffic,
the Internet would require a comprehensive global alarming system, 
one that alerts on any attempt to divert traffic through unusual third parties.
But constructing such a system is non-trivial to get right,
requiring a rich source of global BGP data and sophisticated real-time analytics.
Unfortunately, 
short of getting everyone on the Internet to rigorously police their own networks,
there simply is no other alternative.
The problems outlined here are inherent in how we currently route traffic on the Internet and until we come up with a comprehensive, incremental replacement for BGP and get it widely deployed,
we're going to be living with the consequences.
We can either ignore the problem entirely or work toward a solution, 
while keeping a vigilant eye on our current increasingly wobbly trust-based system.
</p>
 ]]>
    </content>
</entry>

<entry>
    <title>Reckless Driving on the Internet</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/02/the-flap-heard-around-the-worl.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.151</id>

    <published>2009-02-17T03:40:00Z</published>
    <updated>2009-03-06T12:37:06Z</updated>

    <summary> This weekend, John Markoff wrote an interesting piece for the New York Times entitled Do We Need a New Internet? While his emphasis was largely on security, or rather the lack thereof, the central point Markoff makes is that...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
This weekend, 
John Markoff wrote an interesting piece for the New York Times entitled 
<a href="http://www.nytimes.com/2009/02/15/weekinreview/15markoff.html">Do We Need a New Internet?</a>
While his emphasis was largely on security, or rather the lack thereof, 
the central point Markoff makes is that the Internet may be so hopelessly broken that it could be better to start over,
rather than continue to apply band-aids.
As if to emphasize this point, 
<a href="http://www.supro.cz/">SuproNet</a>, a local Czech provider,
single-handedly caused a global Internet meltdown for upwards of an hour today.
SuproNet accomplished this feat by sending out a rather unusual routing update,
one which a lot of routers did not handle very well.
The result was Internet bedlam.
</p>]]>
        <![CDATA[<p><b>Some Preliminaries</b></p>

<p>
Routing on the Internet is strictly a cooperative affair.
Neighboring routers tell each other what they know and that information ultimately propagates globally.
Eventually everyone figures out how to reach everyone else. 
And what routers know are prefixes, i.e., blocks of IP addresses,
that are routed in the same way.
Since there is often more than one way to reach any given prefix,
routing announcements include various attributes so that everyone can decide on their preferred path to each prefix. 
One such attribute is the Autonomous System (AS) path, 
i.e., the list of organizations that have to be traversed to reach the prefix.
For example,
I'm typing this blog from my home Verizon DSL connection.
If I wanted to reach an IP address in Qatar served by  
<a href="http://www.qtel.com.qa/">Qtel</a>,
Verizon might hand that traffic off to Tata/Teleglobe (AS 6543), 
which in turn hands it off to Qtel (AS 8781).
The prefix in question and its associated AS path are depicted graphically below.
<br><br>
<a href="http://www.renesys.com/blog/2009/02/ASpath.jpg" 
onclick="window.open('http://www.renesys.com/blog/2009/02/ASpath.jpg','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2009/02/ASpath.jpg" width=690>
</a>
<br><br>
AS path length is one important factor in route selection, 
with shorter paths favored over longer ones.
Suppose that Qtel wanted to used Tata/Teleglobe as a backup provider for this prefix, 
only to be used when other alternatives failed.
They could effect this by making the announced path artificially long.
Instead of
<ul>
   <li> 701 6453 8781 </li>
</ul>
we might see paths like
<p></p>
<ul>
   <li> 701 6453 8781 8781 8781 </li>
</ul>
In this example, 
Qtel would have prepended its own AS to the path several times so that this particular route to this particular prefix would tend not to be selected by others.
</p>

<p></p>
<p>
Now the average path length on the Internet is only around 4.
That is, we are all fairly close to one another.
So if I make any path seem just a little bit longer, one or two ASes, 
it generally will not get selected and will accomplish the objective of being the path of last resort.
Nothing stops you from prepending your own AS a dozen or even a hundred times, 
but it is not going to accomplish anything and will only pointlessly consume everyone else's router memory.
It's also an indication that you don't know what you are doing.
Which brings us to a central problem, 
you don't need a driver's license on the Information Superhighway.
</p>

<p><b>Bedlam on the Internet</b></p>

<p>
Now suppose you just got your Internet learner's permit yesterday and you really don't want your backup provider being used unless your main provider is down.
You could prepend your AS a few times in the route announcements you make to your backup provider and that would do the trick,
but to make <em>really sure</em> you go for a few hundred instead.
In a perfect Internet,
that wouldn't matter, 
but we don't have one of those.
What we think happened next is the Internet equivalent of a massive
<a href="http://en.wikipedia.org/wiki/Buffer_overflow">buffer overflow</a>.
While most of the core routers run by major ISPs fared just fine, 
processing the ridiculous path and sending it on,
others choked.
Perhaps they weren't as well maintained or were running buggy software.
These routers viewed the update as malformed and so tore down their session with whoever sent them the update.
In other words,
two routers that were happily exchanging traffic with each other just moments before suddenly stopped all communication.
Traffic was lost, alternative paths were explored, and maybe the former cooperating routers recovered and re-established contact.
Multiply this by thousands of routers around the world and you can begin to appreciate the ensuing pandemonium.
At Renesys, we experienced an almost 100-fold increase in the rate of routing updates from our worldwide array of sensors
<br><br>
<a href="http://www.renesys.com/blog/2009/02/updates_rate.60.png" 
onclick="window.open('http://www.renesys.com/blog/2009/02/update_rate.60.png','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2009/02/updates_rate.60.png">
</a>
<br><br>
</p>

<p><b>The Details</b></p>

<p>
SuproNet (AS 47868) normally announces a single prefix, 94.125.216.0/21, 
to a single provider, CD-Telematika (AS 25512).
On February 16<sup>th</sup> at 16:23:30 UTC,
we saw this same prefix via a different provider, 
Sloane Park Property Trust (AS 29113),
but with an AS path exceeding 255 ASNs.
Such messages continued for almost exactly one hour or until 17:23:00 UTC.
We observed Level 3 (AS 3356), Tiscali (AS 3257) and TeliaSonera (AS 1299) propagating most of these routes globally,
with a total of 230 unique ASes ultimately sending us the problematic announcements.
</p>

<p>
This single Czech provider announcing a single prefix caused a huge increase in the global rate of updates,
peaking at 107,780 updates <em>per-second</em>.
This peak occurred at 16:30:54 UTC, 
less than 8 minutes after the first announcement.
</p>

<p>
At Renesys,
we call a prefix <em>impacted</em> in a given hour if either suffers an outage or has a non-trivial amount of instability.
In the hour before this event,
there were 1215 impacted prefixes globally out of a total of 271,175.
During the event, that number surged to 12,920 or 4.8% of all prefixes on earth.
One announcement from one provider and we have a 10-fold increase in planetary routing instability for an hour.
North America suffered the most, increasing from 0.35% to 4.76%,
while South America suffered the least, increasing from 0.52% to 1.75%.
</p>

<p><b>Mapping the Damage</b></p>

<p>
It's always the middle of the night somewhere, 
a time when ISPs perform their maintenance.  
And bad weather and backhoes roam the earth.
So routing instability on the Internet is always present,
as networks come and go.
The following map shows instability levels by country in the hour before this event starting at 15:00  UTC and computed as a percentage of all prefixes geo-locating to the country.
</p>

<p>
<table>
<tr>
<td>
<a href="http://www.renesys.com/blog/2009/02/before.png" 
onclick="window.open('http://www.renesys.com/blog/2009/02/before.png','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">
<img src="http://www.renesys.com/blog/2009/02/before.png">
</a>
</td>
</tr>
<td align="center"><em>Global Instability by Country - Before</em>
</td>
</tr>
</table>
</p>

<p></p><p></p>

<p>
The next map show instability levels by country during the hour of the event,
starting at 16:00 UTC.  Can you spot the outdated routers?
<p>
<table>
<tr>
<td>
<a href="http://www.renesys.com/blog/2009/02/after.png" 
onclick="window.open('http://www.renesys.com/blog/2009/02/after.png','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">
<img src="http://www.renesys.com/blog/2009/02/after.png">
</a>
</td>
</tr>
<td align="center"><em>Global Instability by Country - During</em>
</td>
</tr>
</table>
</p>

<p></p>
<p><b>Now what?</b></p>

<p>
We were heartened to see that most of Internet's core survived a single odd announcement,
but this does speak to a lot of outdated equipment or software at the edge.
And if you manage to get all of edge routers to reset,
you aren't going to have many people to talk to no matter what the core is doing.
While it might be tempting to bash SuproNet,
can anyone really defend a system where a failure in probably one of the weaker links can cause the entire system to unravel?
Maybe we really do need a new Internet and for more reasons than better security.
The next one needs to come with an operating permit too.
</p>]]>
    </content>
</entry>

<entry>
    <title>&quot;The Adventurous Parts of the Internet&quot;</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2009/01/the-adventurous-parts-of-the-i.shtml" />
    <id>tag:www.renesys.com,2009:/blog//1.148</id>

    <published>2009-01-28T16:40:26Z</published>
    <updated>2009-02-19T11:47:27Z</updated>

    <summary> I just spent a very pleasant 3 days attending NANOG 45 in the Dominican Republic. The whole thing was a whirlwind of peering, technical presentations, and catching up with the people who keep the North American parts of the...</summary>
    <author>
        <name>James Cowie</name>
        
    </author>
    
        <category term="Economics" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Society" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[
I just spent a very pleasant 3 days attending <a href="http://nanog.org/meetings/nanog45/index.php">NANOG 45</a> in the Dominican Republic.   The whole thing was a whirlwind of peering, technical presentations, and catching up with the people who keep the North American parts of the internet backbone alive.   What can I say? The DR is overflowing with friendly people, great food, warm breezes (82F in Santo Domingo, versus 0F at my house in New Hampshire), and very decent Presidente beer.   Very conducive to thinking the big thoughts.  The trick is to write them down ...<p>]]>
        <![CDATA[On Monday, I shared about thirty slides describing some of the research I've been doing over the last few months, trying to come up with a straightforward, objective way of rating network service providers based on the behavior (or misbehavior) of their on-net customers.   <a href="http://www.renesys.com/tech/presentations/pdf/nanog45-cowie.pdf">The presentation is here if you'd like to read it.</a>  </p><p> <a href="http://www.renesys.com/tech/presentations/pdf/nanog45-cowie.pdf"><img align="right" width="480" height="384" border="1" src="http://www.renesys.com/img/tech/s18.png"/></a>
I'll give you the gist of it, though.  The vast majority of the internet's "control traffic"  (the BGP protocol messages that keep every router informed about how to reach the 250,000 or so individual networks on earth) is actually generated by, or on behalf of, a <em>really small</em> number of networks.  How small?  On any given day, perhaps 1% of the world's networks generate, directly, or indirectly, 50% to 70% of the total traffic being exchanged by routers all over the world.   If you're a glass-half-full kind of network engineer, that means that 99% of the world's networks are good citizens &mdash; exemplary stability, quietly hanging out in the routing table, very rarely changed, going about their business and  contributing very little to the traffic load that the world's routers have to process.  </p><p>
I'm not one of those glass-half-full guys!   I wanted to find out who the worst-behaved 1% were, and more to the point, who was making money by selling them transit without keeping an eye on the mess they were generating.  </p> <p>
Pretty straightforward, I thought: there's a tragedy-of-the-commons thing going on here, and people who provide transit to unstable networks are propagating their "table pollution" beyond their borders, around the world, making us all pay more to keep the Internet alive by their carelessness. </p><p>
   Perhaps, I suggested innocently, customers would seek out transit providers with better stability report cards, and <em>offer them more money</em>, because stability of the neighborhood could be a source of positive differentiation in an ever-more-cutthroat, low-margin industry.  Perhaps, the audience responded realistically,  transit providers could instead <em>charge unstable downstream customers more</em>, to penalize them for their tendency to flap their routes incessantly (the internet engineering equivalent of blasting your stereo at a red light with the top down).   We were in basic agreement, though: someone should be pricing and paying for this pollution.  (Overcoming the industry's history of abject failure at billing for anything more subtle than commodity megabits is a discussion best left for another day.)   </p><p>
As usual, when you put a big idea in front of a bunch of intelligent people, it got more and more interesting.   By the time I sat down, I had a few new observations.</p><p> 
First of all, the technical voices in the operator community understand these issues quite well.  They know firsthand about CPU loads, traffic trends, and how hard it is to keep your network one step ahead of resource constraints as the Internet continues to grow like a weed.    Since every large provider serves a broad range of customers,  pretty much everyone has at least a few dozen autonomous systems downstream who exhibit serious instability, and don't clean it up, or even know that they're making a mess.    But because the offending parties may be two or three signed contracts downstream (customers of customers of customers...), the sales and business development groups at big providers   have zero incentive to take an interest.   In this economy, if it doesn't turn up big new revenue, or melt the network outright, it's off the radar. </p><p>
The second observation was that instability is really only one of three big routing "externalities" that people inflict on the planet's infrastructure without significant penalty, the  others being deaggregation (littering the routing table with redundant more-specific fragments, rather than advertising a few larger tidy networks) and failure to maintain clean, up-to-date, accurate entries in the routing registries.    Being unstable chews up other people's CPU cycles all over the world.  Wanton deaggregation chews up other people's scarce router memory resources all over the world.  Failing to advertise your routing intentions, and stick to them, makes it nearly impossible (barring some future <a href="http://www.merit.edu/mail.archives/nanog/msg14366.html">cryptographic miracle in global coordination</a>) to decide whether a given routing advertisement is legitimate or suspicious, when you hear it on the other side of the world.   </p><p>
But these are all regarded as venial sins, not even worthy of confession.   Kind of like recharging your car's air conditioner with CFCs, or burning all your kitchen garbage out behind your house in an oil drum.  </p> <p>
The final observation was one that caused me to moderate my thirst for backbone street justice, at least a little bit.  Many of the providers who I gave poor instability scores to had something in common: they serve what I call the "adventurous parts of the internet" &mdash; regions of the planet that are underserved by internet transit infrastructure because of their political instability, their poverty, their unfortunate accidents of geography or choice of neighbors, or all three.  </p><p>
What I found was that unstable networks tend, more often than not, to be in places like Mexico, or Armenia, or Iran, or Vietnam, or Georgia, or Egypt, or Cambodia.   It's not that all providers in these places are unstable &mdash; far from it &mdash; but substantial populations are served by providers who just can't seem to keep their routes under control.   Often these are the old national incumbent telephone companies, trying to bring people into the 21st century, hobbled by a regulatory environment that doesn't discover efficient solutions. </p><p>  By constructing a metric that ranks providers according to the aggregate stability of their customer base, wasn't I just penalizing carriers who chose to serve the adventurous parts of the internet?   Isn't there a "positive externality" being created by these providers, in that they bring the internet within reach of everyone on earth? </p><p>
I took a question from an audience member who suggested, partly tongue-in-cheek, that the instability we had documented was a sign that fewer people should be "speaking BGP" at the edge of the internet.   That is, that these unstable edge providers should simply pick the single best provider they can afford, put all their eggs in that basket, and delegate the difficult challenges of maintaining a stable Internet presence to that one provider (be it Sprint, or Tata Teleglobe, or Verizon Business).   </p><p>
To be sure, that's a conclusion you could draw from the data.  But I have the feeling that in the long run, the benefits of keeping the adventurous parts of the internet well-connected via diverse, redundant transit will outweigh the extra noise they generate.   I hope that we can use instability scoring as a tool to identify providers who need a little extra help, a few words of advice from the global networking community, maybe even an invitation to participate in global community events like NANOG, so that they, too, can find and fix their internet pollution problems.]]>
    </content>
</entry>

<entry>
    <title>Internet Year in Review 2008</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2008/12/internet-year-in-review-2008.shtml" />
    <id>tag:www.renesys.com,2008:/blog//1.144</id>

    <published>2008-12-29T13:15:00Z</published>
    <updated>2009-02-01T13:34:15Z</updated>

    <summary> It&apos;s been an interesting year in many ways, not least of which for the Internet. This year, I started to contribute in earnest to the Renesys blog and back in January I was wondering &quot;How am I going to...</summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Internet" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[ <p>
It's been an interesting year in many ways, 
not least of which for the Internet.
This year, I started to contribute in earnest to the Renesys blog and back in January I was wondering 
"How am I going to find anything interesting to talk about on a regular basis?
Nothing much happens on the Internet, right?"
Well, it certainly did this year and now I've got many more ideas than I have time to research and write about.
In hindsight, I guess it isn't too surprising.
As the world becomes more interconnected and more Internet-dependent,
we're bound to bump into each other more and expose the limitations of the current system.
So let's review what 2008 brought us and take a guess at what is in store for the new year.
</p>]]>
        <![CDATA[<p><b>January</b></p>

<table align=left>
<tr>
<td align=left>
<a href="http://www.renesys.com/blog/2008/01/15th-century-routing.shtml">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/earl-small.jpg" hspace="10" width="110" height="119"/>
</a>
</td>
</tr>
</table>

<p>
The year started quietly enough and here in New Hampshire,
we were largely trying to 
<a href="http://www.renesys.com/blog/2008/01/15th-century-routing.shtml">stay warm</a>.
Then the 
<a href="http://www.renesys.com/blog/2008/01/mediterranean-cable-break.shtml">Mediterranean cables</a> starting snapping and suddenly things got much more interesting.
Entire countries and
<a href="http://www.renesys.com/blog/2008/01/mediterranean-cable-break-part-1.shtml">providers</a>
were cut off from the Internet and there was a 
<a href="http://www.renesys.com/blog/2008/02/mediterranean-cable-break-part.shtml">mad scramble</a> to restore connectivity to the area.
As always, there were
<a href="http://www.renesys.com/blog/2008/02/mediterranean-cable-break-part-3.shtml">winners and losers</a>,
but the 
<a href="http://www.renesys.com/blog/2008/02/cable-breaks-lessons-learned-1.shtml">main lessons</a>
countries and businesses in the area learned were:
"stuff happens", don't put all your eggs in one basket, and you get what you pay for.
Of course, these cliches are not very sexy, 
so the conspiracy theorists,
apparently having never seen bad weather or a boat anchor,
were only too quick to claim that all these cable breaks were a 
<a href="http://www.renesys.com/blog/2008/02/attention-iran-is-not-disconne-1.shtml">prelude to war</a>, 
only they weren't.
No new wars were started in January, the cables were ultimately repaired, and a lot of folks in the region started planning for failure by building some redundancy.
</p>

<p><b>February</b></p>

<p>
Just after the Mediterranean cables were repaired and we were starting to wonder if
it would <em>ever</em> stop snowing,
Pakistan had another
<a href="http://www.news.com.au/couriermail/story/0,23739,22516110-10389,00.html">bad hair day</a> and their local incumbent (Pakistan Telecom) inadvertently 
<a href="http://www.renesys.com/blog/2008/02/pakistan-hijacks-youtube-1.shtml">knocked YouTube off the global Internet</a> for about 2 hours.
This incident got a lot of 
<a href="http://www.nytimes.com/2008/02/26/technology/26tube.html?_r=2&scp=1&sq=renesys&st=nyt">press attention</a>,
and there was considerable hand-wringing about this grave <em>new</em> threat to the Internet,
despite the fact that such incidents have been taking place for over a decade and are both inevitable and unavoidable in a trust-based system with no central authority.
So unfortunately, no one <em>fixed</em> the Internet in 2008.
</p>

<p><b>March</b></p>

<p>
It's <em>still snowing</em> and the locals are starting to come a bit unglued.
New Englanders are buying seeds and starting to walk around in shorts in a vain attempt to defy the weather gods.
And our cooperative trust-based Internet dramatically showed another shortcoming when 
<a href="http://www.renesys.com/blog/2008/03/you-cant-get-there-from-here-1.shtml">TeliaSonera and Cogent depeered</a>,
partitioning the Internet for those dependent on one or the other of these providers.
To name just two of the many victims,
this event affected both Martha Stewart's corporate headquarters, single-homed behind Cogent, and Blizzard Entertainment's
<a href="http://en.wikipedia.org/wiki/World_of_Warcraft">World of Warcraft</a> servers,
single-homed behind
<a href="http://www.datacenterknowledge.com/archives/2008/03/20/peering-dispute-disrupts-world-of-warcraft/">TeliaSonera</a>.
In other words, the Internet was broken in lots of little ways:  Martha wouldn't have been able to play a computer game, Europeans might not have been able to reach Cogent-hosted content, and so forth.
But then after two weeks,
this particular episode of Internet chicken ended when the two carriers
<a href="http://www.renesys.com/blog/2008/03/telia-and-cogent-kiss-and-make-1.shtml">reestablished connectivity</a>, and the Internet was once again whole.
</p>

<p><b>April</b></p>

<table align=right>
<tr>
<td align=right>
<a href="http://www.renesys.com/blog/2008/04/the-day-the-youtube-died-video.shtml">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/todd.jpg" hspace="10" width="109" height="80" />
</a>
</td>
</tr>
</table>

<p>
<a href="http://en.wikipedia.org/wiki/Mud_season">Mud season</a>.
The snow is melting fast now and those quaint dirt roads we are famous for are turning to rivers of mud.
It was a slow Internet month,
but thankfully the start of Todd Underwood's
<a href="http://www.renesys.com/blog/2008/04/the-day-the-youtube-died-video.shtml">musical career</a>
gave us something to write about.
Mercifully, both the season and the career were short-lived. 
</p>

<p><b>May</b></p>

<p>
The ice is breaking up on the lakes and 
<a href="http://joespondvermont.com/iceout.html">Joe's pond</a>
had their official iceout at the end of April.
And while that happens every year, 
we did report something truly bizarre on the Internet,
brought to our attention by
<a href="http://www.icann.org/">ICANN</a>.
ICANN, responsible for one of the 13 DNS root name servers, had recently changed the IP address of the server under its stewardship, a switch undoubtedly missed by most of humanity.
Then
<a href="http://www.renesys.com/blog/2008/05/identity-theft-hits-the-root-n-1.shtml">bogus root name servers</a>
started appearing on the old IP and were happily providing answers for up to six months before anyone noticed.
Although we are far from certain, 
it appears that no harm was done in this case,
but the <a href="http://www.renesys.com/blog/2008/06/securing-the-root-1.shtml">potential for mischief</a> was considerable.
Another contest of egos?
As <a href="http://en.wikipedia.org/wiki/Rodney_King">Rodney King</a> once said
<a href="http://www.mediate.com/articles/noll9.cfm">"Why can't we all just get along?"</a>
</a>
</p>

<p><b>June</b></p>

<table align=left>
<tr>
<td align=left>
<a href="http://en.wikipedia.org/wiki/Image:Black_fly.jpg">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/blackfly.jpg" hspace="10" width="125" height="91"/>
</a>
</td>
</tr>
</table>

<p>
All the traces of snow and ice are now gone, 
so it's time for 
<a href="http://en.wikipedia.org/wiki/Black_fly">black flies</a> to make their annual appearance.
These pests feed on human blood and like to rip out chunks of your flesh near a hairline, 
which you notice only after blood is streaming down your face.
Having the ground covered up for the half the year suddenly doesn't seem so bad.
We're still wondering what exactly happened with the ICANN root name server,
but unfortunately have
<a href="http://www.renesys.com/blog/2008/06/root-reprise-more-questions-th-1.shtml">more questions than answers</a>.
At least I start to monitor all of the root name servers from my 
<a href="http://renesys.com/products_services/routing_intelligence/">Renesys Routing Intelligence</a> account,
as I might need new blog material one day!
Besides, someone should be watching over our critical infrastructure and it might as well be me.
The year isn't even half over and we've seen a pretty incredible assortment of major headline-grabbing events, all pointing to serious flaws in how the Internet actually works.
On the business front,
<a href="http://www.renesys.com/blog/2008/06/cogent-becomes-transitfree.shtml">Cogent became transit-free</a> this month,
meaning all the big boys really do need to peer with them to prevent a partitioning of the Internet, like it or not.
</p>

<p><b>July</b></p>

<p>
Ahh, summer.  We get exactly one month of summer in New Hampshire and we aren't going to spend it writing blogs.
Besides, not much happened.
Oh, wait, there was Dan Kaminsky's revelation of a 
<a href="<a href="http://www.kb.cert.org/vuls/id/800113">DNS implementation vulnerability</a> so bad that DNS servers all over the planet were trivially subject to DNS cache poisoning.
Despite the frantic vendor response and software upgrades, 
the vulnerability is not resolved, nor resolvable given the current domain name system.
Maybe we can finally get widespread interest in 
<a href="http://www.networkworld.com/news/2008/112508-dns-root.html">DNSSEC</a>.
Otherwise, it was a quiet month and we worked on our tans.
</p>

<p><b>August</b></p>

<p>
Our  long "summer" is <em>finally</em> winding down and by the end of the month we could come close to the freezing mark again.
While no one was looking,
Tony Kapela told the Defcon 16 crowd how to 
<a href="http://safecomputing.umich.edu/events/sumit08/docs/kapela-umich-summit-final.ppt">steal the Internet</a>,
a technique for transparently intercepting anyone's incoming Internet traffic before eventually passing it on to the rightful owner,
and doing so in a way that can be largely invisible to the victim.
As if that wasn't bad enough, Russia also invaded Georgia.
We expected this to have a rather negative impact on
<a href="http://www.renesys.com/blog/2008/08/georgia-clings-to-the-net.shtml">
Georgian Internet connectivity</a>.
Only it didn't.
Sure there were website defacements and some DoS attacks, 
but that pretty much happens every day in every country.
Despite 
<a href="http://www.renesys.com/blog/2008/08/georgia-on-my-mind-1.shtml">constant calls from the media</a>,
we really couldn't find any changes to Georgia's Internet presence or any significant or prolonged losses of connectivity.
Georgia's meager Internet infrastructure simply wasn't blown up.
</p>

<p><b>September</b></p>

<table align=right>
<tr>
<td align=right>
<a href="http://www.renesys.com/blog/2008/09/ike-brings-biggest-multistate.shtml">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/ike.png" hspace="10" width="125" height="79" />
</a>
</td>
</tr>
</table>

<p>
The days are getting noticeably shorter and much cooler.  
Thankfully the bugs will all soon be dead, 
as this was a particularly bad year for winged pests.
But as New England cools, 
the Gulf coast is just heating up with one hurricane after another.
Gustav started out looking like a replay of 
<a href="http://www.renesys.com/blog/2007/01/the-shape-of-disaster-on-the-n.shtml">Katrina</a>, 
but in the end, 
Louisiana wasn't hit as hard and was
<a href="http://www.renesys.com/blog/2008/09/isps-learn-from-katrina-surviv.shtml">much better prepared</a> for Katrina's little brother.
The real loser in the hurricane lottery for 2008 was Texas,
which was 
<a href="http://www.renesys.com/blog/2008/09/ike-hammers-texas-internet.shtml">hit very hard by Ike</a>.
After rolling through Texas,
Ike still packed enough punch afterward to cause
<a href="http://www.renesys.com/blog/2008/09/ike-brings-biggest-multistate.shtml">extensive Internet outages throughout the US</a>.
We ended the destructive month with a good old-fashioned
<a href="http://www.youtube.com/watch?v=Erthun0Pauc">public stoning</a>:
the notorious 
<a href="http://www.renesys.com/blog/2008/09/internet-vigilantism.shtml">Intercage was knocked off the Internet</a> by their irate providers,
only to briefly rise from the dead before being taken down for good.
</p>

<p><b>October</b></p>

<p>
It's 
<a href="http://www.renesys.com/blog/2008/10/wrestling-with-the-zombie-spri.shtml">leaf peeper</a> time and while the bugs might be dead, a new scourge has arrived on the land: tour buses.
As if these flatlanders weren't entertaining enough, 
<a href="http://www.renesys.com/blog/2008/10/wrestling-with-the-zombie-spri.shtml">Sprint depeered Cogent</a> at the end of the month,
resulting in a partitioned Internet for the second time this year and impacting a few thousand customers of each carrier.
Perhaps Sprint upset the <em>wrong</em> customer,
since the peering link was quickly restored 
<a href="http://www.renesys.com/blog/2008/11/sprint-and-cogent-repeerfor-no.shtml">over the weekend</a>, 
when upper management and their lawyers were unlikely to be working.
I can't help but wonder where the phone call came from and to whom in Sprint's organization, 
but Sprint clearly didn't hesitate long to blink.
I guess <em>some</em> customers are always right.
Also this month,
<a href="http://www.outpost24.com/news/news-2008-10-02.html">Outpost24</a>
claims to have discovered a way to trivially DoS anyone by exploiting some TCP vulnerabilities,
but details remain 
<a href="https://www.cert.fi/haavoittuvuudet/2008/tcp-vulnerabilities.html">murky</a>.
</p>

<p><b>November</b></p>

<p>
No leaves, No snow, No-vember.
At least the tourists are gone and the growing darkness gives us time for reflection.
Even if you ignore all of the security concerns,
the Internet is still heading for a
<a href="http://www.renesys.com/blog/2008/11/its-the-end-of-the-internet-as.shtml">train wreck</a>.
We're running out of usable 
<a href="http://www.renesys.com/blog/2008/11/for-sale-clean-lightly-used-ip.shtml">IP space</a>
and
<a href="http://www.renesys.com/blog/2008/11/will-work-for-bandwidth.shtml">the providers are all going broke</a>.
If you follow any of the links in this blog,
it should be the preceding three for Todd Underwood's insightful comments on the industry and difficulties that lie ahead.
After reading these, you won't be surprised when the Internet doesn't work.  
You'll be thankful every day that it does.
Speaking of which,
we almost had another Internet-sized blowout when 
Companhia de Telecomunicacoes do Brasil Central
<a href="http://www.renesys.com/blog/2008/11/brazil-leak-if-a-tree-falls-in.shtml">leaked a "full table"</a>.
Had this spread,
Brazil would have ended up on the receiving end of much of the traffic on the global Internet, 
which would not have been a good thing for anyone.
As it turned out, 
we dodged the bullet this time and the damage was very contained.
And the month saw another public flogging of an Internet bad boy;
this time it was 
<a href="http://www.washingtonpost.com/wp-dyn/content/article/2008/11/12/AR2008111200658.html">McColo's turn</a>.
</p>

<p><b>December</b></p>

<table align=left>
<tr>
<td align=left>
<a href="http://www.flickr.com/photos/psnh/sets/72157611221689904/">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/broken-pole.jpg" hspace="10" width="83" height="125" />
</a>
</td>
</tr>
</table>

<p>
Darkness.  The earliest sunset is now at 4:12pm, but when it's cloudy, it's more like 3:12pm or even earlier as the sun doesn't get very high in the sky now.  
We spend way too much time indoors, looking at computer screens.
This problem is partially solved for us when a 
<a href="http://www.chron.com/disp/story.mpl/ap/nation/6162682.html">ferocious ice storm</a>
knocks out power to more than half of New Hampshire.
Three Renesys employees were without power over a week later.
<a href="http://www.renesys.com/about/management.shtml">Jim Cowie</a>, 
was one of them, but he refused to despair over his own homelessness,
not to mention the growing global economic gloom,
and wrote a 
<a href="http://www.renesys.com/blog/2008/12/fiber-to-the-home-ideal-econom.shtml">hopeful and interesting blog</a> about a possible economic stimulus that does <em>not</em> involve bailing out dying industries.
Since the year was drawing to a close,
those of us with power also took the opportunity to review how the top global Internet service providers
<a href="http://www.renesys.com/blog/2008/12/winners-and-losers-for-2008.shtml">fared in 2008</a>.
In short, if you figured out that more than half the planet lives in underserved Asia and built your business plan around that part of the world,
you probably did alright, at least in terms of gaining market share.
(And we were happy to see that most of the winners were also users of Renesys' 
<a href="http://www.renesys.com/products_services/market_intel">Market Intelligence</a>
product.)
Finally, it was
<a href="http://www.renesys.com/blog/2008/12/deja-vu-all-over-again-cables.shtml">Deja Vu All Over Again</a> when more Mediterranean cables broke to provide a nice symmetry to the year.
</p>

<p><b>The Year Ahead</b></p>

<table align=left>
<tr>
<td align=left>
<a href="http://www.renesys.com/blog/2008-year-end/earl-mailbox.jpg">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/earl-mailbox.jpg" hspace="10" width="67" height="152" />
</a>
</td>
</tr>
</table>

<table align=right>
<tr>
<td align=right>
<a href="http://www.renesys.com/blog/2008-year-end/earl-skis.jpg">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/earl-skis.jpg" hspace="10" width="175" height="62" />
</a>
</td>
</tr>
<tr>
<td align=right>
<a href="http://www.renesys.com/blog/2008-year-end/firefighters.jpg">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008-year-end/firefighters.jpg" hspace="10" width="175" height="83" />
</a>
</td>
</tr>
</table>

<p>
I predict that New England will have a banner ski season this winter.
Although it is only December as I write this blog, 
I've observed three leading indicators:
I can reach my mailbox from above,
the picnic table in my backyard is a ski hazard,
and the local firefighters have been observed patrolling the neighborhood looking for lost fire hydrants.
In other words,
we've had a lot of snow for this early in the season.
</p>

<p>
As far as the Internet goes, 
it's a different story.
If you managed to get through all of the above and maintain your sense of optimism, 
I applaud you.
It's really not a very pretty picture, nor am I able to find many silver linings.
So what will 2009 bring for the Internet?
It's probably pretty safe to say that things are going to get worse before they get better.
With the world economy in turmoil,
don't expect your service to improve.
And with all the incentive for ill-gotten gain in a deteriorating economic situation, 
don't expect much relief on the security front either.
Major new vulnerabilities will continue to surface and older ones will be more fully exploited, 
as the inherent trust-based model of the Internet continues to crumble.
Nation-states will beef up their cyber-warfare capabilities.
And I wouldn't be surprised to see a major telco declare bankruptcy and at least another major de-peering event.
In other words, 
I don't expect 2009 to look much different than 2008.
To get out of this mess,
we're going to need some global leadership and a game-changing shift in both the 
<a href="http://www.renesys.com/blog/2008/12/fiber-to-the-home-ideal-econom.shtml">economic</a> and
<a href="http://www.cyberinnovationcenter.org/NewsEvents/News/tabid/72/newsid677/169/Default.aspx">security models</a> surrounding the Internet.
If things get bad enough, 
maybe we can at least start a serious discussion around that in 2009.
</p>]]>
    </content>
</entry>

<entry>
    <title>Deja Vu All Over Again: Cables Cut in the Mediterranean</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2008/12/deja-vu-all-over-again-cables.shtml" />
    <id>tag:www.renesys.com,2008:/blog//1.147</id>

    <published>2008-12-20T02:23:00Z</published>
    <updated>2009-02-13T16:51:47Z</updated>

    <summary> The end of the year is approaching which seems to be a harbinger of Internet disasters. Four years ago (on 24 Dec. 2004), TTNet significantly disrupted Internet traffic by leaking over 100,000 networks that were globally routed for about...</summary>
    <author>
        <name>Alin Popescu</name>
        <uri>http://www.renesys.com/blog</uri>
    </author>
    
        <category term="Engineering" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
The end of the year is approaching which seems to be a harbinger of
Internet disasters. Four years ago (on 24 Dec. 2004), <a href=http://renesys.com/tech/presentations/pdf/renesys-nanog34.pdf>TTNet</a> significantly
disrupted Internet traffic by leaking over 100,000 networks that were
globally routed for about an hour. Two years ago (on 26 Dec.
2006), large <a href=http://renesys.com/tech/presentations/pdf/nanog39.pdf>earthquakes</a> hit the Luzon Strait, south of Taiwan, severing
several underwater cables and wreaking havoc on communications in
the region. Last year there was a small delay. On 30
Jan. 2008, more underwater cables were severed in the <a href=http://renesys.com/tech/presentations/pdf/nanog42-lightning.pdf>Mediterranean</a>, severely
disrupting communications in the Middle East, Africa, and the Indian
subcontinent.
</p>

<p>
Calamity returned to its customary end-of-year schedule this year, when early today (19 Dec. 2008) several communications cables were <a href=http://www.bloomberg.com/apps/news?pid=newsarchive&sid=aBa0lTN.dcoQ>severed</a>, affecting traffic in the Middle East and Indian subcontinent. According to a <a href=http://www.orange.com/en_EN/press/press_releases/cp081219en.html>press release</a> by France Telecom three major cables were damaged:
<a href="http://en.wikipedia.org/wiki/South_East_Asia-Middle_East-Western_Europe_4">Sea-Me-We 4</a> at 7:28 UTC, <a href="http://en.wikipedia.org/wiki/SEA-ME-WE_3_(cable_system)">Sea-Me-We 3</a> at 7:33 UTC, and <a href="http://en.wikipedia.org/wiki/FLAG_Telecom#Segment_FLAG_Europe_Asia_.28FEA.29">FLAG FEA</a> at 8:06 UTC. It appears that the SMW3 cable was only partially cut, the SMW4 cable was completely cut, while the FLAG cable was "observed down" with no other information given. The location of the cut appears to be between Sicily and Tunisia in a section which is the responsibility of Egypt Telecom. The causes of the cut remained unclear. It seems that ships were deployed to repair the damaged cables, but no ETA was given.
 </p>]]>
        <![CDATA[<p>
In this preliminary analysis we considered prefixes (networks) that suffered outages between 07:00 UTC and 17:00 UTC on 19 Dec. 2008. At least 2840 prefixes suffered outages during this interval. The most seriously affected countries are shown in the map below. Darker shades of red indicate that a larger percentage of prefixes in the country suffered outages.
</p>

<p>
<a href="http://www.renesys.com/blog/2008/12/med_cable_cut2/cable_cut2_map.png" 
onclick="window.open('http://www.renesys.com/blog/2008/12/med_cable_cut2/cable_cut2_map.png','popup',scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0); return false">
<img style="border:1px solid black;" src="http://www.renesys.com/blog/2008/12/med_cable_cut2/cable_cut2_map.png" width="690">
</a>
</p>

<p>
The impact for each affected country is illustrated in the bar graphs below. Shown in the first graph is the distribution per country of the 2840 prefixes that suffered outages. In the second graph, we show the percentage of prefixes in each country that suffered outages.
</p>

<p>
<body bgcolor="#ffffff">
 <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
         codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
         width="690" height="400"
         id="Line" >
    <param name="movie" value="/fusioncharts/charts/MSColumn2D.swf" />
    <param name="FlashVars" value="&dataURL=/blog/2008/12/med_cable_cut2/outages_per_country_absolute.xml" />
    <param name="quality" value="high" />
    <embed src="/fusioncharts/charts/MSColumn2D.swf"
           flashVars="&dataURL=/blog/2008/12/med_cable_cut2/outages_per_country_absolute.xml"
           quality="high"
           width="690" height="400"
           name="graphs"
           type="application/x-shockwave-flash"
           pluginspage="http://www.macromedia.com/go/getflashplayer" />
 </object>
</body>
</p>

<p>
<body bgcolor="#ffffff">
 <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
         codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
         width="690" height="400"
         id="Line" >
    <param name="movie" value="/fusioncharts/charts/MSColumn2D.swf" />
    <param name="FlashVars" value="&dataURL=/blog/2008/12/med_cable_cut2/outages_per_country_percentage.xml" />
    <param name="quality" value="high" />
    <embed src="/fusioncharts/charts/MSColumn2D.swf"
           flashVars="&dataURL=/blog/2008/12/med_cable_cut2/outages_per_country_percentage.xml"
           quality="high"
           width="690" height="400"
           name="graphs"
           type="application/x-shockwave-flash"
           pluginspage="http://www.macromedia.com/go/getflashplayer" />
 </object>
</body>
</p>

<p> 
This event appears very similar to the late January 2008 cable breaks, with Egypt taking most of the impact: over 1400 of Egypt's globally routed prefixes suffered outages, about 81% of their prefix count. The second most impacted country was Saudi Arabia with 374 prefixes, 42% of its total. While India and Pakistan had large numbers of prefixes experiencing outage, 456 and 169 respectively, these outages represented a relatively small percentage of their globally routed prefixes: 4% and 10%. The other countries fell somewhere in between with approximately 25% of their total prefixes suffering as a result of these cable cuts.
</p>

<p>
The plot below details the total number of network outages in the above countries between 07:00 UTC and 09:00 UTC. The times when the three cables were damaged (SMW3 7:28 UTC, SMW4 7:33 UTC, and FLAG 8:06 UTC) can be clearly seen.
</p>

<p>
<body bgcolor="#ffffff">
 <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
         codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
         width="690" height="400"
         id="Line" >
    <param name="movie" value="/fusioncharts/charts/MSLine.swf" />
    <param name="FlashVars" value="&dataURL=/blog/2008/12/med_cable_cut2/outages_per_country_absolute.xml" />
    <param name="quality" value="high" />
    <embed src="/fusioncharts/charts/MSLine.swf"
           flashVars="&dataURL=/blog/2008/12/med_cable_cut2/aggregate.xml"
           quality="high"
           width="690" height="400"
           name="graphs"
           type="application/x-shockwave-flash"
           pluginspage="http://www.macromedia.com/go/getflashplayer" />
 </object>
</body>
</p>

<p>
It is not clear if <a href="http://www.renesys.com/blog/2008/02/cable-breaks-lessons-learned-1.shtml">any lessons</a> were learned from the previous cable cut event. There are at least a couple of initiatives to build new cables in the Mediterranean: one led by a <a href="http://en.wikipedia.org/wiki/I-ME-WE">consortium</a> of 9 companies, and another led by Telecom Egypt who <a href="http://www.reuters.com/article/rbssTechMediaTelecomNews/idUSL317680120080131">signed a contract with Alcatel-Lucent</a> to build the cable. The new cables however seem designed only for increased capacity rather than for redundancy, as they appear to follow the same geographic route as the current cables: from Alexandria, Egypt, going by Malta, south of Sicily and eventually landing in Marseille, France. If that is the case, the new cables will likely be susceptible to the same problems as the current ones. It is interesting to notice that both in January and now all the cables in the region were damaged at the same time.
It is possible that building new cables to follow a different geographic route is either impractical or too expensive. The geography is what it is and there is not much one can do about that. The providers and consumers in the region will, however, have to accept the risk that occasional outages like these can and will happen.
</p>

<p>
For now we can only hope that the repairs will be quicker than in the past, although with the holiday season in Western Europe coming up it may not be very realistic to expect it. We will come back with updates as the situation develops.
</p>]]>
    </content>
</entry>

<entry>
    <title>Rising to the Top:  A Baker&apos;s Dozen</title>
    <link rel="alternate" type="text/html" href="http://www.renesys.com/blog/2008/12/winners-and-losers-for-2008.shtml" />
    <id>tag:www.renesys.com,2008:/blog//1.146</id>

    <published>2008-12-17T20:00:00Z</published>
    <updated>2009-01-10T12:45:39Z</updated>

    <summary><![CDATA[ As readers of this blog will know, Renesys collects Internet routing data &mdash; a lot of it. We use this data in a variety of ways: in determining the impact of cable breaks, natural disasters and deliberate partitionings; in...]]></summary>
    <author>
        <name>Earl Zmijewski</name>
        <uri>http://www.renesys.com/blog/</uri>
    </author>
    
        <category term="Business" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://www.renesys.com/blog/">
        <![CDATA[<p>
As readers of this blog will know, 
Renesys collects Internet routing data &mdash; a lot of it.
We use this data in a variety of ways:
in determining the impact of 
<a href="http://www.renesys.com/blog/2008/01/mediterranean-cable-break.shtml">cable breaks</a>,
<a href="http://www.renesys.com/blog/2008/09/ike-brings-biggest-multistate.shtml">natural disasters</a> and
<a href="http://www.renesys.com/blog/2008/10/wrestling-with-the-zombie-spri.shtml">deliberate partitionings</a>;
in uncovering the source of
<a href="http://www.renesys.com/blog/2008/02/pakistan-hijacks-youtube-1.shtml">hijacks</a> or other
<a href="http://www.renesys.com/blog/2008/05/identity-theft-hits-the-root-n-1.shtml">questionable activity</a>;
in analyzing
<a href="http://www.renesys.com/products_services/market_intel">Internet business relationships</a>;
and in exploring
<a href="http://www.renesys.com/blog/2008/08/internet-matchmaking.shtml">"what-if"</a> scenarios.
</p>

<p>
All of our reports and products are based on hard facts and objective analysis.
Perhaps the only controversial thing we do with our data is to <em>rank</em> all the service providers in the world: globally, by geography, and by market segment.  
The rankings are a rather crude measure of <em>size</em>, 
as they are based entirely on the quantity of IP space ultimately transited by each provider.
Although there are obvious shortcomings in this approach, 
it is certainly objective and the process is fully automated.
It also happens to be derived from data that is readily available for all providers.
Routing data, unlike most other metrics we could consider using, is inherently public.
</p>

<p>
While everyone wants to be #1 (hence the controversy around rankings),
<em>changes</em> in rank can be far more revealing than the actual rank itself.
In other words,
while there are surely big differences between #1 and #50 in our rankings,
the differences between #5 and #6 are much less clear given the nature of the metric.
What we tend to look for are abrupt changes and long-term trends.
Did a provider just jump in the rankings? 
Maybe they picked up a large customer or a nearby rival lost one?  Who was it?
Is another provider showing steady gains in the rankings?
Maybe they are consistently taking market share with an aggressive, well-executed business plan in a particular part of the world?
This is why changes in rankings matter:
they capture some of the dynamics of the business of providing Internet service.
With this in mind, 
we will take a look at the top 13 providers in the world for 2008 and
how they have jockeyed for position throughout the year.
We will also highlight some of the more interesting changes.
</p>
]]>
        <![CDATA[<p>
To rank providers,
Renesys starts out by determining the business relationships between adjacent Autonomous Systems (ASes), 
as seen via BGP announcements from our over 300 peers.
This is a complex, computation-intensive task,
consisting of processing over 10 million AS paths and 100,000 unique AS pairs that make up these paths.
By combining the computed business relationships with IP prefix geo-location information,
we can determine exactly which prefixes are ultimately transited by a given provider in a given location.
Rankings are then based on the scores we assign to the relevant prefixes,
scores which depend on prefix size and date of assignment.
(More details can be found 
<a href="http://www.renesys.com/tech/presentations/pdf/menog2.pdf">here</a>.)
In this blog, we'll concentrate on the top 13 providers according to our global rankings and how their score changed over 2008.
Since the exact numeric score is not that meaningful in this context, 
we will not provide the scale in our graphs.
Rather, we will show how the scores changed over the year and consider some of the reasons behind these changes.
</p>

<p>
<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/ScrollLine2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog/2008-winners-losers/top-13.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/ScrollLine2D.swf"
             flashVars="&dataURL=/blog/2008-winners-losers/top-13.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>
</p>

<p>
The above graph plots global scores over time for the top 13 providers.  
While this might not look very informative,
if you squint carefully enough, 
you can see three distinct clusters of providers,
namely, the top two, the middle six, and the final five,
each operating within roughly the same range of scores over time.
We will consider each cluster in turn,
enlarging the scale so we can more clearly see the changes,
and then making note of some of the more interesting ones.
</p>

<p>
<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/ScrollLine2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog/2008-winners-losers/top-tier.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/ScrollLine2D.swf"
             flashVars="&dataURL=/blog/2008-winners-losers/top-tier.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>
</p>

<p>
In 2006, when Renesys first started ranking providers,
Sprint (AS 1239) seemed to have the #1 spot locked up with a score of almost 25% more than second-place Level 3 (AS 3356).
Since that time, 
Level 3 has been gradually and consistently closing the gap and took over the #1 spot for the first time this year.
Despite a few long, relatively flat periods,
Level 3 had two huge surges, one in July and another in November.
In July, 
Level 3 picked up Korea Telecom (AS 4766) and ReTN.net (AS 9002) as customers.
(Previously we had them classified as peers.)
And Asia Netcom (AS 10026) greatly increased the number of prefixes they were transiting via Level 3.
November's rise was almost entirely due to Japan's Softbank Telecom (AS 4725) increasing the number of transited prefixes from 2% to just over 11%.
Although they seem to have tailed off recently,
Sprint's growth was more consistent over the year, 
but not enough to counter the big wins by Level 3.
The average score in this cluster increased by 14% over the year.
</p>

<p>
<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/ScrollLine2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog/2008-winners-losers/mid-tier.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/ScrollLine2D.swf"
             flashVars="&dataURL=/blog/2008-winners-losers/mid-tier.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>
</p>

<p>
Looking at the second cluster of providers,
a few things are readily apparent.
First, Verizon (AS 701) and NTT (AS 2914) both suffered sharp declines over short periods.
Second, Global Crossing (AS 3549) and TeliaSonera (AS 1299) grew consistently during the year and were the only ones in their peer group to show appreciable gains.
Verizon lost big after 
<a href="http://www.renesys.com/blog/2008/03/he-said-she-said-cogent-vs-tel.shtml">TeliaSonera stopped using them for transit</a>.
(Peers, whether paid or not, do <em>not</em> get credit for each others customers.)
Likewise,
Cogent 
<a href="http://www.renesys.com/blog/2008/06/cogent-becomes-transitfree.shtml">became transit-free</a> mid-year after they stopped using NTT to reach AOL ATDN (AS 1668),
which accounted for much of NTT's decline.
Global Crossing's gains consisted of numerous big wins, many in Asia,
picking up increased transit from Hurricane Electric (AS 6939), PCCW (AS 3491), ReTN.net (AS 9002), Asia Netcom (10026) and Golden Telecom (AS 3216) to name a few.
As a result,
Global Crossing catapulted from #5 to #3 in the world, 
quite an accomplishment, 
especially considering they were #25 back in 2006.
TeliaSonera also picked up increased transit from Hurricane Electric and Asia Netcom,
briefly passing AT&T to capture the #7 spot for a time.
But as of a few days ago, AT&T sneaked ahead again by picking up more transit from Uninet (AS 8151), a large retail provider in Mexico.
Still, given the flat or declining fortunes of most of this group, 
I would not be surprised to see TeliaSonera ranked #4 or #5 by the end of 2009.
While quite small, the average score increase for this cluster is not very meaningful, 
given the very different trends observed.
</p>

<p>
<body bgcolor="#ffffff">
   <object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000"
           codebase="http://fpdownload.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0"
           width="690" height="400"
           id="Line" >
      <param name="movie" value="/fusioncharts/charts/ScrollLine2D.swf" />
      <param name="FlashVars" value="&dataURL=/blog/2008-winners-losers/bottom-tier.xml" />
      <param name="quality" value="high" />
      <embed src="/fusioncharts/charts/ScrollLine2D.swf"
             flashVars="&dataURL=/blog/2008-winners-losers/bottom-tier.xml"
             quality="high"
             width="690" height="400"
             name="graphs"
             type="application/x-shockwave-flash"
             pluginspage="http://www.macromedia.com/go/getflashplayer" />
   </object>
</body>
</p>

<p>
The members of our final cluster largely tracked one other with similar growth rates.
One exception is Tiscali (AS3257),
which moved from the bottom of its peer group to near the top, within striking range of Tata/VSNL/Teleglobe (AS 6543).
In fact, they surpassed them briefly during the year on a couple of occasions. 
Tiscali picked up increased transit from Asia Netcom
and gained Interoute (AS 8928) and KPN (AS 286) as customers.
China Telecom (AS 4134) moved to #11 all the way from #21 back in 2006.
While China Telecom is by no means a global provider,
they are by far the dominant player in the huge Chinese market,
and also provide transit to South Korea and Vietnam.
These large and emerging markets allowed China Telecom to move up two places in our global rankings this year and to within easy striking range of the top 10.
The average score in this cluster increased by over 40%.
</p>

<p>
<b>Conclusions</b>
</p>

<p>
It's pretty clear from this narrative that the providers who are moving up in the rankings are doing so thanks in large measure to Asia.
This should come as no surprise.
With more than half the world's population and only 
<a href="http://www.internetworldstats.com/stats.htm">15% online</a>,
there is enormous potential here.
But market share does not imply profitability or even a well-run network.
So we would never suggest you pick your providers based on Renesys Market share rankings alone.
While declining market share might indicate a problem,
a growing company is worth your business only if they meet your service needs and are going to be around tomorrow.
Rankings, by their very nature,
cannot take all possible considerations into account,
nor can they be tailored for each consumer's needs,
and thus they should only be one factor used in making an informed business decision.
</p>]]>
    </content>
</entry>

</feed>
