Single point of failure blamed for Verizon FiOS, DSL outage

By Scott M. Fulton, III | Published October 5, 2009, 11:15 AM

A single stalled router is being blamed by Verizon officials for a service outage that impacted customers of its high-speed Internet service, including fiberoptic FiOS, in New York and Massachusetts.

The outage occurred at approximately 3:15 pm EDT, according to a message Friday afternoon from the company's chief PR executive, Eric Rabe. He acknowledged that routers typically fail over to adjacent ones, but in this instance, this one didn't.

"The router went into a hung state and did not appear to the rest of the network as though it was having problems," Rabe wrote, being careful not to name the manufacturer.

According to reporting from Telephony Online's Ed Gubbins, Verizon's principal hardware provider for FiOS is Juniper Networks. In fact, Gubbins reports, Verizon contributes 13% of Juniper's total revenue, and may be the sole reason why that company found black ink again last year.

Juniper's E-series routers service Verizon's broadband network. Last October, Juniper announced a major upgrade to its routers' operating system, adding features that included the capability for service providers to deploy deep packet inspection -- the ability to analyze Internet traffic based on its contents. The company marketed this feature as part of its "Intelligent Services Edge" portfolio, which it described as "leverag[-ing] a single, consistent operating system and high-performance hardware to flexibly deliver many service types -- including broadband routing, voice, multimedia and integrated security, as well as application-level awareness."

The rollout schedule for these changes targeted the third quarter of 2009. The latest version of Juniper's router software for E-series routers, called JUNOSe, began rollout on August 13. While the evidence that Juniper's router software may have been involved is circumstantial, these facts do tell a curious tale.

As some customers confirmed to Verizon's support forum, Rabe's statement that the outage lasted about 40 minutes on Friday afternoon appears accurate. However, other customers, including in Massachusetts, reported poor or no service even after the problem was resolved by 4:00 pm. What's more, support representatives who diligently worked with customers in an attempt to resolve issues as if their own on-premise equipment were to blame, were apparently not informed of the service outage themselves until after the problem was resolved. One heavy-use customer complained to Verizon, "I have 30 years in networking designing service provider networks and I don't have a single design that has a single point of failure. It appears Verizon does."

That provoked another customer to come to Verizon's defense: "Even with redundancy, there is no way to guarantee 100% availability. That is an impossibility, I don't care who you are. And that percentage is including regularly scheduled failover testing."

Comments

View comments by with a score of at least

"Last October, Juniper announced a major upgrade to its routers' operating system, adding features that included the capability for service providers to deploy deep packet inspection -- the ability to analyze Internet traffic based on its contents."

This ability--the ability to snoop--may have got them into trouble. Perhaps, it went deeper and deeper trying to determine the content. Someone may have sent some (fractal?) data designed to test the network in order to disguise other data in the future.

Score: 1

|

"I have 30 years in networking designing service provider networks and I don't have a single design that has a single point of failure."

Wait so they design their networks to traverse the globe a myriad of ways, using completely different routers with different OS's? Hire this mofo!

Score: 1

|

Lol, no kidding! :P

Score: 0

|

At some point in the process there is (in most real world installations) the point where it all funnels off the property. Unless you have multiple providers, with multiple technologies, with a backup redundant redundant backup array. Some how, some way it can still go down. IE: it all leaves the bulding on one fiber bundle. (which one backhoe operator can surely damage in a heartbeat). I am often suprised that with the vast amount of technology, installed by varing degrees of knowedgable individuals from outright idiots to supreme masters of the craft that things are as relaible as they are when you really think about it.

I think "Mr 30 years" must be an exectutive that lives in a fishbowl, and gets lied to by his subordinates who pray the system will work another day.

Score: 0

|

there is always a SPF somewhere in every network, no matter how small or unrellivent it may be thought of, there is at least one.

Score: 0

|

Google Buzz: Another attempt to harness the content firehose

Similar to how Google successfully remolded RSS into a Google tool, the company now wants to remold Gmail into one big Google party

Success: Google's Nexus One shipping support line takes tech support questions

UPDATED Though the support line had been set up for shipping, it now appears Google personnel are happy to hear technical concerns.

Goodnight, moon: What I learned from a space shuttle

Carmi Levy | Wide Angle Zoom: Can the tech sector learn a few lessons from the space program? Certainly, if you believe in learning from someone else's mistakes.

Netflix to FCC: NBCU + Comcast could bypass net neutrality

Weaning itself from the post office as its main means of video transfer, Netflix would like someone to ensure the Internet remains just as unencumbered.

Rhapsody to become an independent company

RealNetworks and Viacom subsidiary MTV Networks have begun the process of spinning off music service Rhapsody into an independent company.

Nvidia debuts new dynamically-switched graphics card technology

Today, Nvidia announced that its Optimus technology for GPU switching will soon be available in a handful of Asus notebooks.

Google lowers 'unusually high' early termination fee on Nexus One

Google has lowered the Nexus One's early termination fees which were twice as high as the norm.

Netgear and Ericsson introduce a mobile broadband hotspot with a twist

It's a mobile broadband hotspot, but it's for use in the home.

Report: Streaming video drove 72% global increase in mobile data consumption

A new study says streaming video is "the single most influential factor driving the need for increased mobile network capacity."

Stymied by continuing Nexus One 3G issues, Google blames the environment

If you're still afflicted with the 3G flip-flop trouble, then you might consider moving. That appears to be the only suggestion Google can give for now.

Wolfram|Alpha makes a strong argument for virtual keyboards

"Answer engine" Wolfram|Alpha has updated its iPhone/iPod Touch app, harnessing the strength of the virtual keyboard.