Help - Search - Members - Calendar
Full Version: Lost connections to server (Serious conn. problems)
The Planet Forums > System Administration > Network
thrillhaus
Game server had about 15 people in it, and all but a few dropped.

Uplink Router: CAR2-4.DLLSTX2

Time: time of this post


I personally didn't lose connection to the server...
JonThompson
Looks like broadwing is having issues again, seems to be cleared up for now, you can see spike in mrtg.
concept
Yea im getting destroyed by it to, except im coming accross on gblx
JonThompson
QUOTE (JonThompson)
Looks like broadwing is having issues again, seems to be cleared up for now, you can see spike in mrtg.


Spoke too soon - still having issues over broadwing.
thrillhaus
ibr2.dllstx2 - DOWN
car1-2.dllstx2 - DOWN

Looks like every connection except AT&T...
JonThompson
QUOTE (thrillhaus)
ibr2.dllstx2 - DOWN
car1-2.dllstx2 - DOWN

Looks like every connection except AT&T...


Now that mrtg has updated, it looks like a good sized ddos..
concept
As long as I come in over sprintlink all is good, but looks like the routes keep changing, and knocking over to horrible routes.
timdorr
Yeah, I'm showing problems here too. They're intermittent, and I'm trying to do cPanel updates, too :S
thrillhaus
Looks like Spring#1 went down and transfered it all to Spring#2, maxing it out... that would be connections going out, that is.
illyriah
I have customers reporting issues as well.
JonThompson
QUOTE (thrillhaus)
Looks like Spring#1 went down and transfered it all to Spring#2, maxing it out... that would be connections going out, that is.


Check out the inbound on globalcrossing - nice amount of traffic coming in there, def not normal traffic failing over.
thrillhaus
Giant spikes everywhere... I don't think it's DoS because it'd show more incoming connections..
timdorr
QUOTE (thrillhaus)
Giant spikes everywhere... I don't think it's DoS because it'd show more incoming connections..


Look at the network overview. ibr2.dllstx2 crashed. That's the problem. Layer3 and Sprint are out of the mix for the meantime.
thrillhaus
Ah. Hope it gets resolved soon icon_confused.gif
rabbit994
Hmmm..... Everything is normal according to my server monitors but I'm not sure. I thought GigE connections were redundent. Hopefully, whatever it is, is setting alarms off all over the NOC and they are too busy to check the forums.
concept
I called, he said 15 minute ETA on the fix
thrillhaus
My thoughts exactly. I opened a ticket to see if I could get a definate answer, but not sure when I will...

EDIT - Great! But 15 minutes can mean a lot to some people icon_sad.gif
JonThompson
QUOTE (thrillhaus)
Giant spikes everywhere... I don't think it's DoS because it'd show more incoming connections..


Ah it looks like they are having problems with a part of their network, if you look at the network map:
http://www.servermatrix.com/about_us.html#network
Everything coming into the second M20 (sprint x2 and level3 x2 and probably broadwing as its not on diagram) is down, everything else is failing over for them being down.
thrillhaus
Couldn't get onto SM's forums there for a little bit. Orbit was working at the time, but now Orbit is down and this is back up...
SiRi
Problems here also.

I "upgraded" from my SM Dual Xeon to the TC2800 about 2 months ago. My SM was in dllstx4, my new TC2800 is in dllstx2. I'm starting to miss my old SM...
thrillhaus
Funny.. seems that dllstx4 has more network issues than dllstx2 icon_wink.gif

Also, I called them and they also told me about 15 minutes... we shall see.
thrillhaus
Seems that distribution switch is back up.
SiRi
Wait, I was wrong. I just assumed and we all know where that leads you.

My TC 2800 is in 4, but it seems to be having problems as well. 2 is/was the rox!

SiRi
timdorr
QUOTE (SiRi)
Problems here also.

I "upgraded" from my SM Dual Xeon to the TC2800 about 2 months ago.  My SM was in dllstx4, my new TC2800 is in dllstx2.  I'm starting to miss my old SM...


This wasn't specific to one datacenter or the other, as the network for both runs through the main routers. One of the routers went down and the "downtime" was while routes had to reconfigure.
timdorr
QUOTE (rabbit994)
I thought GigE connections were redundent.


They are. If they weren't, you would show a downtime of a LOT longer than we experienced. However, router reconfiguration isn't instant and it takes a bit for things to work themselves out when a router goes down.
This is a "lo-fi" version of our main content. To view the full version with more information, formatting and images, please click here.
Invision Power Board © 2001-2010 Invision Power Services, Inc.