Comments
They are taking a nap.
Not sure if it's related to the router software update notice they sent out.
I had missed their notification in email.
They are upgrading their router software.
My NY server has been down for 3 hours and 38 minutes now.
I thought they would be offline for only 15-20 minutes, as per their email.
I can confirm my NY server is also down atm.
Update: ok nvm received the same email as above.
Reliablesite is not reliable, what a surprise…
They could be experiencing issues they hadn't anticipated. Even so, it's still within the planned outage timeframe.
We have a number of servers in their NYC DC. The original outage was around 50 mins for us, then everything came back up until the 2nd outage around 20 mins ago. Everything is currently back up.
From the wording of the maintenance alert, I would imagine there could be multiple small outages during the maintenance window.
A copy of the scheduled maintenance is available in the dedicated panel under "Maintenance & Status". We had about 7 switches that didn't come back online for whatever reason, 3 are fixed, 4 are being worked on.
All switches are up, finalizing a few things, and we should be wrapped up.
They announced the maintenance like 3 hours before it started lol.
And the maintenance was supposed to mean a 15-minute outage, and it ended up being 75 minutes.
Funny thing is that all our NY servers with them had the same outage, so I guess they're all on the same switch. How lucky we are.
Mine has been down for 4 hours and 50 minutes now and is still not back up.
What does support tell you?
No response yet.
In the city that never sleeps.
I got a response, but just the generic:
Are your NY servers still getting the same network speed now that they're back online?
Mine is back after 6 hours of downtime, but the network download speed is very slow.
I tested it via iperf3 on the server.
Sadly, I hadn't run any tests beforehand, so I have no baseline to compare against.
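For anyone who wants a baseline next time, here is a minimal Python sketch of one way to do it. It assumes you saved the JSON report of a TCP test with iperf3's `-J` flag and that the report has the usual `end.sum_received.bits_per_second` field. The function names are made up for illustration, not part of any tool.

```python
import json


def received_mbps(iperf3_json: str) -> float:
    """Receiver-side throughput in Mbit/s from an `iperf3 -J` TCP report.

    Assumes the report contains end.sum_received.bits_per_second.
    """
    report = json.loads(iperf3_json)
    return report["end"]["sum_received"]["bits_per_second"] / 1e6


def slowdown_percent(baseline_mbps: float, current_mbps: float) -> float:
    """How much slower the current run is than the baseline, in percent."""
    return (baseline_mbps - current_mbps) / baseline_mbps * 100.0
```

Save one report while things are healthy and another after an incident, then pass both through `received_mbps` and compare them with `slowdown_percent`.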
Here's how yesterday's timeline went:
~9AM Eastern - We started observing abnormal CPU behavior in NY1. It was found to be related to abnormal ARP processing. The source of the issue was found and fixed by 11:30 AM. The router was reporting normal operation.
~6:40PM Eastern - We started observing abnormal errors in logs and routing behavior.
~8:00PM Eastern - Juniper was engaged and determined the issue was caused by a known software bug in JunOS and recommended an upgrade.
~10:30PM Eastern - Planning had been finished and a maintenance period was put together. At this specific point in time, the router was already refusing ARP and refusing to add next-hops to its table. We were limited in the kinds of configuration it would accept, and the 1-click installer and much of our automation were no longer operational. Despite these issues, the router was still pushing traffic normally, but we didn't know for how long. The time period we set was based on traffic levels on our network in NY1: we chose the least busy time possible, pushing the maintenance as far back as we could.
"We expect brief unavailability in 15 to 20 minute periods"
No, it was stated as "15 to 20 minute periods", which is how long the routing engines usually take to reboot. The first one took longer because the bug we were trying to avoid took the backup RE offline ~15 minutes after we gracefully transferred traffic to it, and we couldn't bring it back until we finished the upgrade on the primary RE.
=========
Trust me when I say that this wasn't what we wanted to be doing, but the alternative would have led to a much longer and much more painful unplanned outage.
@MrRadic Do other ReliableSite locations have routers with the same bug?
I believe they were updated years ago for specific feature requirements, but would need to confirm with my network engineers.
I must confess I did also read it as 15-20 minutes of downtime on my first read through of the email.
I really like ReliableSite; in general I have very few problems.
MrRadic
a comparison, that MrRadic is always here providing information, this is very important.
I'm waiting for more Miami servers to become available to purchase