Known issue: Gateways appear as offline

htdvisser · July 6, 2017, 7:50am

What happened

The NOC

The NOC experienced unexpected timeouts while writing to its database. Instead of backing off, the NOC started even more processes trying to write to the database (an unintended positive feedback loop). This resulted in an explosion of goroutines and memory usage and eventually the NOC just kept crashing/restarting/…

The Proxy

We have an SSL-terminating proxy in front of the NOC. When the NOC crashed and restarted within a second, the proxy did not close existing connections

The Routing Services

Because the connections were not closed by the proxy, the routing services (router, broker, …) that forward metadata to the NOC did not back off to allow the NOC to recover, this also didn’t really help. Instead, they started buffering these messages, dumping a flood of messages onto the NOC when it came back after a crash. For some reason these components also spawned more goroutines, leading to extremely high memory usage and slowdown of message processing.

Mitigation

We temporarily disabled forwaring metadata to the NOC, but as a result the gateways now appear as offline on the console and maps.

Resolution

We are still working on reproducing the issue in a controlled environment, and will post an update when we know more. We aim to re-enable NOC forwarding within a couple of hours, after which the gateway pages should display the correct gateway status again.

htdvisser · July 6, 2017, 2:31pm

We decided to push re-enabling the noc to tomorrow.

arjanvanb · July 7, 2017, 12:31pm

Until then, ttnctl gateways status [gatewayID] still shows the actual status of your gateways.

niau · July 7, 2017, 2:21pm

Now miraculously my gateway appeared online

gsethi2409 · July 7, 2017, 2:27pm

Two of our gateways appear online now! Finallyyyy!

alexbn71 · July 7, 2017, 2:31pm

Mine too, yippee!

Alex

mark189 · July 7, 2017, 6:34pm

Hello, is the problem solved? our gateway status is still “not connected”
Lora traffic is being send but the status in TTN is incorrect.

marcelstoer · July 7, 2017, 8:09pm

Watch https://status.thethings.network/ for updates.

mark189 · July 8, 2017, 7:07am

Thanx Marcelstoer

alexbn71 · July 9, 2017, 8:15pm

Are there any predictions about returning online on the map?

Thanks
Alex

Smartohm · July 10, 2017, 1:39am

The Gateway now appears in the console.

It still doesn’t appear when issuing CLI “ttnctl gateways status …” or shows as inactive with “ttnctl gateways info …”, which I normally would consider more accurate.

Is there any way to fix this?

htdvisser · July 10, 2017, 1:51pm

Did you supply --router-id to ttnctl?

alexbn71 · July 10, 2017, 2:31pm

Actions done in the dashboard on gateway and devices are reported by ttnctl in real time? or there is a delay?

Thanks
Alex

patmolloy · July 10, 2017, 3:31pm

ttnctl gateways info eui-ID
and
ttnctl gateways status eui-ID

both working for me when I enter the eui -ID of my gateway.

Smartohm · July 11, 2017, 12:16am

The problem was with --router-id, as we are using ttn-router-asia-se,

Thanks htdvisser!

This command seemed to work previously without this flag. Perhaps it was assigned a different router prior to the recent changes.

gsethi2409 · July 11, 2017, 5:26am

My gateway isn’t displayed on the map of TTN-mapper.
“Location”, “Status” and “Owner” are all public in the Gateway > Settings > Privacy tab.
There is no marker at all on the location of my gateway (EUI: eui-b827ebfffe52009e)

PS: I’m in India.

Please help!
Thank you.

eric_lee · September 5, 2018, 6:45am

Hi htdvisser,

There is an offline issue to occur when Router set to ttn-router-asia-se.
Gateways appear as online if Router set to ttn-router-us-west.
I am in Taiwan.

Can you provide any suggestions?

Thanks,

techboycr · September 5, 2018, 8:13am

Neither my gateways or devices are shown in console, is there anything that can be made? My devices can’t join the network either.

Thanks!

techboycr · September 5, 2018, 2:55pm

I just resolve it for my gateways, I changed the router to EU and they show now in the console.

Dont know if is the best practice but resolve the issue, I am in Costa Rica - Central America.

helioz · September 18, 2018, 9:50pm

Has this been resolved? It appears this thread describes the same symptoms.