You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, when the issue happens, we restart the app server, use the Github sync and manually fix deployments. Works most of the time, as it doesn't happen frequently.
What we know so far
There's a 5-second gap between receiving the last event from NATS and the consumer being deleted
There might also still be some todos left within "What we should do". I'd propose we add some kind of log for a certain threshold regarding the duration of the handler. I can also deal with it tomorrow.
There might also still be some todos left within "What we should do". I'd propose we add some kind of log for a certain threshold regarding the duration of the handler. I can also deal with it tomorrow.
I think we gave the threshold values generously (NATS) and that long running webhook listeners problem is kinda solved.
I am aware that It shouldn't be taking that long but I also suggest that we should focus on the sprint tasks and keep this issue in mind. Then when we finish our tasks lets add those hard-limits or finding the what is taking long. What do you say?
In some instances, we noticed that our app server stops handling Webhook events seemingly randomly.
Server logs
This is an excerpt from the logs that contains the last event we handled and the first errors we got after that.
Workaround
Currently, when the issue happens, we restart the app server, use the Github sync and manually fix deployments. Works most of the time, as it doesn't happen frequently.
What we know so far
What we should do
InactiveThreshold
to more like 20+ seconds manually as a hot-fix for nowThe text was updated successfully, but these errors were encountered: