Zammad stops checking mailbox after some time during a day and the corresponding notifications don’t come through to our preferred channel (Slack)
Steps to reproduce the behavior:
Upgrade to 4.0
Ever since we upgraded to the latest version (went without any issues), the scheduler has been failing permanently during a day.
scheduler_err.log only has “log writing failed. execution expired” – that’s all there is and been since the update. Googling for this error isn’t fruitful.
There are no errors in production.log, postgres logs are alright, and nothing suspicious in elasticsearch logs. Elastic works fine, Zammad front-end lets me in, although it might occasionally throw 5xx error.
Things run smoothly after restarting zammad service, but only for several hours.
Has anyone encountered this behavior? Any suggestions on how to at least fix the logging for scheduler so I can investigate further?
We encountered the exact same problems and behaviors after upgrading to 4.1
In the beginning everything works fine.
The productions.log shows “scheduler running…” periodically.
Out of the sudden the entry wont show up, Emails wont get fetched and processed.
Since we see no error message we cant reproduce the problem except by waiting for a couple of minutes or hours…
We set up the same task but every 2 hours - currently it’s the only reliable “fix”.
I made another backup and updated to 4.1. Scheduler is still failing within the first hour and logger is still broken by some timeout issue.
There’s like a dozen topics on this forum and a handful issues on github that describe similar behavior of the scheduler. Some posts date back to 2.x version. None of them have any working solution that could be applied to 4.x.
We have a tiny helpdesk compared to what some of the users have here: 4 agents, 2-10 tickets per day. The server is pretty robust for such setup: 10GB RAM, decent Xeon CPU, 4TB HDD. Elastic automatically starts with 4GB heap size, so that should be enough.
I’d be grateful if someone could help me figure out how to solve the logging problem at least