AI OCR feature in Zammad 7 vs. Tesseract

Dear Zammad Team,

we are evaluating Zammad 7 in our test environment and found most of the AI features very useful.
Thank you very much for your effort to properly implement features where it makes sense.

One thing we noticed was the OCR feature using AI. While I think that AI has its pros, the amout of processing required (energy and ressource consumption) is significant.

For our environment, using AI for simple OCR tasks (screenshots of logs or even error messages) seems not necessary or even overengineered.
A simpler approach like by using Tesseract could help obtaining text data for AI summarization while keeping a low ressource footprint.

Have you considered using Tesseract in the development process?

I do not want to see you guys removing the AI OCR feature but maybe consider offering an alternative.
Another approach could be to use tesseract as default and switch to AI if no text has been detected.

Keen to know your thoughts about that.

Best regards,
Bernhard

For now we used a simple approach to add this support, because also mostly every newer LLM directly supporting it this days, without the need to swichting around.

It’s a similar situation like for translation, I think here are also multiple possibilities.

In the end, the current OCR-Feature implementation is only the start, there are more ideas, so maybe we will also add some more possibilities of selecting the OCR mode (LLM vs other OCR engines - but our direction would definitiv to have one way).

1 Like