The issue here is the automatic OCR for images. There's just not a lot of super good locally hosted solutions for this. Tesseract can be hit or miss depending on how lossy your images are.
AI and good OCR likely means you're gonna be paying for some kind of SaaS offering.
AI and good OCR likely means you're gonna be paying for some kind of SaaS offering.