Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

There are a lot of other options and preprocessing methods you can use to get better results. It's hard to tell without seeing the picture but thresholding/binarization might help with the contrast. In order to isolate the text, the mode option also makes a lot of difference: https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html#...

If that doesn't work you'll have to add a text localization model to your pipeline.



Thank you for your guidance. I will investigate further




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: