Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Hey! Camelot maintainer here. You can check out this doc for details on how Camelot extracts tables from PDFs: https://camelot-py.readthedocs.io/en/master/user/how-it-work...

As pointed out in this thread, right now it only works with text-based PDFs. But there's a PR[1] which will add OCR support (using EasyOCR) for image-based PDFs in some time.

[1] https://github.com/camelot-dev/camelot/pull/209



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: