Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>I had to do a heuristic layer on top to break up the PDFs into small chunks so the output didn’t overflow

How do you stitch the outputs of all chunks without losing the overall context?



The output is just individual line items from the invoices, so all you have to do is concatenate the outputs of the chunks. If there was data that crossed a page, it would have been harder!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: