Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think CogVLM2 is even better than Intern at OCR (my usecase is extracting information from an invoice)


After some superficial testing I with bad quality scans you can find on kaggle I can not confirm that. CogVLM2 refuses to handle scans that InternVL-V1.5 still can comprehend.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: