Surprisingly, this has been a project I’ve been tinkering with for years. There is an easy way to get the raw png/jpeg files out, but it does require a windows box. Im planning on working on it more over the long holiday.
I've predominantly worked in two industries, healthcare/public health and insurance where policies terms are measured in decades. The software for both ranges from 20 to 40 years old, and it hasn't been upgraded because to do so poses an existential risk to either the business or, in the case of healthcare, to human life. Upgrades are measured in terms of human generations because of said risk, but I wouldn't call these systems legacy due to not moving beyond java 1.6.
What do you mean by “traditionally hard” in relation to a pdf? Most if not all of the docs I’m tasked with parsing are secured, flattened, and handwritten, which can cause any tool (traditional or ai) to require a confidence score and manual intervention. Also might be that i just get stuck with the edge cases 90% of the time.
After reading a good chunk of the comments, I got the distinct impression that people don't realize we could just not do the whole "let's make a dystopian hellscape" project and just turn all of it off. By that I mean, outlaw AI, destroy the data centers, have severe consequences for it's use by corporations as a way to reduce headcount, I'm talking executives get to spend the rest of their lives in solitary confinement, and instead invest all of this capital in making a better world (solving homelessness, the last mile problem of food distribution, the ever present and ongoing climate catastrophe). We, as humans, can make a choice and make it stick through force of actions.
Or am I just too idealistic ?
Sidenote, I never quite understand why the rich think their bunkers are going to save them from the crisis they caused. Do they fail to realize that there's more of us than them, or do they really believe they can fashion themselves as warlords?
We might not be able to wish it away, but we can, as a society, decide to not utilize it and even actively eradicate it. I honestly believe that llm's/ai are a net negative to society and need to be ripped out root and stem. If tomorrow all of us decided to do that, nothing bad would happen, and we'd all be ok.