That my friend is the secret sauce. My first job out of school was for a staffing company that pimped me out to perform similar projects. Word of mouth mostly, it's very lucrative if you have domain knowledge and can sit for long periods of time. It's not Mechanical Turk style cleansing and I have to take personal responsibility for every row as they mostly just suck up my cleansed data and roll with it, which you can't do with ML cleansed data or off-shored cleansed data. I've been thinking of launching a SaaS product but the transition from trusted consultant to enterprise SaaS is not easy. What do you do?
I do big data stuff mostly. A lot of that involves cleansing. So working with large excel/csv/other tabular data formats is second nature to me. Familiar with many technical tools that I could maybe use as force-multiplier (e.g. serverless SQL processing, OLAP, RDBMS, python pandas, spark), hence my interest since if we relax the condition that every single row is accounted for, my productivity would likely be pretty high
The customers always find the rows that are not accounted for, it's bad form to drop data. We sell high quality data, which you can't claim if you can't account for every single row. You should be careful about delivering an incomplete data-set.