Really? I have an image with 180 classes on, so far (slight model config changes from pets tutorial) I've managed to detect 80% of the classes. Don't get me wrong images are similar in both sequence of objects and background (supermarket shelf).
I have a budget to bring in a consultant, is this something you'd be interested in?