4 TOPS is last-gen smartphone territory - you aren't going to have a good time running big generative models, but it's a useful amount of performance for typical edge workloads like machine vision and robotics.
EfficientDet gives almost acceptable performance for security cams on an i5 if you only decode keyframes. I wonder how many NVR channels it could handle?
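
For a rough back-of-envelope answer, something like the sketch below might help. It assumes one inference per decoded keyframe and ignores decode cost, batching and memory bandwidth. The function name and the example numbers (10 inferences/s, one keyframe every 2 s) are placeholders I made up for illustration, not measurements of any real device or model.

```python
import math


def max_channels(inferences_per_sec: float, keyframe_interval_sec: float) -> int:
    """Estimate how many camera channels a detector could keep up with.

    Assumes each channel needs exactly one inference per keyframe and that
    inference throughput is the only bottleneck (a simplification).
    """
    if inferences_per_sec <= 0 or keyframe_interval_sec <= 0:
        raise ValueError("both arguments must be positive")
    # Each channel produces 1 / keyframe_interval_sec keyframes per second,
    # so the budget is throughput divided by that per-channel rate.
    return math.floor(inferences_per_sec * keyframe_interval_sec)


# Hypothetical numbers only: 10 inferences/s, one keyframe every 2 seconds.
print(max_channels(10, 2.0))
```

Plug in a measured inferences-per-second figure for your own hardware and your cameras' actual keyframe interval to get a usable estimate.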