I can only recommend hetzner. At my former company, we started with 8 servers with them nearly 15 years ago and now have more than 6000 servers with them.
Interesting! How did you get GPU servers? They used to be available but can you still get them? And can you share more details about the automated hardware failure monitoring?
https://www.hetzner.com/customers/talkwalker