That sounds like a problem with the test suites themselves, not with TestSwarm or any other execution mechanism. For jQuery's test suite we've put a lot of work into making sure that the tests run reliably and the results are easily repeatable.
There's a big difference between one test randomly failing and someone submitting completely bogus results.