I asked a similar question some time ago and got answers similar to yours. However, one reply, from a particularly well-placed forum member, did address my specific query and indicated that, yes, there are criteria, and that at some point we may find out what they are:
It’s very hard to describe, but easy to see. We’re working on a more robust system for this. Once it’s out, you’ll be able to see how we classify failed vs. non-failed.