Checking against callibration measurements isn't the only way to do things. With M87's SMBH for instance, they had a simulated image of what they ought to see given general relativity and what they knew about M87. The extracted data matches that fairly closely, thus lending credibility to the result for obtaining parity with physics without actually applying those physics to the data.
>With M87's SMBH for instance, they had a simulated image of what they ought to see given general relativity and what they knew about M87.
How confident are we that our current simulations accurately reproduce the universe?
Given how many unpredicted and supposedly impossible exoplanet and star configurations that keep being found, I'd say the current model is not doing so well on the prediction front.
Those configurations aren't "impossible" due to fundamental well tested physics like general relativity. They're "impossible" from the perspective of our understanding of the formation of planetary systems, which is understandably less well developed given how much more difficult it is to study since as a science, exoplanet detection is just 30 years old, with the majority of detections being less than 15 years old.
In comparison, general relativity is one of the most well tested theories in physics, having been undergoing rigorous testing for over a century, and regardless, the point stands that the results were close to the model, thus providing evidence for the model's validity.
If general relativity based models have struggled to help predict or explain planetary system formation and exoplanet observations, I think we should have the same level of expectation about how helpful a general relativity based black hole model will be.
General Relativity has pretty much nothing to do with planet formation.
Newtonian gravity is good enough for that problem, and the problems have nothing to do with not understanding gravity well enough. The problems have to do with things like complex chemistry in stellar accretion disks, how grains in these disks stick to one another to eventually form rocks, and so on.
You might as well be saying that if fluid dynamics models have struggled to help predict or explain CPU performance increases, we should have the same level of expectation about how helpful a fluid dynamics based model of aerodynamics will be, i.e. complete nonsense