I asked @OceanStWx about that, and supposedly it had better results than the GFS op (in early testing, anyway) but didn’t perform as well in anomalous events, which one would expect when a model is trained on past events.
The other thing he said is that each vertical level is trained independently, so the 500mb pattern may not match what you’d expect to see for the MSLP.