White RoomNEW

Can a lower Brier score prove better calibration?

Classifier M1 has a lower Brier score than classifier M2 on the same test set. Which conclusion is the most defensible?