What exactly does focal loss down-weight?

For a correctly classified example with target-class probability pₜ close to one, what is the primary effect of the focal-loss factor (1−pₜ)^γ with γ>0?
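To make the question concrete, here is a minimal numerical sketch. It assumes the common form of focal loss, FL(pₜ) = −(1−pₜ)^γ · log(pₜ), with no class-balancing α term; the function names are hypothetical and chosen only for illustration. It compares plain cross-entropy with focal loss across a range of pₜ values so the size of the factor can be inspected directly.

```python
import math


def cross_entropy(p_t: float) -> float:
    """Plain cross-entropy for the target class: -log(p_t)."""
    return -math.log(p_t)


def modulating_factor(p_t: float, gamma: float = 2.0) -> float:
    """The focal-loss factor (1 - p_t)^gamma."""
    return (1.0 - p_t) ** gamma


def focal_loss(p_t: float, gamma: float = 2.0) -> float:
    """Focal loss without alpha weighting: (1 - p_t)^gamma * -log(p_t)."""
    return modulating_factor(p_t, gamma) * cross_entropy(p_t)


if __name__ == "__main__":
    gamma = 2.0  # assumed value, used here only as an example
    print(f"{'p_t':>6} {'factor':>10} {'CE':>10} {'FL':>12}")
    for p_t in (0.1, 0.5, 0.9, 0.99):
        print(
            f"{p_t:>6.2f} "
            f"{modulating_factor(p_t, gamma):>10.4f} "
            f"{cross_entropy(p_t):>10.4f} "
            f"{focal_loss(p_t, gamma):>12.6f}"
        )
```

With γ = 2, the factor is (1 − 0.9)² = 0.01 at pₜ = 0.9 but (1 − 0.1)² = 0.81 at pₜ = 0.1, so the loss of the confident, correct example is scaled down by about 100× while the hard example is left almost unchanged. Setting γ = 0 makes the factor 1 and recovers plain cross-entropy.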