What exactly does focal loss down-weight?
For a correctly classified example with target-class probability pₜ close to one, what is the primary effect of the focal-loss factor (1−pₜ)^γ with γ>0?
Sign in to answer questions and track your progress
Sign In