Metrics for CL
For the final evaluation metric, BeGin provides the following metrics. In the mathematical expression, \(\mathrm{M}_{i,j}\) indicates the performance on on j-th task after the i-th task is processed, \(\mathrm{M}^{joint}\) is a basic performance matrix of the joint model, and \(r_i\) denotes the performance of a randomly initialized model on i-th task.
Average Performance (AP)
Average performance on each task after learning all tasks.
Average Forgetting (AF)
Average forgetting on each task after learning all tasks. We measure the forgetting on the i-th task by the difference between the performance on the i-th task after learning all tasks and the performance on the i-th task right after learning the i-th task.
Intransigence (INT)
Average intransigence on each task. We measure the intransigence on the i-th task by the difference between the performances of the Joint model and the target model on the i-th task after learning the i-th task.
Forward Transfer (FWT)
Average forward transfer on each task. We measure the forward transfer on the i-th task by the difference between the performance on the i-th task after learning (i−1)-th task and the performance \(r_i\) on the i-th task without any learning.