Supported Metrics¶
Utility¶
Utility metrics are defined by the literal
fairical.metrics.UtilityMetricsType. The column Objective indicates if the
value needs to be /maximized (the higher, the better) or /minimized (the
lower, the better). The column Threshold indicates if the metric needs to be probed on
a specific threshold (/yes, requires thresholding; /no). The column named
Exclusive indicates if this utility metric can be used alone in system evaluations
(/yes, can be used exclusively; /no, cannot). If not, one should ensure to
pick two (or more metrics) containing from both negative and positive samples.
Metric |
Range |
Objective |
Threshold |
Exclusive |
Description |
|---|---|---|---|---|---|
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
||||
|
\([0,1]\) |
Implementation of these metrics rely mostly on the scikit-learn toolkit
(Pedregosa et al. [PVG+11]).
Fairness¶
Fairness metrics are defined by literals fairical.metrics.FairnessMetricsType
and fairical.metrics.MinMaxFairnessMetricsType. The first literal type
includes fairness metrics which are parameterised only by a protected attribute (such as
age, or gender). The second class of fairness metrics correspond to min-max
criteria comparing specific utility metrics types (see above) between protected groups.
It therefore requires two parameters: The utility metric (as per table above), and a
protected attribute. Separate parameters of a metric using the + (plus sign).
Examples are provided on the next table. The column Threshold indicates if the metric
needs to be probed on a specific threshold (the value /depends is attributed
to fairness metrics in which thresholding depends on the chosen utility metric). In such
cases, use the table above to determine if thresholding of scores is necessary. The
column Objective indicates if the value needs to be /maximized (the higher, the
better) or /minimized (the lower, the better).
Metric |
Parameters |
Example |
Range |
Objective |
Threshold |
Description |
|---|---|---|---|---|---|---|
|
attribute |
|
\([0,1]\) |
|||
|
attribute |
|
\([0,1]\) |
|||
|
attribute |
|
\([0,1]\) |
|||
|
attribute |
|
\([0,1]\) |
|||
|
attr., util. |
|
\([0,1]\) |
|||
|
attr., util. |
|
\([0,1]\) |
Implementation of these metrics rely on the fairlearn toolkit
(Weerts et al. [WDE+23]).