A backwards compatible reimplementation of fastai metrics to increase usability and flexibility.

All fastxtend metrics are classes which inherit from fastai's Metric and run on Learner via a modified Recorder callback.

There are three main metric types: AvgMetricX, AccumMetricX, and AvgSmoothMetricX. These correspond one-to-one with fastai's AvgMetric, AccumMetric, and AvgSmoothMetric.

To jump to the fastxtend metrics reference, click here.

Using a Metric

To use the accuracy metric, or any of the fastxtend metrics detailed below, create a Learner as usual (or a task-specific learner such as vision_learner, text_classifier_learner, etc.) and add the metric(s) to the metrics argument:

from fastai.vision.all import *
from fastxtend.vision.all import *

Learner(..., metrics=Accuracy())

Fastxtend metrics can be mixed with fastai metrics:

Learner(..., metrics=[accuracy, Accuracy()])

Fastxtend metrics can be logged during training, validation, or both by setting the log_metric argument to LogMetric.Train, LogMetric.Valid, or LogMetric.Both. The sole exception is AvgSmoothMetricX which only logs during training.

To log a fastxtend metric during training pass LogMetric.Train to log_metric:

Learner(..., metrics=Accuracy(log_metric=LogMetric.Train))

Non-scikit-learn metrics can have the metric type set via the metric_type argument to one of MetricType.Avg, MetricType.Accum, or MetricType.Smooth, corresponding to AvgMetricX, AccumMetricX, and AvgSmoothMetricX, respectively.

To log a smooth metric on the training set and normal metric on the valid set:

Learner(..., 
        metrics=[Accuracy(log_metric=LogMetric.Train, metric_type=MetricType.Smooth), 
                 Accuracy()])

Fastxtend metrics also support custom names via the name argument:

Learner(..., metrics=Accuracy(name='metric_name'))

which will result in Accuracy logging under "metric_name" instead of the default "accuracy".

If a fastxtend metric is logged with multiple MetricTypes, the fastxtend Recorder will automatically deduplicate the metric names, unless the metric's name argument is set, in which case fastxtend will not deduplicate any metric names.
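
For example, setting name explicitly logs each metric under the exact name given, skipping deduplication (a sketch; the automatically deduplicated names depend on the Recorder):

Learner(..., 
        metrics=[Accuracy(log_metric=LogMetric.Train, metric_type=MetricType.Smooth, name='smooth_accuracy'), 
                 Accuracy(name='valid_accuracy')])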

Creating a Metric

AvgMetricX, AccumMetricX, and AvgSmoothMetricX all require func, a functional implementation of the metric. The signature of func should be inp,targ (where inp are the predictions of the model and targ the corresponding labels).

Fastxtend metrics can be logged during training, validation, or both by setting the log_metric argument to LogMetric.Train, LogMetric.Valid, or LogMetric.Both. The sole exception is AvgSmoothMetricX which only computes during training.

AvgMetricX, AccumMetricX, and AvgSmoothMetricX will automatically recognize any of func's unique arguments and pass them through to func.

An example of creating a fastxtend metric from a functional implementation:

def example_accuracy(inp, targ):
    return (inp == targ).float().mean()

def ExampleAccuracy(dim_argmax=-1, log_metric=LogMetric.Valid, **kwargs):
    return AvgMetricX(example_accuracy, dim_argmax=dim_argmax, log_metric=log_metric, **kwargs)

Alternatively, use the func_to_metric convenience method to create the metric:

def ExampleAccuracy(axis=-1, log_metric=LogMetric.Valid, **kwargs):
    return func_to_metric(example_accuracy, MetricType.Avg, True, axis=axis, log_metric=log_metric, **kwargs)

It is also possible to inherit directly from MetricX to create a fastxtend metric.

class ExampleAccuracy(MetricX):
    def __init__(self, dim_argmax=-1, log_metric=LogMetric.Valid, **kwargs):
        super().__init__(dim_argmax=dim_argmax, log_metric=log_metric, **kwargs)

    def reset(self): self.preds,self.targs = [],[]

    def accumulate(self, learn):
        super().accumulate(learn)
        self.preds.append(learn.to_detach(self.pred))
        self.targs.append(learn.to_detach(self.targ))

    @property
    def value(self):
        if len(self.preds) == 0: return
        preds,targs = torch.cat(self.preds),torch.cat(self.targs)
        return (preds == targs).float().mean()
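
However it is created, the resulting metric is passed to Learner like any built-in metric:

Learner(..., metrics=ExampleAccuracy())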

Additional Metrics Functionality

MetricX, and classes which inherit from MetricX such as AvgMetricX, AccumMetricX, and AvgSmoothMetricX, have optional helper functionality in MetricX.accumulate to assist in developing metrics.

For single-label classification problems, predictions need to be transformed with a softmax then an argmax before being compared to the targets. Since a softmax doesn't change the order of the numbers, it is enough to just apply the argmax. Pass along dim_argmax to have this done by MetricX (usually -1 will work pretty well). If the metric implementation requires probabilities and not predictions, use softmax=True.

For multi-label classification problems, or if targets are one-hot encoded, predictions may need to pass through a sigmoid (if it wasn't included in the model) and then be compared to a given threshold (to decide between 0 and 1). This is done by MetricX when passing sigmoid=True and/or a value for thresh.
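
A minimal sketch of both cases, reusing the example_accuracy function from above (the multi-label variant assumes an element-wise comparison makes sense for the wrapped function):

# Single-label: argmax predictions over the last dimension before comparing to targets
AvgMetricX(example_accuracy, dim_argmax=-1)

# Multi-label: apply a sigmoid to predictions and threshold them at 0.5
AvgMetricX(example_accuracy, activation=ActivationType.Sigmoid, thresh=0.5)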

AvgMetricX, AccumMetricX, and AvgSmoothMetricX have two additional arguments to assist in creating metrics: to_np and invert_arg.

For example, if using a functional metric from sklearn.metrics, predictions and labels will need to be converted to numpy arrays with to_np=True. Also, scikit-learn metrics adopt the convention y_true, y_preds, which is the opposite of fastai's, so pass invert_arg=True to have AvgMetricX, AccumMetricX, and AvgSmoothMetricX do the inversion. Alternatively, use the skm_to_fastxtend convenience method to handle sklearn.metrics automatically.
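
A sketch of wrapping a scikit-learn metric by hand with these arguments:

import sklearn.metrics as skm

# Convert tensors to numpy arrays and swap the arguments into
# scikit-learn's (y_true, y_pred) order before calling the function
AccumMetricX(skm.balanced_accuracy_score, to_np=True, invert_arg=True, dim_argmax=-1)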

LogMetric[source]

Enum = [Train, Valid, Both]

An enumeration.

MetricType[source]

Enum = [Avg, Accum, Smooth]

An enumeration.

ActivationType[source]

Enum = [No, Sigmoid, Softmax, BinarySoftmax]

An enumeration.

class MetricX[source]

MetricX(dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: Metric

Blueprint for defining an extended metric with accumulate

For single-label classification problems, predictions need to be transformed with a softmax then an argmax before being compared to the targets. Since a softmax doesn't change the order of the numbers, it is enough to just apply the argmax. Pass along dim_argmax to have this done by MetricX (usually -1 will work pretty well). If the metric implementation requires probabilities and not predictions, use softmax=True.

For multi-label classification problems, or if targets are one-hot encoded, predictions may need to pass through a sigmoid (if it wasn't included in the model) and then be compared to a given threshold (to decide between 0 and 1). This is done by MetricX when passing sigmoid=True and/or a value for thresh.

Metrics can be simple averages (like accuracy), but sometimes their computation is a little more complex and can't be averaged over batches (like precision or recall), which is why we need a special AccumMetricX class for them. For simple functions that can be computed as averages over batches, use the class AvgMetricX; otherwise you'll need to implement the following methods.

MetricX.reset[source]

MetricX.reset()

Reset inner state to prepare for new computation

MetricX.accumulate[source]

MetricX.accumulate(learn)

Store targs and preds from learn, using activation function and argmax as appropriate

MetricX.value[source]

The value of the metric

MetricX.name[source]

Name of the Metric, camel-cased and with Metric removed. Or custom name if provided

class AvgMetricX[source]

AvgMetricX(func, to_np=False, invert_arg=False, dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: MetricX

Average the values of func taking into account potential different batch sizes

func is applied to each batch of predictions/targets and then averaged when the value attribute is asked for. The signature of func should be inp,targ (where inp are the predictions of the model and targ the corresponding labels).

If using a functional metric from sklearn.metrics, predictions and labels will need to be converted to numpy arrays with to_np=True. Also, scikit-learn metrics adopt the convention y_true, y_preds, which is the opposite of fastai's, so pass invert_arg=True to have AvgMetricX, AccumMetricX, and AvgSmoothMetricX do the inversion. Alternatively, use the skm_to_fastxtend convenience method to handle sklearn.metrics automatically.

By default, fastxtend's scikit-learn metrics use AccumMetricX.

class AccumMetricX[source]

AccumMetricX(func, to_np=False, invert_arg=False, flatten=True, dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: MetricX

Stores predictions and targets on CPU in accumulate to perform final calculations with func.

func is only applied to the accumulated predictions/targets when the value attribute is asked for (so at the end of a validation/training phase, in use with Learner and its Recorder). The signature of func should be inp,targ (where inp are the predictions of the model and targ the corresponding labels).

If using a functional metric from sklearn.metrics, predictions and labels will need to be converted to numpy arrays with to_np=True. Also, scikit-learn metrics adopt the convention y_true, y_preds, which is the opposite of fastai's, so pass invert_arg=True to have AvgMetricX, AccumMetricX, and AvgSmoothMetricX do the inversion. Alternatively, use the skm_to_fastxtend convenience method to handle sklearn.metrics automatically.

By default, fastxtend's scikit-learn metrics use AccumMetricX.

class AvgSmoothMetricX[source]

AvgSmoothMetricX(func, beta=0.98, to_np=False, invert_arg=False, dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, name=None) :: MetricX

Smooth average the values of func (exponentially weighted with beta). Only computed on training set.

func is applied to each batch of predictions/targets and the result is exponentially smoothed (weighted with beta) when the value attribute is asked for. The signature of func should be inp,targ (where inp are the predictions of the model and targ the corresponding labels).

If using a functional metric from sklearn.metrics, predictions and labels will need to be converted to numpy arrays with to_np=True. Also, scikit-learn metrics adopt the convention y_true, y_preds, which is the opposite of fastai's, so pass invert_arg=True to have AvgMetricX, AccumMetricX, and AvgSmoothMetricX do the inversion. Alternatively, use the skm_to_fastxtend convenience method to handle sklearn.metrics automatically.

class AvgLossX[source]

AvgLossX(dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: MetricX

Average the losses taking into account potential different batch sizes

class AvgSmoothLossX[source]

AvgSmoothLossX(beta=0.98) :: MetricX

Smooth average of the losses (exponentially weighted with beta)

class ValueMetricX[source]

ValueMetricX(func, name=None, log_metric=None) :: MetricX

Use to include a pre-calculated metric value (for instance calculated in a Callback) and returned by func
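
A minimal sketch of ValueMetricX, assuming func is a no-argument callable returning a value computed elsewhere (for instance, stored by a Callback); the names below are hypothetical:

# Hypothetical store for a value computed by a Callback during training
precomputed = {'value': 0.0}

def read_precomputed():
    return precomputed['value']

Learner(..., metrics=ValueMetricX(read_precomputed, name='precomputed_value'))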

Recorder

Patch Recorder to use fastxtend metrics.

Metrics

Custom Metric Creation

func_to_metric[source]

func_to_metric(func, metric_type, is_class, thresh=None, axis=-1, activation=None, log_metric=<LogMetric.Valid: 2>, dim_argmax=None, name=None)

Convert func metric to a fastai metric

This is the quickest way to use a functional metric as a fastxtend metric.

metric_type is one of MetricType.Avg, MetricType.Accum, or MetricType.Smooth which set the metric to use AvgMetricX, AccumMetricX, or AvgSmoothMetricX, respectively.

is_class indicates if you are in a classification problem or not. In this case:

  • leaving thresh to None indicates it's a single-label classification problem and predictions will pass through an argmax over axis before being compared to the targets
  • setting a value for thresh indicates it's a multi-label classification problem and predictions will pass through a sigmoid (can be deactivated with sigmoid=False) and be compared to thresh before being compared to the targets

If is_class=False, it indicates you are in a regression problem, and predictions are compared to the targets without being modified. In all cases, kwargs are extra keyword arguments passed to func.
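
For instance, a multi-label variant of the earlier example could be built by setting thresh (a sketch, mirroring the ExampleAccuracy factory above):

def ExampleAccuracyMulti(thresh=0.5, log_metric=LogMetric.Valid, **kwargs):
    return func_to_metric(example_accuracy, MetricType.Avg, True, thresh=thresh, log_metric=log_metric, **kwargs)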

skm_to_fastxtend[source]

skm_to_fastxtend(func, is_class=True, thresh=None, axis=-1, activation=None, log_metric=<LogMetric.Valid: 2>, dim_argmax=None, name=None)

Convert func from sklearn.metrics to a fastai metric

This is the quickest way to use a scikit-learn metric using fastxtend metrics. It is the same as func_to_metric except it defaults to using AccumMetricX.
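
For example, a metric equivalent to the built-in BalancedAccuracy could be sketched as:

import sklearn.metrics as skm

# Wraps the scikit-learn function in an AccumMetricX, converting and reordering arguments automatically
example_balanced_accuracy = skm_to_fastxtend(skm.balanced_accuracy_score)
Learner(..., metrics=example_balanced_accuracy)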

Single-label classification

Accuracy[source]

Accuracy(axis=-1, metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Compute accuracy with targ when pred is bs * n_classes

ErrorRate[source]

ErrorRate(axis=-1, metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Compute 1 - accuracy with targ when pred is bs * n_classes

TopKAccuracy[source]

TopKAccuracy(k=5, axis=-1, metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Computes the Top-k accuracy (targ is in the top k predictions of inp)

APScoreBinary[source]

APScoreBinary(axis=-1, average='macro', pos_label=1, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Average Precision for single-label binary classification problems

See the scikit-learn documentation for more details.

BalancedAccuracy[source]

BalancedAccuracy(axis=-1, sample_weight=None, adjusted=False, log_metric=<LogMetric.Valid: 2>, **kwargs)

Balanced Accuracy for single-label binary classification problems

See the scikit-learn documentation for more details.

BrierScore[source]

BrierScore(axis=-1, sample_weight=None, pos_label=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Brier score for single-label classification problems

See the scikit-learn documentation for more details.

CohenKappa[source]

CohenKappa(axis=-1, labels=None, weights=None, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Cohen kappa for single-label classification problems

See the scikit-learn documentation for more details.

F1Score[source]

F1Score(axis=-1, labels=None, pos_label=1, average='binary', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

F1 score for single-label classification problems

See the scikit-learn documentation for more details.
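
For a multiclass problem, change average from its binary default; the keyword is passed through to scikit-learn:

Learner(..., metrics=F1Score(average='macro'))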

FBeta[source]

FBeta(beta, axis=-1, labels=None, pos_label=1, average='binary', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

FBeta score with beta for single-label classification problems

See the scikit-learn documentation for more details.

HammingLoss[source]

HammingLoss(axis=-1, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Hamming loss for single-label classification problems

See the scikit-learn documentation for more details.

Jaccard[source]

Jaccard(axis=-1, labels=None, pos_label=1, average='binary', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Jaccard score for single-label classification problems

See the scikit-learn documentation for more details.

Precision[source]

Precision(axis=-1, labels=None, pos_label=1, average='binary', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Precision for single-label classification problems

See the scikit-learn documentation for more details.

Recall[source]

Recall(axis=-1, labels=None, pos_label=1, average='binary', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Recall for single-label classification problems

See the scikit-learn documentation for more details.

RocAuc[source]

RocAuc(axis=-1, average='macro', sample_weight=None, max_fpr=None, multi_class='ovr', log_metric=<LogMetric.Valid: 2>, **kwargs)

Area Under the Receiver Operating Characteristic Curve for single-label multiclass classification problems

See the scikit-learn documentation for more details.

RocAucBinary[source]

RocAucBinary(axis=-1, average='macro', sample_weight=None, max_fpr=None, multi_class='raise', log_metric=<LogMetric.Valid: 2>, **kwargs)

Area Under the Receiver Operating Characteristic Curve for single-label binary classification problems

See the scikit-learn documentation for more details.

MatthewsCorrCoef[source]

MatthewsCorrCoef(sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Matthews correlation coefficient for single-label classification problems

See the scikit-learn documentation for more details.

Multi-label classification

AccuracyMulti[source]

AccuracyMulti(thresh=0.5, sigmoid=True, metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Compute accuracy when inp and targ are the same size.
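
For example, multi-label accuracy with a custom decision threshold (sigmoid=True applies a sigmoid to the predictions first):

Learner(..., metrics=AccuracyMulti(thresh=0.4))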

APScoreMulti[source]

APScoreMulti(sigmoid=True, average='macro', pos_label=1, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Average Precision for multi-label classification problems

See the scikit-learn documentation for more details.

BrierScoreMulti[source]

BrierScoreMulti(thresh=0.5, sigmoid=True, sample_weight=None, pos_label=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Brier score for multi-label classification problems

See the scikit-learn documentation for more details.

F1ScoreMulti[source]

F1ScoreMulti(thresh=0.5, sigmoid=True, labels=None, pos_label=1, average='macro', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

F1 score for multi-label classification problems

See the scikit-learn documentation for more details.

FBetaMulti[source]

FBetaMulti(beta, thresh=0.5, sigmoid=True, labels=None, pos_label=1, average='macro', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

FBeta score with beta for multi-label classification problems

See the scikit-learn documentation for more details.

HammingLossMulti[source]

HammingLossMulti(thresh=0.5, sigmoid=True, labels=None, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Hamming loss for multi-label classification problems

See the scikit-learn documentation for more details.

JaccardMulti[source]

JaccardMulti(thresh=0.5, sigmoid=True, labels=None, pos_label=1, average='macro', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Jaccard score for multi-label classification problems

See the scikit-learn documentation for more details.

MatthewsCorrCoefMulti[source]

MatthewsCorrCoefMulti(thresh=0.5, sigmoid=True, sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Matthews correlation coefficient for multi-label classification problems

See the scikit-learn documentation for more details.

PrecisionMulti[source]

PrecisionMulti(thresh=0.5, sigmoid=True, labels=None, pos_label=1, average='macro', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Precision for multi-label classification problems

See the scikit-learn documentation for more details.

RecallMulti[source]

RecallMulti(thresh=0.5, sigmoid=True, labels=None, pos_label=1, average='macro', sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Recall for multi-label classification problems

See the scikit-learn documentation for more details.

RocAucMulti[source]

RocAucMulti(sigmoid=True, average='macro', sample_weight=None, max_fpr=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Area Under the Receiver Operating Characteristic Curve for multi-label binary classification problems

See the scikit-learn documentation for more details.

Regression

MSE[source]

MSE(metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Mean squared error between inp and targ.

RMSE[source]

RMSE(log_metric=<LogMetric.Valid: 2>, **kwargs)

Root mean squared error between inp and targ.

MAE[source]

MAE(metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Mean absolute error between inp and targ.
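
The regression metrics are used like any other fastxtend metric, for example:

Learner(..., metrics=[MSE(), RMSE(), MAE()])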

MSLE[source]

MSLE(metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Mean squared logarithmic error between inp and targ.

ExpRMSE[source]

ExpRMSE(log_metric=<LogMetric.Valid: 2>, **kwargs)

Root mean square percentage error of the exponential of predictions and targets

ExplainedVariance[source]

ExplainedVariance(sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Explained variance between predictions and targets

See the scikit-learn documentation for more details.

R2Score[source]

R2Score(sample_weight=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

R2 score between predictions and targets

See the scikit-learn documentation for more details.

PearsonCorrCoef[source]

PearsonCorrCoef(dim_argmax=None, log_metric=<LogMetric.Valid: 2>, **kwargs)

Pearson correlation coefficient for regression problem

See the scipy documentation for more details.

SpearmanCorrCoef[source]

SpearmanCorrCoef(dim_argmax=None, axis=0, nan_policy='propagate', log_metric=<LogMetric.Valid: 2>, **kwargs)

Spearman correlation coefficient for regression problem

See the scipy documentation for more details.

Segmentation

ForegroundAcc[source]

ForegroundAcc(bkg_idx=0, axis=1, metric_type=<MetricType.Avg: 1>, log_metric=<LogMetric.Valid: 2>, **kwargs)

Computes non-background accuracy for multiclass segmentation

class Dice[source]

Dice(axis=1, log_metric=<LogMetric.Valid: 2>, **kwargs) :: MetricX

Dice coefficient metric for binary target in segmentation

class DiceMulti[source]

DiceMulti(axis=1, log_metric=<LogMetric.Valid: 2>, **kwargs) :: MetricX

Averaged Dice metric (Macro F1) for multiclass target in segmentation

The DiceMulti method implements the "Averaged F1: arithmetic mean over harmonic means" described in this publication: https://arxiv.org/pdf/1911.03347.pdf
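
A sketch of using the segmentation metrics with fastai's unet_learner (dls is assumed to be a segmentation DataLoaders):

# Non-background accuracy plus multiclass Dice on a segmentation task
unet_learner(dls, resnet34, metrics=[ForegroundAcc(), DiceMulti()])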

class JaccardCoeff[source]

JaccardCoeff(axis=1, log_metric=<LogMetric.Valid: 2>, **kwargs) :: Dice

Implementation of the Jaccard coefficient that is lighter in RAM

NLP

class CorpusBLEUMetric[source]

CorpusBLEUMetric(vocab_sz=5000, axis=-1, log_metric=<LogMetric.Valid: 2>, name='CorpusBLEU', **kwargs) :: MetricX

BLEU Metric calculated over the validation corpus

The BLEU metric was introduced in this article to come up with a way to evaluate the performance of translation models. It's based on the precision of n-grams in your prediction compared to your target. See the fastai NLP course BLEU notebook for a more detailed description of BLEU.

The smoothing used in the precision calculation is the same as in SacreBLEU, which in turn is "method 3" from the Chen & Cherry, 2014 paper.
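
A sketch of using CorpusBLEUMetric, assuming vocab_sz is set to the size of the target vocabulary:

Learner(..., metrics=CorpusBLEUMetric(vocab_sz=5000))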

class Perplexity[source]

Perplexity(dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: AvgLossX

Perplexity (exponential of cross-entropy loss) for Language Models

class LossMetric[source]

LossMetric(func, to_np=False, invert_arg=False, dim_argmax=None, activation=<ActivationType.No: 1>, thresh=None, log_metric=None, name=None) :: AvgMetricX

Create a metric from loss_func.attr named nm

LossMetrics[source]

LossMetrics(attrs, nms=None)

List of LossMetric for each of attrs and nms
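
A sketch with a hypothetical composite loss that stores its component losses as attributes, which LossMetrics then logs:

class CombinedLoss(Module):
    # Hypothetical loss storing its components as attributes for LossMetrics to read
    def forward(self, pred, targ):
        self.mse = F.mse_loss(pred, targ)
        self.mae = F.l1_loss(pred, targ)
        return self.mse + self.mae

Learner(..., loss_func=CombinedLoss(), metrics=LossMetrics(['mse', 'mae']))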