sklearn.metrics.rand_score#

sklearn.metrics.rand_score(labels_true, labels_pred)[source]#

Rand index.

The Rand Index computes a similarity measure between two clusterings by considering all pairs of samples and counting pairs that are assigned in the same or different clusters in the predicted and true clusterings [1] [2].

The raw RI score [3] is:

RI = (number of agreeing pairs) / (number of pairs)

Read more in the User Guide.

Parameters:
labels_truearray-like of shape (n_samples,), dtype=integral

Ground truth class labels to be used as a reference.

labels_predarray-like of shape (n_samples,), dtype=integral

Cluster labels to evaluate.

Returns:
RIfloat

Similarity score between 0.0 and 1.0, inclusive, 1.0 stands for perfect match.

See also

adjusted_rand_score

Adjusted Rand Score.

adjusted_mutual_info_score

Adjusted Mutual Information.

References

Examples

Perfectly matching labelings have a score of 1 even

>>> from sklearn.metrics.cluster import rand_score
>>> rand_score([0, 0, 1, 1], [1, 1, 0, 0])
1.0

Labelings that assign all classes members to the same clusters are complete but may not always be pure, hence penalized:

>>> rand_score([0, 0, 1, 2], [0, 0, 1, 1])
0.83...

Examples using sklearn.metrics.rand_score#

Adjustment for chance in clustering performance evaluation

Adjustment for chance in clustering performance evaluation