Mammography

The original Mammography (Woods et al., 1993) data set was made available by the courtesy of Aleksandar Lazarevic. This dataset is publicly available in openML. It has 11,183 samples with 260 calcifications. If we look at predictive accuracy as a measure of goodness of the classifier for this case, the default accuracy would be 97.68% when every sample is labeled non-calcification. But, it is desirable for the classifier to predict most of the calcifications correctly. For outlier detection, the minority class of calcification is considered as outlier class and the non-calcification class as inliers.

Mammography is available on Aftershock and normal observations are available in the included training dataset consisting of 6 dimensions per observation. During evaluation, the main program of your submission is expected to access /ingress/mammography/testing.csv which has the same form as the development dataset and produce sequentially aligned anomaly confidence values (in [0, 1]) at /egress/mammography/predictions.csv.

Scores

Method Authors AUC Runtime (ms)
Bionic (Pre-Release) K. Demetriou, I. Becker, S. Hailes 0.890 515.930
Baselines - Isolation Forest K. Demetriou, I. Becker, S. Hailes 0.883 112.742
Baselines - One Class SVM K. Demetriou, I. Becker, S. Hailes 0.796 188.492
Baselines - Robust Covariance K. Demetriou, I. Becker, S. Hailes 0.724 0.514
Baselines - Local Outlier Factor K. Demetriou, I. Becker, S. Hailes 0.576 103.513