Multivariate statistical tests for comparing classification algorithms

Yıldız, Olcay Taner; Aslan, Özlem; Alpaydın, Ahmet İbrahim Ethem

Göster/Aç

Publisher's Version (790.0Kb)

Tarih

2011

Yazar

Yıldız, Olcay Taner
Aslan, Özlem
Alpaydın, Ahmet İbrahim Ethem

Üst veri

Tüm öğe kaydını göster

Künye

Yıldız O.T., Aslan Ö. & Alpaydın A. İ. E. (2011). Multivariate Statistical Tests for Comparing Classification Algorithms. In: Coello C.A.C. (eds) Learning and Intelligent Optimization. LION 2011. Paper present at the Lecture Notes in Computer Science, 6683, 1-15. doi:10.1007/978-3-642-25566-3_1

Özet

The misclassification error which is usually used in tests to compare classification algorithms, does not make a distinction between the sources of error, namely, false positives and false negatives. Instead of summing these in a single number, we propose to collect multivariate statistics and use multivariate tests on them. Information retrieval uses the measures of precision and recall, and signal detection uses true positive rate (tpr) and false positive rate (fpr) and a multivariate test can also use such two values instead of combining them in a single value, such as error or average precision. For example, we can have bivariate tests for (precision, recall) or (tpr, fpr). We propose to use the pairwise test based on Hotelling's multivariate T test to compare two algorithms or multivariate analysis of variance (MANOVA) to compare L > 2 algorithms. In our experiments, we show that the multivariate tests have higher power than the univariate error test, that is, they can detect differences that the error test cannot, and we also discuss how the decisions made by different multivariate tests differ, to be able to point out where to use which. We also show how multivariate or univariate pairwise tests can be used as post-hoc tests after MANOVA to find cliques of algorithms, or order them along separate dimensions.

Kaynak

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Cilt

6683

Bağlantı

https://hdl.handle.net/11729/1939
https://dx.doi.org/10.1007/978-3-642-25566-3_1