Open this publication in new window or tab >>2009 (English)In: Proceedings of IEEE Symposium on Computational Intelligence and Data Mining (CIDM), IEEE conference proceedings, 2009, p. 245-251Conference paper, Published paper (Refereed)
Abstract [en]
Both theory and a wealth of empirical studies have established that ensembles are more accurate than single predictive models. Unfortunately, the problem of how to maximize ensemble accuracy is, especially for classification, far from solved. In essence, the key problem is to find a suitable criterion, typically based on training or selection set performance, highly correlated with ensemble accuracy on novel data. Several studies have, however, shown that it is difficult to come up with a single measure, such as ensemble or base classifier selection set accuracy, or some measure based on diversity, that is a good general predictor for ensemble test accuracy. This paper presents a novel technique that for each learning task searches for the most effective combination of given atomic measures, by means of a genetic algorithm. Ensembles built from either neural networks or random forests were empirically evaluated on 30 UCI datasets. The experimental results show that when using the generated combined optimization criteria to rank candidate ensembles, a higher test set accuracy for the top ranked ensemble was achieved, compared to using ensemble accuracy on selection data alone. Furthermore, when creating ensembles from a pool of neural networks, the use of the generated combined criteria was shown to generally outperform the use of estimated ensemble accuracy as the single optimization criterion.
Place, publisher, year, edition, pages
IEEE conference proceedings, 2009
National Category
Computer and Information Sciences
Research subject
Technology
Identifiers
urn:nbn:se:his:diva-3212 (URN)10.1109/CIDM.2009.4938656 (DOI)000271487700035 ()2-s2.0-67650434708 (Scopus ID)978-1-4244-2765-9 (ISBN)
Conference
2009 IEEE Symposium on Computational Intelligence and Data Mining (CIDM 2009) proceedings : March 30-April 2, 2009, Sheraton Music City Hotel, Nashville, TN, USA
2009-06-262009-06-262018-01-13Bibliographically approved