Maximizing the Area under the ROC Curve with Decision Lists and Rule Sets
University of Skövde, School of Humanities and Informatics. University of Skövde, The Informatics Research Centre.
2007 (English). In: Proceedings of the 7th SIAM International Conference on Data Mining, Society for Industrial and Applied Mathematics, 2007, p. 27-34. Conference paper (Refereed)
Abstract [en]

Decision lists (or ordered rule sets) have two attractive properties compared to unordered rule sets: they require a simpler classification procedure and they allow for a more compact representation. However, it is an open question what effect these properties have on the area under the ROC curve (AUC). Two ways of forming decision lists are considered in this study: by generating a sequence of rules, with a default rule for one of the classes, and by imposing an order upon rules that have been generated for all classes. An empirical investigation shows that the latter method gives a significantly higher AUC than the former, demonstrating that the compactness obtained by using one of the classes as a default is indeed associated with a cost. Furthermore, by using all applicable rules rather than only the first in an ordered set, a further significant improvement in AUC is obtained, demonstrating that the simple classification procedure is also associated with a cost. The observed gains in AUC for unordered rule sets compared to decision lists can be explained by the fact that learning rules for all classes and combining multiple rules allow examples to be ranked on a more fine-grained scale than applying rules in a fixed order with a default rule for one of the classes.
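
The ranking-granularity argument in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm or experimental setup: the three rules, their scores, the default score, and the averaging scheme for combining applicable rules are all hypothetical choices made for illustration only. The sketch scores every example once with the first applicable rule (decision list) and once with all applicable rules (unordered rule set), then counts the distinct score levels each scheme can produce.

```python
from itertools import product

# Hypothetical rules: (condition, estimated probability of the positive class).
RULES = [
    (lambda x: x["a"] == 1, 0.9),
    (lambda x: x["b"] == 1, 0.7),
    (lambda x: x["c"] == 1, 0.2),
]
DEFAULT = 0.1  # hypothetical score used when no rule applies


def score_first(x):
    """Decision list: use only the first applicable rule, else the default."""
    for cond, p in RULES:
        if cond(x):
            return p
    return DEFAULT


def score_all(x):
    """Unordered rule set: average the scores of all applicable rules.

    Averaging is an assumed combination scheme chosen for simplicity.
    """
    ps = [p for cond, p in RULES if cond(x)]
    return sum(ps) / len(ps) if ps else DEFAULT


def auc(scores, labels):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Tied pairs count as half a correct ranking.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# All eight combinations of three binary features.
examples = [dict(a=a, b=b, c=c) for a, b, c in product([0, 1], repeat=3)]
first_scores = [score_first(x) for x in examples]
all_scores = [score_all(x) for x in examples]

print(len(set(first_scores)), "distinct scores with the first applicable rule")
print(len(set(all_scores)), "distinct scores with all applicable rules")
```

With these toy rules, the first-rule scheme can only assign one score per rule plus the default, whereas combining rules separates examples covered by different rule subsets. Since ties between a positive and a negative example contribute only half a correct pair in `auc`, a coarser scale leaves fewer opportunities to rank such pairs correctly. Whether AUC actually improves depends on the data and the quality of the rules.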

Place, publisher, year, edition, pages
Society for Industrial and Applied Mathematics, 2007. p. 27-34
Research subject
Technology
Identifiers
URN: urn:nbn:se:his:diva-2096
ISI: 000289220200003
Scopus ID: 2-s2.0-70449372884
ISBN: 978-0-898716-30-6
OAI: oai:DiVA.org:his-2096
DiVA: diva2:32372
Conference
7th SIAM International Conference on Data Mining; Minneapolis, MN; 26 April 2007 through 28 April 2007
Available from: 2008-05-30 Created: 2008-05-30 Last updated: 2013-03-18

Open Access in DiVA

No full text

Other links

Scopus
http://www.siam.org/proceedings/datamining/2007/dm07_003bostrom.pdf

Search in DiVA

By author/editor
Boström, Henrik
By organisation
School of Humanities and Informatics
The Informatics Research Centre
