首页   按字顺浏览 期刊浏览 卷期浏览 STATLOG: COMPARISON OF CLASSIFICATION ALGORITHMS ON LARGE REAL-WORLD PROBLEMS
STATLOG: COMPARISON OF CLASSIFICATION ALGORITHMS ON LARGE REAL-WORLD PROBLEMS

 

作者: R. D. KING,   C. FENG,   A. SUTHERLAND,  

 

期刊: Applied Artificial Intelligence  (Taylor Available online 1995)
卷期: Volume 9, issue 3  

页码: 289-333

 

ISSN:0883-9514

 

年代: 1995

 

DOI:10.1080/08839519508945477

 

出版商: Taylor & Francis Group

 

数据来源: Taylor

 

摘要:

This paper describes work in the StatLog project comparing classification algorithms on large real-world problems. The algorithms compared were from symbolic learning (CART. C4.5, NewID, AC2,ITrule, Cal5, CN2), statistics (Naive Bayes, k-nearest neighbor, kernel density, linear discriminant, quadratic discriminant, logistic regression, projection pursuit, Bayesian networks), and neural networks (backpropagation, radial basis functions). Twelve datasets were used: five from image analysis, three from medicine, and two each from engineering and finance. We found that which algorithm performed best depended critically on the data set investigated. We therefore developed a set of data set descriptors to help decide which algorithms are suited to particular data sets. For example, data sets with extreme distributions (skew > l and kurtosis > 7) and with many binary/categorical attributes (>38%) tend to favor symbolic learning algorithms. We suggest how classification algorithms can be extended in a number of directions.

 

点击下载:  PDF (1532KB)



返 回