MINING TAXATION DATA WITH PARALLEL BMARS
作者:
SERGEY BAKlN,
MARKUS HEGLAND,
GRAHAM WILLIAMS,
期刊:
Parallel Algorithms and Applications
(Taylor Available online 2000)
卷期:
Volume 15,
issue 1-2
页码: 37-55
ISSN:1063-7192
年代: 2000
DOI:10.1080/01495730008947349
出版商: Taylor & Francis Group
关键词: Data mining;Parallel algorithms;Multivariate regression;Predictive modeling
数据来源: Taylor
摘要:
A new parallel version of Friedman's Multivariate Adaptive Regression Splines (MARS) algorithm is discussed. By partitioning the data over the processors of a parallel computational system one achieves good parallel efficiency. Instead of using truncated power basis functions of the original MARS, the new method (BMARS) utilises B-sp!ines which improves numerical stability and reduces the computational cost of the procedure. In addition, the coefficients of the basis functions of a BMARS model provide quickly accessible information about the local behaviour of the function. The algorithm has a time complexity proportional to the number of data records. The method provides a new means for the detection of areas in the space of features which are characterised by the "interesting" patterns of response values. This is applied to searching for classes of incorrect tax returns using multiple predictor variables or features. The parallel algorithm makes it feasible to investigate very large databases, such as the taxation database.
点击下载:
PDF (598KB)
返 回