A Comparative Analysis of Predictive Data-Mining Techniques

Date Issued
January 1, 2009
Author(s)
Li, Xueping  
DOI
https://doi.org/10.1504/IJRAPIDM.2009.029380
Link to full text
https://doi.org/10.1504/IJRAPIDM.2009.029380
Permanent URI
https://trace.tennessee.edu/handle/20.500.14382/47455
Abstract

It is non-trivial to select the appropriate prediction technique for a dataset from the variety of existing techniques, since the competitive evaluation of techniques (bagging, boosting, stacking and meta-learning) can be time-consuming. This paper compares five predictive data-mining techniques on four unique datasets that have a combination of the following characteristics: few predictor variables, many predictor variables, highly collinear variables, very redundant variables and the presence of outliers. The techniques, namely multiple linear regression (MLR), principal component regression (PCR), ridge regression, partial least squares (PLS) and non-linear partial least squares (NLPLS), are applied to each of the datasets. The comparisons are based on several criteria: R-square, adjusted R-square, mean square error (MSE), mean absolute error (MAE), coefficient of efficiency, condition number (CN) and the number of variables or features included in the model. The advantages and disadvantages of the techniques are discussed and summarised.
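
The comparison criteria named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the synthetic data, the helper names (`fit_mlr`, `fit_ridge`, `predict`, `criteria`), the ridge penalty `alpha=1.0`, and the exact metric definitions are all assumptions made for illustration. In particular, MSE is divided by n, the coefficient of efficiency uses an absolute-error form, and CN is taken as the 2-norm condition number of the predictor matrix; the paper may define these differently. Only two of the five techniques (MLR and ridge regression) are shown.

```python
import numpy as np

def fit_mlr(X, y):
    """Multiple linear regression with an intercept, solved by least squares."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef  # [intercept, slopes...]

def fit_ridge(X, y, alpha):
    """Closed-form ridge regression on centered data (intercept not penalized)."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    beta = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(X.shape[1]), Xc.T @ yc)
    return np.concatenate([[y_mean - x_mean @ beta], beta])

def predict(coef, X):
    return coef[0] + X @ coef[1:]

def criteria(y, y_hat, n_predictors):
    """Evaluation criteria under the assumed definitions stated in the lead-in."""
    n = len(y)
    resid = y - y_hat
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    r2 = 1.0 - ss_res / ss_tot
    return {
        "r2": r2,
        "r2_adj": 1.0 - (1.0 - r2) * (n - 1) / (n - n_predictors - 1),
        "mse": ss_res / n,
        "mae": float(np.abs(resid).mean()),
        # Assumed absolute-error form of the coefficient of efficiency.
        "coef_efficiency": 1.0 - float(np.abs(resid).sum())
        / float(np.abs(y - y.mean()).sum()),
    }

# Hypothetical dataset: x2 is almost a copy of x1 (high collinearity).
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = x1 + 1e-3 * rng.normal(size=n)
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])
y = 2.0 * x1 + 0.5 * x3 + 0.1 * rng.normal(size=n)

cn = np.linalg.cond(X)  # ratio of largest to smallest singular value of X
results = {
    "MLR": criteria(y, predict(fit_mlr(X, y), X), X.shape[1]),
    "ridge": criteria(y, predict(fit_ridge(X, y, alpha=1.0), X), X.shape[1]),
}

if __name__ == "__main__":
    print(f"condition number: {cn:.1f}")
    for name, scores in results.items():
        print(name, {k: round(v, 4) for k, v in scores.items()})
```

The scores here are computed on the training data, where MLR can never have a larger residual sum of squares than ridge regression, so a fair comparison of the kind the abstract describes would score held-out data instead.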

Subjects

predictive data minin...

statistical analysis

knowledge discovery

multiple linear regre...

MLR

principal component r...

PCR

ridge regression

partial least squares...

nonlinear PLS

Recommended Citation
Xueping Li, Godswill Chukwugozie Nsofor and Laigang Song (2009) "A Comparative Analysis of Predictive Data-Mining Techniques", International Journal of Rapid Manufacturing (IJRM), Vol. 1, No. 2, pp. 150-172.
Embargo Date
August 31, 2010
File(s)
Name
A_comparative_analysis_of_predictive_data_mining_techniques.pdf
Size
608.6 KB
Format
Adobe PDF
Checksum (MD5)
24ed3eb739a6de8ab653e70bbaddc8ad