The Comparison of Robust Partial Least Squares Regression with Robust Principal Component Regression on a Real Data


POLAT E., GÜNAY S.

11th International Conference of Numerical Analysis and Applied Mathematics (ICNAAM), Yunanistan, 21 - 27 Eylül 2013, cilt.1558, ss.1458-1461 identifier identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 1558
  • Doi Numarası: 10.1063/1.4825793
  • Basıldığı Ülke: Yunanistan
  • Sayfa Sayıları: ss.1458-1461
  • Hacettepe Üniversitesi Adresli: Evet

Özet

One of the problems encountered in Multiple Linear Regression (MLR) is multicollinearity, which causes the overestimation of the regression parameters and increase of the variance of these parameters. Hence, in case of multicollinearity presents, biased estimation procedures such as classical Principal Component Regression (CPCR) and Partial Least Squares Regression (PLSR) are then performed. SIMPLS algorithm is the leading PLSR algorithm because of its speed, efficiency and results are easier to interpret. However, both of the CPCR and SIMPLS yield very unreliable results when the data set contains outlying observations. Therefore, Hubert and Vanden Branden (2003) have been presented a robust PCR (RPCR) method and a robust PLSR (RPLSR) method called RSIMPLS. In RPCR, firstly, a robust Principal Component Analysis (PCA) method for high-dimensional data on the independent variables is applied, then, the dependent variables are regressed on the scores using a robust regression method. RSIMPLS has been constructed from a robust covariance matrix for high-dimensional data and robust linear regression. The purpose of this study is to show the usage of RPCR and RSIMPLS methods on an econometric data set, hence, making a comparison of two methods on an inflation model of Turkey. The considered methods have been compared in terms of predictive ability and goodness of fit by using a robust Root Mean Squared Error of Cross-validation (R-RMSECV), a robust R-2 value and Robust Component Selection (RCS) statistic.