Robust principal component analysis by reverse iterative linear programming


Visentin A., Prestwich S., TARIM Ş. A.

15th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2016, Riva del Garda, İtalya, 19 - 23 Eylül 2016, cilt.9852 LNAI, ss.593-605 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 9852 LNAI
  • Doi Numarası: 10.1007/978-3-319-46227-1_37
  • Basıldığı Şehir: Riva del Garda
  • Basıldığı Ülke: İtalya
  • Sayfa Sayıları: ss.593-605
  • Anahtar Kelimeler: L1-norm, Linear programming, Principal components analysis, Robust
  • Hacettepe Üniversitesi Adresli: Evet

Özet

Principal Components Analysis (PCA) is a data analysis technique widely used in dimensionality reduction. It extracts a small number of orthonormal vectors that explain most of the variation in a dataset, which are called the Principal Components. Conventional PCA is sensitive to outliers because it is based on the L2-norm, so to improve robustness several algorithms based on the L1-norm have been introduced in the literature. We present a new algorithm for robust L1- norm PCA that computes components iteratively in reverse, using a new heuristic based on Linear Programming. This solution is focused on finding the projection that minimizes the variance of the projected points. It has only one parameter to tune, making it simple to use. On common benchmarks it performs competitively compared to other methods. The data and software related to this paper are available at https://github. com/visentin-insight/L1-PCAhp.