Robust principal component analysis by reverse iterative linear programming

15th European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2016, Riva del Garda, İtalya, 19 - 23 Eylül 2016, cilt.9852 LNAI, ss.593-605

Yayın Türü: Bildiri / Tam Metin Bildiri
Cilt numarası: 9852 LNAI
Doi Numarası: 10.1007/978-3-319-46227-1_37
Basıldığı Şehir: Riva del Garda
Basıldığı Ülke: İtalya
Sayfa Sayıları: ss.593-605
Anahtar Kelimeler: L1-norm, Linear programming, Principal components analysis, Robust
Hacettepe Üniversitesi Adresli: Evet

Özet

Principal Components Analysis (PCA) is a data analysis technique widely used in dimensionality reduction. It extracts a small number of orthonormal vectors that explain most of the variation in a dataset, which are called the Principal Components. Conventional PCA is sensitive to outliers because it is based on the L2-norm, so to improve robustness several algorithms based on the L1-norm have been introduced in the literature. We present a new algorithm for robust L1- norm PCA that computes components iteratively in reverse, using a new heuristic based on Linear Programming. This solution is focused on finding the projection that minimizes the variance of the projected points. It has only one parameter to tune, making it simple to use. On common benchmarks it performs competitively compared to other methods. The data and software related to this paper are available at https://github. com/visentin-insight/L1-PCAhp.