rho-Gain: a utility based data publishing model

Vural, Yilmaz; AYDOS, MURAT

doi:10.17341/gazimmfd.416433

rho-Gain: a utility based data publishing model

JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, cilt.33, sa.4, ss.1355-1368, 2018 (SCI-Expanded, Scopus, TRDizin)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 33 Sayı: 4
Basım Tarihi: 2018
Doi Numarası: 10.17341/gazimmfd.416433
Dergi Adı: JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, TR DİZİN (ULAKBİM)
Sayfa Sayıları: ss.1355-1368
Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
Hacettepe Üniversitesi Adresli: Evet

Özet

Data privacy is a difficult problem that tries to find the best balance between the privacy risks of data owners and the utility of data sharing to the third parties. Anonymization is the most commonly applied technique to overcome data privacy problems. The equivalence classes, the natural outcome of anonymization process, are classified according to the data utility in two main categories: Utility and Outlier Equivalence Classes (UEC, OEC). The utility equivalence class contains records that have been suppressed by anonymization techniques for privacy concerns. Meanwhile, the outlier equivalence class contains records that have been fully suppressed by anonymization techniques resulting in no data utility. In this study, rho-Gain model, which focus on the effect of outlier equivalence class for increasing data utility, was proposed. In the proposed model, k-Anonymity and 1- Diversity privacy models were used together with p-iterations to reduce the privacy risks. The Average Equivalence Class metric was used to measure data utility. According to the findings obtained from the study, the rho-Gain model improved the data utility, but did not cause a significant negative impact on privacy risk estimates. With the use of the proposed rho-Gain model as an anonymization technique, we have shown that the data utility has improved while keeping the data privacy risk with no significant change