Analysis on Privacy Preserving Clustering Methods for Big Datasets Using Random Number Generators

Main Article Content

Kavitha Guda, K. Kavitha, B. Sujatha

Abstract

Due to the growing requirement for powerful processing capabilities, safeguarding and processing sensitive data in big data systems has become an increasingly prevalent issue. Additional effort is required to secure sensitive data on a system with numerous connections, each of which has its own privacy policy, and to exchange cryptographic keys in accordance with the appropriate protocols for each of these parties. Data encrypted using homomorphic encryption algorithms can undergo the same kinds of arithmetic operations as data encrypted using plaintext. This allows for the outsourcing of computations to cloud services while also providing data privacy by processing in the encrypted domain. Also, key exchange sessions become easier for everyone involved because of this. This study presents new privacy-preserving clustering algorithms and homomorphic encryption systems that may together operate on a shared HPC infrastructure, like the cloud. Therefore, it is not necessary for the participants in this system to have a lot of processing power, as the system would distribute the most power-intensive jobs to any provider of cloud computing. Multiple clustering techniques can have their distance matrices computed in a way that does not compromise user privacy using our technology.

Article Details

Section
Articles
Author Biography

Kavitha Guda, K. Kavitha, B. Sujatha

[1]Kavitha Guda

2Dr. K. Kavitha

3Dr. B. Sujatha

 

[1]Research Scholar, Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, Tamil Nadu

2Associate Professor, Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, Tamil Nadu

3Assistant Professor, Department of Computer Science and Engineering, University College of Engineering(a), Osmania University, Hyderabad

Copyright © JES 2024 on-line : journal.esrgroups.org

References

Wieringa, J.; Kannan, P.K.; Ma, X.; Reutterer, T.; Risselada, H.; Skiera, B. Data analytics in a privacy-concerned world. J. Bus. Res. 2021, 122, 915–925. [Google Scholar] [CrossRef]

Sweeney, L. Simple demographics often identify people uniquely. Health 2000, 671, 1–34. [Google Scholar]

Majeed, A.; Lee, S. Anonymization techniques for privacy preserving data publishing: A comprehensive survey. IEEE Access 2020, 9, 8512–8545. [Google Scholar] [CrossRef]

Sweeney, L. k-anonymity: A model for protecting privacy. Int. J. Uncertain. Fuzziness-Knowl.-Based Syst. 2002, 10, 557–570.

Chaudhuri, K.; Monteleoni, C. Privacy-preserving logistic regression. In Advances in Neural Information Processing Systems 21; Koller, D., Schuurmans, D., Bengio, Y., Bottou, L., Eds.; Curran Associates, Inc.: Dutchess County, NY, USA, 2009; pp. 289–296. [Google Scholar]

Agrawal, R.; Srikant, R. Privacy-preserving Data Mining. SIGMOD Rec. 2000, 29, 439–450. [Google Scholar] [CrossRef]

Xu, K.; Yue, H.; Guo, L.; Guo, Y.; Fang, Y. Privacy-Preserving Machine Learning Algorithms for Big Data Systems. In Proceedings of the 2015 IEEE 35th International Conference on Distributed Computing Systems, Columbus, OH, USA, 29 June–2 July 2015; pp. 318–327. [Google Scholar] [CrossRef]

Merugu, S.; Ghosh, J. Privacy-preserving distributed clustering using generative models. In Proceedings of the Third IEEE International Conference on Data Mining, Melbourne, FL, USA, 22 November 2003; pp. 211–218. [Google Scholar] [CrossRef]

Shokri, R.; Shmatikov, V. Privacy-Preserving Deep Learning. In Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security, CCS ’15, Denver, CO, USA, 12–16 October 2015; pp. 1310–1321. [Google Scholar] [CrossRef]

Yang, Z.; Zhong, S.; Wright, R.N. Privacy-Preserving Classification of Customer Data without Loss of Accuracy. In Proceedings of the 2005 SIAM International Conference on Data Mining, Newport Beach, CA, USA, 21–23 April 2005; pp. 92–102. [Google Scholar] [CrossRef] [Green Version]

Emekci, F.; Sahin, O.; Agrawal, D.; Abbadi, A.E. Privacy preserving decision tree learning over multiple parties. Data Knowl. Eng. 2007, 63, 348–361.