Privacy preserving sentiment analysis on multiple edge data streams with Apache NiFi
Pandya, Abhinay; Kostakos, Panos; Mehmood, Hassan; Cortes, Marta; Gilman, Ekaterina; Oussalah, Mourad; Pirttikangas, Susanna (2020-06-05)
A. Pandya et al., "Privacy preserving sentiment analysis on multiple edge data streams with Apache NiFi," 2019 European Intelligence and Security Informatics Conference (EISIC), Oulu, Finland, 2019, pp. 130-133, doi: 10.1109/EISIC49498.2019.9108851
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
https://rightsstatements.org/vocab/InC/1.0/
https://urn.fi/URN:NBN:fi-fe2020061644570
Tiivistelmä
Abstract
Sentiment analysis, also known as opinion mining, plays a big role in both private and public sector Business Intelligence (BI); it attempts to improve public and customer experience. Nevertheless, de-identified sentiment scores from public social media posts can compromise individual privacy due to their vulnerability to record linkage attacks. Established privacy-preserving methods like k-anonymity, l-diversity and t-closeness are offline models exclusively designed for data at rest. Recently, a number of online anonymization algorithms (CASTLE, SKY, SWAF) have been proposed to complement the functional requirements of streaming applications, but without open-source implementation. In this paper, we present a reusable Apache NiFi dataflow that buffers tweets from multiple edge devices and performs anonymized sentiment analysis in real-time, using randomization. The solution can be easily adapted to suit different scenarios, enabling researchers to deploy custom anonymization algorithms.
Kokoelmat
- Avoin saatavuus [31657]