Abstract
Real-time processing of user data streams in online services inadvertently creates tension between users and analysts: users want stronger privacy, while analysts want higher-utility data analytics in real time. To resolve this tension, this paper describes the design, implementation, and evaluation of PRIVAPPROX, a data analytics system for privacy-preserving stream processing.
PRIVAPPROX provides three important properties: (i) Privacy: a zero-knowledge privacy guarantee for users, a privacy bound tighter than state-of-the-art differential privacy; (ii) Utility: an interface for data analysts to systematically explore the trade-offs between output accuracy (with error estimation) and the query execution budget; (iii) Latency: near real-time stream processing based on a scalable “synchronization-free” distributed architecture. The key idea behind PRIVAPPROX is to combine two techniques: sampling (used for approximate computation) and randomized response (used for privacy-preserving analytics). The two are complementary: together they achieve stronger privacy guarantees and also improve the performance of stream analytics.
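To make the combination of sampling and randomized response concrete, the following is a minimal Python sketch for a single yes/no query. It is an illustration under stated assumptions, not the PRIVAPPROX implementation: the function names, the sampling probability `s`, and the two coin-flip probabilities `p` and `q` are hypothetical and do not come from the abstract.

```python
import random


def client_answer(truth, s, p, q, rng):
    """Return None if the client is not sampled, else a randomized yes/no answer.

    s, p and q are illustrative parameters (assumptions, not from the paper).
    """
    if rng.random() >= s:      # sampling: the client skips the query with probability 1 - s
        return None
    if rng.random() < p:       # first coin: answer truthfully with probability p
        return truth
    return rng.random() < q    # second coin: otherwise answer "yes" with probability q


def estimate_yes_count(answers, s, p, q):
    """Estimate how many clients in the whole population truly hold "yes"."""
    responses = [a for a in answers if a is not None]
    n = len(responses)
    observed_yes = sum(responses)
    # E[observed_yes] = p * (true "yes" among respondents) + (1 - p) * q * n
    yes_in_sample = (observed_yes - (1 - p) * q * n) / p
    # Undo the sampling step by scaling the sample estimate up to the population.
    return yes_in_sample / s


def simulate(population=100_000, true_fraction=0.3, s=0.6, p=0.7, q=0.5, seed=0):
    """Run one synthetic round and return (estimate, actual number of "yes")."""
    rng = random.Random(seed)
    cutoff = int(population * true_fraction)
    truths = [i < cutoff for i in range(population)]
    answers = [client_answer(t, s, p, q, rng) for t in truths]
    return estimate_yes_count(answers, s, p, q), sum(truths)
```

Calling `simulate()` returns an estimate alongside the true count for synthetic data. In this sketch, lowering `s` or `p` adds noise to the estimate while revealing less about any individual, which is the kind of accuracy and privacy trade-off the abstract describes.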
Original language | English |
---|---|
Title of host publication | Encyclopedia of Big Data Technologies |
Editors | Sherif Sakr, Albert Zomaya |
Publisher | Springer-Verlag |
Chapter | P |
Pages | 1292-1300 |
Number of pages | 9 |
Edition | 1 |
ISBN (Electronic) | 978-3-319-77525-8 |
ISBN (Print) | 3319775243, 978-3319775241, 978-3-319-77526-5 |
Publication status | Published - 1 Mar 2019 |