Abstract
In today's web-scale, data-driven environments, real-time detection of persistent items that consistently recur over time is essential for maintaining system integrity, reliability, and security. Persistent items often signal critical anomalies, such as stealthy DDoS and botnet attacks in web infrastructures. Although various methods exist for identifying such items as well as for determining their frequency, they require recording every item for processing, which is impractical at very high data rates achieved by modern data streams. In this paper, we introduce Pontus, a novel approach that uses an approximate data structure (sketch) specifically designed for the efficient and accurate detection of persistent items. Our method not only achieves fast and precise lookup but is also flexible, allowing for minor modifications to accommodate other types of persistence-based item detection tasks, such as detecting persistent items with low frequency. We rigorously validate our approach through formal methods, offering detailed proofs of time/space complexity and error bounds to demonstrate its theoretical soundness. Our extensive trace-driven evaluations across various persistence-based tasks further demonstrate Pontus's effectiveness in significantly improving detection accuracy and enhancing processing speed compared to existing approaches. We implement Pontus in an experimental platform with industry-grade Intel Tofino switches and demonstrate the practical feasibility of our approach in a real-world memory-constrained environment.
Original language | English |
---|---|
Title of host publication | WWW '25 |
Subtitle of host publication | Proceedings of the ACM on Web Conference 2025 |
Place of Publication | New York, NY, United States |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1783-1794 |
Number of pages | 12 |
ISBN (Electronic) | 9798400712746 |
DOIs | |
Publication status | Published - 22 Apr 2025 |
Event | The ACM Web Conference 2025 - ICC Sydney: International Convention & Exhibition Centre, Sydney, Australia Duration: 28 Apr 2025 → 2 May 2025 https://www2025.thewebconf.org/ |
Conference
Conference | The ACM Web Conference 2025 |
---|---|
Abbreviated title | WWW '25 |
Country/Territory | Australia |
City | Sydney |
Period | 28/04/25 → 2/05/25 |
Internet address |
Keywords / Materials (for Non-textual outputs)
- data stream processing
- persistent item lookup
- probabilistic data structure
- Tofino programmable switch