Pontus: A memory-efficient and high-accuracy approach for persistence-based item lookup in high-velocity data streams

Weihe Li, Zukai Li, Beyza Bütün, Alec Diallo, Marco Fiore, Paul Patras

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In today's web-scale, data-driven environments, real-time detection of persistent items that consistently recur over time is essential for maintaining system integrity, reliability, and security. Persistent items often signal critical anomalies, such as stealthy DDoS and botnet attacks in web infrastructures. Although various methods exist for identifying such items as well as for determining their frequency, they require recording every item for processing, which is impractical at very high data rates achieved by modern data streams. In this paper, we introduce Pontus, a novel approach that uses an approximate data structure (sketch) specifically designed for the efficient and accurate detection of persistent items. Our method not only achieves fast and precise lookup but is also flexible, allowing for minor modifications to accommodate other types of persistence-based item detection tasks, such as detecting persistent items with low frequency. We rigorously validate our approach through formal methods, offering detailed proofs of time/space complexity and error bounds to demonstrate its theoretical soundness. Our extensive trace-driven evaluations across various persistence-based tasks further demonstrate Pontus's effectiveness in significantly improving detection accuracy and enhancing processing speed compared to existing approaches. We implement Pontus in an experimental platform with industry-grade Intel Tofino switches and demonstrate the practical feasibility of our approach in a real-world memory-constrained environment.
Original languageEnglish
Title of host publicationWWW '25
Subtitle of host publicationProceedings of the ACM on Web Conference 2025
Place of PublicationNew York, NY, United States
PublisherAssociation for Computing Machinery (ACM)
Pages1783-1794
Number of pages12
ISBN (Electronic)9798400712746
DOIs
Publication statusPublished - 22 Apr 2025
EventThe ACM Web Conference 2025 - ICC Sydney: International Convention & Exhibition Centre, Sydney, Australia
Duration: 28 Apr 20252 May 2025
https://www2025.thewebconf.org/

Conference

ConferenceThe ACM Web Conference 2025
Abbreviated titleWWW '25
Country/TerritoryAustralia
CitySydney
Period28/04/252/05/25
Internet address

Keywords / Materials (for Non-textual outputs)

  • data stream processing
  • persistent item lookup
  • probabilistic data structure
  • Tofino programmable switch

Fingerprint

Dive into the research topics of 'Pontus: A memory-efficient and high-accuracy approach for persistence-based item lookup in high-velocity data streams'. Together they form a unique fingerprint.

Cite this