INSPECTOR: Data Provenance Using Intel Processor Trace (PT)

J. Thalheim, P. Bhatotia, C. Fetzer

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Data provenance aims to explain how a computation was performed by recording a trace of its execution. The provenance trace is useful across a wide range of workflows to improve the dependability, security, and efficiency of software systems. In this paper, we present Inspector, a POSIX-compliant data provenance library for shared-memory multithreaded programs. The Inspector library is completely transparent and easy to use: it can replace the pthreads library simply by exchanging the linked libraries, without even recompiling the application code. To achieve this result, we present a parallel provenance algorithm that records control, data, and schedule dependencies using a Concurrent Provenance Graph (CPG). We implemented our algorithm to operate at the compiled binary code level by leveraging a combination of OS-specific mechanisms and the Intel PT ISA extensions recently released as part of the Broadwell micro-architecture. Our evaluation on a multicore platform using applications from multithreaded benchmark suites (PARSEC and Phoenix) shows reasonable provenance overheads for a majority of applications. Lastly, we briefly describe three case studies where the generic interface exported by Inspector is being used to improve the dependability, security, and efficiency of systems. The Inspector library is publicly available for further use in a wide range of other provenance workflows.
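
To make the three dependency kinds named in the abstract concrete, here is a minimal sketch of a provenance graph in Python. It is not Inspector's algorithm or API: the names `Node`, `ProvenanceGraph`, `step`, and `lineage` are hypothetical, events are fed in by hand rather than captured from Intel PT, and treating per-thread program order as the control edge is a simplifying assumption.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """One recorded step of one thread (hypothetical granularity)."""
    thread: int
    seq: int


class ProvenanceGraph:
    """Toy graph with control, data, and schedule edges (illustrative only)."""

    def __init__(self):
        self.edges = defaultdict(set)   # node -> {(kind, predecessor)}
        self._last_step = {}            # thread -> latest node of that thread
        self._last_writer = {}          # address -> node that last wrote it
        self._last_release = {}         # lock -> node that last released it

    def step(self, thread, reads=(), writes=(), acquire=None, release=None):
        """Record one step of `thread` and link it to its predecessors."""
        prev = self._last_step.get(thread)
        node = Node(thread, 0 if prev is None else prev.seq + 1)
        # Control edge (assumption): program order within the same thread.
        if prev is not None:
            self.edges[node].add(("control", prev))
        # Schedule edge: acquiring a lock orders us after its last release.
        if acquire is not None and acquire in self._last_release:
            self.edges[node].add(("schedule", self._last_release[acquire]))
        # Data edge: each read depends on the last writer of that address.
        for addr in reads:
            writer = self._last_writer.get(addr)
            if writer is not None:
                self.edges[node].add(("data", writer))
        for addr in writes:
            self._last_writer[addr] = node
        if release is not None:
            self._last_release[release] = node
        self._last_step[thread] = node
        return node

    def lineage(self, node):
        """Return every node reachable backwards from `node`."""
        seen, stack = set(), [node]
        while stack:
            for _kind, pred in self.edges.get(stack.pop(), ()):
                if pred not in seen:
                    seen.add(pred)
                    stack.append(pred)
        return seen
```

For example, if thread 1 writes `x` and releases lock `m`, and thread 2 then acquires `m` and reads `x`, the second step gets both a schedule edge and a data edge back to the first, and `lineage` of any later step in thread 2 includes both.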
Original language: English
Title of host publication: 2016 IEEE 36th International Conference on Distributed Computing Systems (ICDCS)
Place of Publication: Nara, Japan
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 25-34
Number of pages: 10
ISBN (Electronic): 978-1-5090-1483-5
ISBN (Print): 978-1-5090-1484-2
Publication status: Published - 11 Aug 2016
Event: 36th International Conference on Distributed Computing Systems - Nara, Japan
Duration: 27 Jun 2016 to 30 Jun 2016
https://ieeexplore.ieee.org/document/7536493/

Conference

Conference: 36th International Conference on Distributed Computing Systems
Abbreviated title: ICDCS 2016
Country: Japan
City: Nara
Period: 27/06/16 to 30/06/16
