Publication | Open Access
INSPECTOR: Data Provenance Using Intel Processor Trace (PT)
31
Citations
37
References
2016
Year
Unknown Venue
Cluster ComputingEngineeringComputer ArchitectureInformation ForensicsSoftware AnalysisData ProvenanceParallel ToolData ScienceParallel ComputingData ManagementComputer EngineeringComputer ScienceProvenance AnalysisProgram AnalysisCloud ComputingProvenance ManagementParallel ProgrammingConcurrent Data StructureInspector LibraryConcurrent Provenance GraphProvenance TraceSystem Software
Data provenance strives for explaining how the computation was performed by recording a trace of the execution. The provenance trace is useful across a wide-range of workflows to improve the dependability, security, and efficiency of software systems. In this paper, we present Inspector, a POSIX-compliant data provenance library for shared-memory multithreaded programs. The Inspector library is completely transparent and easy to use: it can be used as a replacement for the pthreads library by a simple exchange of libraries linked, without even recompiling the application code. To achieve this result, we present a parallel provenance algorithm that records control, data, and schedule dependencies using a Concurrent Provenance Graph (CPG). We implemented our algorithm to operate at the compiled binary code level by leveraging a combination of OS-specific mechanisms, and recently released Intel PT ISA extensions as part of the Broadwell micro-architecture. Our evaluation on a multicore platform using applications from multithreaded benchmarks suites (PARSEC and Phoenix) shows reasonable provenance overheads for a majority of applications. Lastly, we briefly describe three case-studies where the generic interface exported by Inspector is being used to improve the dependability, security, and efficiency of systems. The Inspector library is publicly available for further use in a wide range of other provenance workflows.
| Year | Citations | |
|---|---|---|
Page 1
Page 1