Publication | Closed Access
Opaque: an oblivious and encrypted distributed analytics platform
Citations: 227
References: 24
Year: 2017
Spark SQL, Engineering, Information Security, Data-centric Security, Confidential Computing, Rich Analytics, Hardware Security, Privacy-preserving Communication, Analytics Platform, Data Management, Secure Multi-party Computation, Data Privacy, Private Information Retrieval, Computer Science, Data Security, Cryptography, Cloud Computing, Cloud Cryptography, Data Analytics
Many systems run rich analytics on sensitive data in the cloud, but are prone to data breaches. Hardware enclaves promise data confidentiality and secure execution of arbitrary computation, yet still suffer from access pattern leakage. We propose Opaque, a distributed data analytics platform supporting a wide range of queries while providing strong security guarantees. Opaque introduces new distributed oblivious relational operators that hide access patterns, and new query planning techniques to optimize these new operators. Opaque is implemented on Spark SQL with few changes to the underlying system. Opaque provides data encryption, authentication and computation verification with a performance ranging from 52% faster to 3.3x slower as compared to vanilla Spark SQL; obliviousness comes with a 1.6-46x overhead. Opaque provides an improvement of three orders of magnitude over state-of-the-art oblivious protocols, and our query optimization techniques improve performance by 2-5x.
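To make "hiding access patterns" concrete, the following is a minimal sketch of a bitonic sorting network, a standard data-independent primitive often used as a building block for oblivious operators. It is not taken from Opaque's code; the function name `bitonic_sort` and the recorded `trace` are hypothetical and exist only for illustration. The point is that the sequence of positions touched depends only on the input length, never on the values, so an observer of memory accesses learns nothing about the data.

```python
from typing import List, Tuple


def bitonic_sort(values: List[int]) -> Tuple[List[int], List[Tuple[int, int]]]:
    """Sort with a bitonic network and record every pair of positions touched.

    Illustrative sketch only: the input length must be a power of two, and
    Python's min/max are not constant-time at the hardware level.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a non-zero power of two")

    data = list(values)
    trace: List[Tuple[int, int]] = []  # access pattern visible to an observer

    k = 2
    while k <= n:              # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:          # distance between compared positions
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    trace.append((i, partner))
                    small = min(data[i], data[partner])
                    large = max(data[i], data[partner])
                    # Both slots are always rewritten, whether or not the
                    # values change, so writes reveal nothing either.
                    if (i & k) == 0:   # ascending block
                        data[i], data[partner] = small, large
                    else:              # descending block
                        data[i], data[partner] = large, small
            j //= 2
        k *= 2
    return data, trace


if __name__ == "__main__":
    sorted_a, trace_a = bitonic_sort([3, 1, 4, 1, 5, 9, 2, 6])
    sorted_b, trace_b = bitonic_sort([8, 7, 6, 5, 4, 3, 2, 1])
    print(sorted_a)
    print(trace_a == trace_b)  # same access pattern for different inputs
```

Running the sketch on two different inputs of the same length yields identical traces, which is the property an oblivious operator needs. The cost is extra work compared with a data-dependent sort, which is consistent with the abstract's remark that obliviousness carries an overhead.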