Skip to main content
Data / ML, Engineering

One Stone, Three Birds: Finer-Grained Encryption @ Apache Parquet™

March 10, 2022 / Global
Featured image for One Stone, Three Birds:  Finer-Grained Encryption @ Apache Parquet™
Figure 1: The system architecture of Encryption
Figure 2: The conceptual flow of schema-controlled Parquet encryption
Figure 3: Tagging mapping to Parquet schema
Xinli Shang

Xinli Shang

Xinli Shang is a Manager on the Uber Big Data Infra team, Apache Parquet PMC Chair, Presto Commmiter, and Uber Open Source Committee member. He is leading the Apache Parquet community and contributing to several other communities. He is also leading several initiatives on data format for storage efficiency, security, and performance. He is also passionate about tuning large-scale services for performance, throughput, and reliability.

Mohammad Islam

Mohammad Islam

Mohammad Islam is a Distinguished Engineer at Uber. He co-leads the Data cost-efficiency effort and also leads Data Security and compliance efforts. He is an Apache Oozie and Tez PMC member.

Pavi Subenderan

Pavi Subenderan

Pavi Subenderan is a Senior Software Engineer on Uber’s Data Infra team. His area of focus is on Parquet, Encryption, and Security. At Uber, he has worked on Parquet encryption and Security/Compliance for the past 3 years.

Jianchun Xu

Jianchun Xu

Jianchun Xu is a Staff Software Engineer on Uber's Data Infra team. He mainly works on big data infra and data security. He also has extensive experience in service deployment platforms, developer tools, and web/JavaScript engines.

Posted by Xinli Shang, Mohammad Islam, Pavi Subenderan, Jianchun Xu