Skip to main content
Engineering, Data / ML

Blazing Fast OLAP on Uber’s Inventory and Catalog Data with Apache Pinot™

9 December / Global
Featured image for Blazing Fast OLAP on Uber’s Inventory and Catalog Data with Apache Pinot™
Image
Figure 1: The different INCA entities and their relationships. 
Image
Figure 2: High-level data ingestion pipeline that sinks the data to multiple Kafka topics, each of which is ingested into corresponding Pinot tables.
Image
Figure 3: Pinot Server CPU profile during high CPU events around Asia morning hours.
Image
Figure 4: Latency reduction after upgrading from JRE-11 to JRE-17 (both setups used G1GC).
Image
Figure 5: ~70% reduction in segment count, from 74,000 down to 22,000.
Image
Figure 6:  40% reduction in table size (peak table size reduced from 42TB to 24TB). 
Image
Figure 7: 75% reduction in p99 query latency (1150ms to 269ms).
Suraj Modi

Suraj Modi

Suraj Modi is a Staff Engineer on the Catalog Platform team and the Tech Lead for Catalog Serving.

Ankit Sultana

Ankit Sultana

Ankit Sultana is a PMC Member for Apache Pinot and a Staff Engineer at Uber.

Tarun Mavani

Tarun Mavani

Tarun Mavani is a Software Engineer II at Uber.

Posted by Suraj Modi, Ankit Sultana, Tarun Mavani