Skip to main content
Uber logo

Schedule rides in advance

Reserve a rideReserve a ride

Schedule rides in advance

Reserve a rideReserve a ride
Data / ML, Engineering

Pinot Real-Time Ingestion with Cloud Segment Storage

29July,2021 / Global
Featured image for Pinot Real-Time Ingestion with Cloud Segment Storage
Figure 1: UberEats job state breakdown in the past X minutes
Figure 2. Pinot High-Level Architecture
Figure 3: Segment completion protocol in the original Pinot real-time ingestion
Figure 4: Pinot Real Time Ingestion using a deep segment store
Figure 5: Performance of a Pinot real-time ingestion
Figure 6: Modified segment completion protocol with segment upload to the deep store
Figure 7: Ingestion monitoring shows real-time ingestion continued with deep store failure
Figure 8. The Controller’s network traffic is reduced by 10x after turning on the new segment completion protocol
Figure 9: New segment download procedure
Ting Chen

Ting Chen

Ting Chen is a Software Engineer on Uber’s Data team. He is a Tech Lead on the Stream Analytics team whose mission is to provide fast and reliable real-time insights to Uber products and customers. Ting is an Apache Pinot committer.

Subbu Subramaniam

Subbu Subramaniam

Subbu Subramaniam is a Lead Engineer on LinkedIn’s Pinot Team, and an Apache Pinot PPMC member. Subbu has been working on Pinot for 5 years, and is the architect and author of the Realtime Ingestion system in Pinot.

Chinmay Soman

Chinmay Soman

Chinmay Soman is a former Software Engineer on Uber's Data team. He led the Streaming Platform team. The team's mission is to build a scalable platform for all of Uber's messaging, stream processing, and OLAP needs. Chinmay is an Apache Pinot contributor.

Posted by Ting Chen, Subbu Subramaniam, Chinmay Soman