At Uber, we leverage large geospatial datasets to power our marketplace algorithms, leading to more accurate ETAs, dynamic pricing, and better routes for users on our platform.
Deriving insights from data in the Uber marketplace requires analyzing that data across an entire city. Grid systems, which partition geographic areas into identifiable cells and facilitate the exploration of data at a fine granularity, are critical to this effort. When Uber designed a grid system to bucket geospatial events, we looked to the hexagon, a shape that enabled us to minimize quantization error and easily approximate radii.
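To make the idea of bucketing events into hexagonal cells concrete, here is a minimal sketch in Python. It is not H3 and does not use the H3 API: it assumes events have already been projected to flat x/y coordinates in meters, and it uses a single-resolution planar grid of pointy-top hexagons addressed by axial (q, r) coordinates. The function names (`point_to_cell`, `hex_disk`, `count_events_by_cell`) and the 100-meter cell size are hypothetical choices made for illustration.

```python
import math
from collections import Counter


def _round_axial(q, r):
    """Round fractional axial coordinates to the nearest hexagon."""
    x, z = q, r
    y = -x - z  # cube coordinates always satisfy x + y + z == 0
    rx, ry, rz = round(x), round(y), round(z)
    dx, dy, dz = abs(rx - x), abs(ry - y), abs(rz - z)
    # Recompute the component with the largest rounding error so the
    # constraint x + y + z == 0 still holds after rounding.
    if dx > dy and dx > dz:
        rx = -ry - rz
    elif dy > dz:
        ry = -rx - rz
    else:
        rz = -rx - ry
    return rx, rz


def point_to_cell(x, y, size):
    """Map a planar point to the axial (q, r) cell that contains it.

    `size` is the center-to-corner distance of each hexagon (assumed meters).
    """
    q = (math.sqrt(3) / 3 * x - y / 3) / size
    r = (2 / 3 * y) / size
    return _round_axial(q, r)


def cell_center(cell, size):
    """Return the planar center of an axial cell."""
    q, r = cell
    return size * math.sqrt(3) * (q + r / 2), size * 1.5 * r


def hex_disk(center, k):
    """All cells within k grid steps of `center`, a hexagonal stand-in for a circle."""
    cq, cr = center
    cells = []
    for dq in range(-k, k + 1):
        for dr in range(max(-k, -dq - k), min(k, -dq + k) + 1):
            cells.append((cq + dq, cr + dr))
    return cells


def count_events_by_cell(events, size):
    """Bucket (x, y) events into cells and count the events in each one."""
    return Counter(point_to_cell(x, y, size) for x, y in events)


if __name__ == "__main__":
    # Hypothetical events in projected meters.
    events = [(0.0, 0.0), (10.0, 5.0), (260.0, -150.0), (255.0, -140.0)]
    print(count_events_by_cell(events, 100.0))
    print(len(hex_disk((0, 0), 2)), "cells within 2 steps of the origin")
```

Every neighbor of a hexagon sits at the same distance from its center, which is why a disk of k rings is a reasonable approximation of a radius. A square grid has no such property, since its diagonal neighbors are farther away than its edge neighbors.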
In 2018, Uber open sourced H3, our hexagonal grid system for visualizing geospatial data, enabling the broad adoption of this powerful and easy-to-use software. In the videos below, members of our engineering and data science teams discuss how to most effectively use H3 for geospatial indexing:
Engineering Sub-City Geos for a Hyper-Local Marketplace with Uber
Grid systems let analysts abstract away the messiness of large-scale data sets so that mathematical computations become straightforward. In this presentation, Nick Rabinowitz discusses common grid operations and outlines how H3 leverages hexagons to easily aggregate and explore geospatial data from the Uber platform.
Engineering an H3-based Geospatial Data Platform at Uber
Uber leverages clusters of hexagonal cells to engineer our H3-based geospatial platform. Ankit Mehta explains why we chose hexagonal clusters over other cluster shapes and highlights how our Marketplace Intelligence team is building an efficient and scalable H3-based geospatial data platform to facilitate improved user experiences.
Building City Cores with H3
City cores are the geospatial areas where our ridesharing business is densest. They were previously drawn by hand, a time-intensive process that required local knowledge of each city. Marie-Camille Achard and Camilla Nawaz discuss how H3 enables us to easily understand at scale where city cores are located.
Interested in learning more about H3? Check out the project’s official page and try it out for yourself!
If building visualization tools to explore large-scale geospatial data sets interests you, consider applying for a role on our team!
Molly Vorwerck is the Eng Blog Lead and a senior program manager on Uber's Tech Brand Team, responsible for overseeing the company's technical narratives and content production. In a previous life, Molly worked in journalism and public relations. In her spare time, she enjoys scouring record stores for Elvis Presley records, reading and writing fiction, and watching The Great British Baking Show.