Skip to main content
Uber CareersUber Careers

Sr. Software Engineer - Compute Platform Efficiency & Fleet Management

Engineering
in Palo Alto, California

About the Role

We are looking for a strong developer to drive and shape the technical roadmap & delivery of our Compute platform, powering all of Uber. The person will be a member of the Efficiency and Fleet Management Compute Core Platform team which builds technical solutions to enable diverse jobs of different characteristics to run efficiently and reliably.

What You'll Do

  • Enable stateless services, batch jobs & stateful jobs to all run seamlessly together on the same machine, with SLA guarantees.
  • Build for the next level of scale. Scale the Compute products and make infrastructure cost cheaper.
  • Turn up large scale Infrastructure and remediate problems with automated Infrastructure-as-Code and monitoring/alerting.
  • Perform Linux/Container level resource management as well as Infrastructure capacity management for efficiency through kernel/HW techniques, understanding of microservices, profiling and data analysis of services behavior in production.
  • Be a role model on acting with good judgement and responsibility. Engineers across the organization will look to you on how best to plan and execute on projects.
  • Up the bar on sustainable engineering by improving best practices, producing best in class of code, documentation, testing and monitoring.
  • Be a humble mentor and trusted advisor for both our talented team members and passionate leaders alike. Deal with a difference in opinions in a mature and fair way, helping teams to commit and move forward.

What You'll Need

  • BS/MS/PhD in Computer Science or a related field
  • You have at least 5 years of full time engineering experience.
  • You have knowledge of a backend stack (in C/C++, Python, Go or similar)
  • Distributed Systems, Microservices
  • Linux, Systems, Performance/Profiling/Tracing, Kernel, Language runtimes
  • Infrastructure-as-code/Configuration management, Networking
  • Containers, Orchestration - Mesos/Kubernetes, Concurrency, Process Scheduling

About the Team

The Compute team orchestrates & manages jobs across all the Compute clusters across all geographies (on-prem and public clouds). The Compute platform hosts all kinds of jobs & services - Stateless services, batch jobs (Spark, , etc) and Stateful applications.

The Compute team builds scalable solutions, enable user jobs be deployed anywhere in the world with the touch of a button. We build large scale cluster management solutions and job scheduling software. We build abstractions so that Uber developers do not have to care about infrastructure and infrastructure just works for every application type.

The Efficiency and Fleet Management Compute team has a charter to provide Compute Platform-As-A-Service as well as Computing Efficiency and Fleet lifecycle for Uber Infrastructure as a core platform

We build solutions for Compute Platform to operate the Platform-as-a-service serving thousands of Microservices service owners. In addition, we build technology (OS, Containers, Distributed Systems) to achieve Computation efficiency to achieve a lower cost of capacity for Uber computation infrastructure. Finally, we build and operate technology to manage capacity, metrics, pooling and configuration for the entire Infrastructure fleet.

The Compute Team actively works with Open-Source community and contribute back (Kraken, Makisu). We are at the cutting edge of technology and the Compute team regularly presents at industry conferences.

At Uber, we ignite opportunity by setting the world in motion. We take on big problems to help drivers, riders, delivery partners, and eaters get moving in more than 600 cities around the world.

We welcome people from all backgrounds who seek the opportunity to help build a future where everyone and everything can move independently. If you have the curiosity, passion, and collaborative spirit, work with us, and let's move the world forward, together.