Engineering Manager II - Observability
About the Team
Observability at Uber has evolved far beyond traditional monitoring. We build a centralized, reliable, and intelligent platform spanning metrics, logging, tracing, and on-call experiences - empowering engineers to operate services confidently at massive scale.
Our team owns end-user-facing observability applications used by 4,000+ engineers globally, enabling them to detect, understand, and resolve reliability issues before they impact customers.
We are building a real-time platform for customer experience observability and analytics at scale. This platform enables engineers to:
- Detect and respond to customer experience degradations in real time
- Ensure safe code deployments and fast feature rollouts
- Leverage actionable insights to continuously improve service quality
Uber runs 5,000+ microservices with hundreds of daily deployments, making observability a critical foundation for reliability across the company.
We are investing in the next generation of observability - bringing automation, intelligent insights, and deep integration into the development lifecycle to reduce operational overhead and improve system reliability.
The Role
We are looking for an Engineering Manager II to lead a team building Uber’s next-generation observability experience.
In this role, you will combine technical leadership, execution, and people management to deliver scalable systems that improve reliability and developer productivity. You will own a key area of the observability platform, driving roadmap, architecture, and delivery in a highly cross-functional environment.
What You’ll Do
Technical & Product Leadership
- Lead the design and delivery of systems powering customer experience observability, including monitoring, alerting, and incident response.
- Build platforms that enable engineers to detect issues early, respond effectively, and improve service quality over time.
- Drive architecture and technical direction for large-scale distributed systems.
Strategy & Execution
- Define and execute on a roadmap, aligning team priorities with broader organizational goals.
- Translate complex reliability and product needs into clear technical plans and deliverables.
- Drive execution through strong prioritization, delegation, and cross-team alignment.
Incident Detection & Reliability
- Build and improve systems that reduce time-to-detection (TTD) and time-to-resolution (TTR).
- Enable scalable alerting and incident workflows, including signal correlation, noise reduction, and actionable alerting.
- Improve the overall effectiveness of on-call by reducing manual effort and improving signal quality.
Rollout Safety & Monitoring
- Develop systems that support safe deployments and feature rollouts, using real-time monitoring and guardrails.
- Enable teams to detect regressions quickly and make data-informed rollout decisions.
Data & Platform Foundations
- Drive development of reliable, scalable data pipelines and platforms that power observability and analytics use cases.
- Establish best practices for instrumentation, metrics, and data consistency across services.
People & Team Development
- Build, mentor, and grow a high-performing team of engineers.
- Foster a culture of ownership, technical excellence, and strong engineering fundamentals.
- Empower engineers and tech leads to take end-to-end ownership of projects.
Cross-Functional Leadership
- Collaborate closely with Engineering, Product, TPM, and partner teams to deliver end-to-end solutions.
- Communicate priorities, trade-offs, and outcomes clearly to stakeholders and leadership.
Basic Qualifications
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- 10+ years of software engineering experience, including 4+ years managing teams.
- Experience building and operating large-scale distributed systems in production.
- Track record of delivering impactful technical solutions at scale.
Preferred Qualifications
- Experience with observability, reliability engineering, or developer platforms.
- Experience working with real-time systems, monitoring, or data platforms.
- Strong ability to drive execution through delegation and cross-team alignment.
- Experience defining and executing technical strategy across teams.
- Excellent communication and stakeholder management skills.
Why Join Us
- Build systems that directly impact reliability and customer experience at Uber scale
- Solve complex distributed systems challenges in a high-impact domain
- Lead a team shaping the future of observability and developer experience
Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuelds progress. What moves us, moves the world - let’s move it forward, together.
Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.
*Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to accommodations@uber.com.
See our Candidate Privacy Statement
Uber is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, Veteran Status, or any other characteristic protected by law.