Site Reliability Engineering, Engineering in Boulder, CO
At Uber, we ignite opportunity by setting the world in motion. We take on big problems to help drivers, riders, delivery partners, and eaters get moving in more than 600 cities around the world.
We welcome people from all backgrounds who seek the opportunity to help build a future where everyone and everything can move independently. If you have the curiosity, passion, and collaborative spirit, work with us, and let’s move the world forward, together.
About the Role:
You will have an opportunity to make an immediate impact that improves the quality of our processing platform and Data Center Infrastructure. Key areas of impact: config management, auto-remediation, insights and monitoring (building tools, systems, databases, dashboards), systems architecture and design (for availability, performance), tuning for performance (software and config). This role involves a wide variety of technologies and we do not have siloed responsibilities set in stone. We are looking for talented individuals to push us forward, think outside the box, and help us innovate. We value the input of all our teammates, and it is important that you can contribute right away both with ideas and hands on engineering.
What You’ll Do:
Partner with fellow engineers to architect and build mission critical distributed systems that can stand the test of scale and availability, while limiting operational overhead
Design, build & support systems to detect, alert and remediate or escalate on platform, service and DC (hardware and perf) events
Design and build systems to provide improved insight into the operation of the datacenter: perf/event data reporting, visualization, capacity planning, DC infra, & config management tools
What You'll need:
Coding skills.Good programming skills in one of C++/Java, Python or Go, or Ruby, and an ability to pick up new ones.
Systems architecture:You have designed scalable, performant, systems
Configuration management.You should have real world experience with tools like Puppet, Chef, and/or Ansible.
Operating Systems.You should have strong experience in the Linux environment and a solid understanding of its fundamentals, such as DHCP, PXE, DNS, various imaging solutions, packaging, kernel tuning, troubleshooting, etc.
Team player.We have internal customers whom we want to serve to the best of our ability. You should have customer service skills and be able to develop solutions that span multiple teams.
Bonus points if:
You’re a Puppet expert
You have large scale infrastructure tool building experience (monitoring, auto-remediation, config mgmt)
About the team:
Uber is looking for top-notch Engineers to automate and build compute infrastructure at scale. We actively challenge existing trends and are always seeking the best solution to a problem. Not only are we solving our own problems, but because of our scale, we are solving problems that other companies have not yet had.
At Uber we don’t just accept difference—we celebrate it, we support it, and we thrive on it for the benefit of our employees, our products and our community. Uber is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.