Skip to main content

Tell us your location

Please enter your nearest city name to help us display the correct information for your area

CAI Speech Scientist

Engineering in San Francisco, CA

We’re changing the way people think about transportation. Not that long ago we were just an app to request premium black cars in a few metropolitan areas. Now we’re a part of the logistical fabric of more than 600 cities around the world. Whether it’s a ride, a sandwich, or a package, we use technology to give people what they want, when they want it.

 

For the people who drive with Uber, our app represents a flexible new way to earn money. For cities, we help strengthen local economies, improve access to transportation, and make streets safer.

 

And that’s just what we’re doing today. We’re thinking about the future, too. With teams working on new modalities, self-driving cars and even urban air transportation, we’re in for the long haul. We’re reimagining how people and things move from one place to the next.

About the Role

 

Uber AI Labs is looking for Speech research scientists to create the next generation of Conversational AI algorithms to improve the lives of millions of people worldwide. You will be conducting research in speech and spoken language processing with focus on multi-modal conversational understanding systems, more specifically, building and improving the state-of-the-art in speech recognition (ASR), speech synthesis (TTS), and hot-word detection; collaborating with other AI teams in Uber, and working to deliver cutting-edge research to product teams across Uber.

What You’ll Do

 

  • Develop working solutions to difficult practical problems working with the product teams and other research teams.
  • Deliver state-of-the-art Speech solution to a product team and transfer the knowledge and ability necessary to maintain the solution.
  • Communicate and work closely with connected teams
  • Be able to acquire and process the necessary data from Uber’s data stores

 

What You’ll Need

 

  • Excellent technical skills including:
    • PhD level degree in a highly technical quantitative field from Electrical Engineering, Computer Science, or a similar field.
    • Proven track record of innovation and demonstrated ability to make deep learning models work in practice for speech and spoken language processing.
    • Experience with experimental research, evaluation, data collection, and analysis with scientific thinking.
    • Experience with existing ASR (such as Kaldi) and/or TTS systems, and distant microphone recognition.
  • Strong programming experience:
    • Competency in one key language, preferably Python or C++
    • Experience with deep learning frameworks (TF, PyTorch, etc.)
  • Communication
    • Must be able to communicate with other AI researchers and Uber product teams. This will require the ability to clearly explain how complex models work and what their benefits / drawbacks are.

 

 #AI-Labs-jobs


See our Candidate Privacy Statement

At Uber we don’t just accept difference—we celebrate it, we support it, and we thrive on it for the benefit of our employees, our products and our community. Uber is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status.