Your responsibilities:

This position is concerned with research and development in advanced speech synthesis. The work will have a particular focus on the development of structured acoustic models which take account of factors such as accent and speaking style, and on the development of machine learning techniques for vocoding. You will have a PhD in speech processing, computer science, cognitive science, linguistics, engineering, mathematics, or a related discipline. You will have the necessary programming ability to conduct research in this area, a background in statistical modelling using Hidden Markov Models, DNN, RNN, speech signal processing, and research experience in speech synthesis.

A background in one or more of the following areas is also desirable: statistical parametric text-to-speech synthesis using HMMs and HSMMs; glottal source modelling; speech signal modelling; speaker adaptation using the MLLR or MAP family of techniques; familiarity with software tools including DNN, Deep Learning, RNN, HTK, HTS, Festival; and familiarity with modern machine learning.

You will:

Develop and extend speech synthesis technologies in Oben’s proprietary speech synthesis system, in view of the realisation of prosody and voice quality modifications;
Develop and apply algorithms to annotate prosody and voice quality in expressive speech synthesis corpora
Carry out a listener evaluation study of expressive synthetic speech.

You must have:

PhD (Prefered), M.Sc. in Computer Science or Electrical Engineering
High proficiency in C++, Python, Java, Matlab
Experience with data-driven statistical or machine learning methods.
Enjoys a highly collaborative environment with minimal supervision.
Experience with speech synthesis

Great to have:

Familiarity with linguistic phonetics.
Knowledge of basic digital signal processing techniques for audio.
Experience with software engineering best practices including unit testing, continuous integration, and source control.
Proficiency in Java/Android, Objective C/iOS, javascript, C#/Unity3D.

Application Requirements

Please send the following to careers@oben.com

Detailed resume and/or LinkedIn profile
Links to any research / papers you have been an instrumental part of and are proud of
Name of instructor / adviser, if any along with link to their profile
Cover Letter identifying your five favorite apps on your phone

Interview process:

STAGE 1: Phone Interview
STAGE 2: In-person Interview at Idealab (we cover travel expenses for the day)
STAGE 3: We require a sample project submission and a candidate proposal submission(To know more about what an ObEN candidate proposal is, click here)
STAGE 4: Spend a day at our office and participate in all team activities.
STAGE 5: Offer Letter

Not ready to apply for this job? Sign-up to receive ObEN job alerts.


ObEN's proprietary artificial intelligence technology quickly combines a person's 2D image and voice to create a personal 3D avatar. Transport your personal avatar into virtual reality and augmented reality environments and enjoy deeper, social, more memorable experiences. Founded in 2014, ObEN is an HTC VIVE X portfolio company and is located in Pasadena, California at leading technology incubator Idealab.