Back to feed

Machine Learning Researcher (Speech/Audio)

Remote Full-time Live

Brahma is a pioneering enterprise AI company developing Astras, AI-native products built to help enterprises and creators innovate at scale. Brahma enables teams to break creative bottlenecks, accelerate storytelling, and deliver standout content with speed and efficiency. Part of the DNEG Group, Brahma brings together Hollywood’s leading creative technologists, innovators in AI and Generative AI, and thought leaders in the ethical creation of AI content. We are looking for a Machine Learning Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines.

Key Responsibilities

Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech). Build and fine-tune models using frameworks like PyTorch and HuggingFace. Design training pipelines and datasets for scalable voice model training. Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation. Work closely with product and creative teams to ensure models meet quality and production constraints. Stay on top of academic and industrial trends in speech synthesis and related fields. Must Haves Strong background in machine learning and deep learning, with focus on speech/audio. Hands-on experience with TTS, voice cloning, or related voice synthesis tasks. Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar. Experience training models at scale and working with large audio datasets. Familiarity with vocoders and transformer-based architectures. Strong problem-solving skills, ability to work autonomously in a remote-first environment.

Nice to Have

PhD degree in Computer Science/ Machine Learning and publications in top venues. Contributions to open-source speech research or participation in relevant benchmarks. Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control. Experience with voice datasets or proprietary pipelines. Apply To This Job

On the same wavelength

Full Stack Engineer ID67833

Remote Full-time

Lead Full Stack Engineer ID67830

Remote Full-time

Attorney II

Remote Full-time

Sr. Developer, Application Systems

Remote Full-time

Manager, Engineering I

Remote Full-time

OPS BEHAVIORAL ANALYST - 67980035

Remote Full-time

Lead Product Manager - Services Experience (Remote - United States)

Remote Full-time

Senior Legal Engineer

Remote Full-time

Director, Continuous Improvement

Remote Full-time

Commercial Underwriter III

Remote Full-time

Experienced Customer Service Associate / Cashier – Remote Opportunity with arenaflex

Remote Full-time

Experienced Customer Support Representative – Live Chat, Email, and Phone Support

Remote Full-time

WordPress Developer Needed – High-Converting Lead Gen Site, Custom Form, API Routing, SEO Templates

Remote Full-time

Sales Development Representative

Remote Full-time

Remote Customer Service Representative – Cardholder Support & Issue Resolution for arenaflex

Remote Full-time

Remote Customer Service Representative – arenaflex Home‑Based Support Specialist (Full‑Time, Flexible Hours)

Remote Full-time

Staff Level Environmental Planner- Natural Resources Economics

Remote Full-time

Experienced Virtual or In-Office Data Entry Clerk – Join arenaflex's Dynamic Team in New York City, NY

Remote Full-time

Remote Data Entry Specialist – Work From Home | Flexible Full-Time & Part-Time Positions | Competitive Pay & Career Growth Opportunities

Remote Full-time

Senior Project Manager (Workday HCM Transformation)

Remote Full-time