Senior Data Scientist

Posted 3 Days Ago
Remote
150K-220K Annually
3-5 Years Experience
Artificial Intelligence • Natural Language Processing • Software
Unlocking the power of voice data to fuel the world’s big ideas.
The Role
As a Senior Data Scientist at Deepgram, you will build data pipelines, develop audio characterizations, collaborate on automated data annotation systems, and create benchmarking methodologies to optimize voice AI models.

Company Overview

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform including access to models for speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

The Opportunity

At Deepgram, we believe that data is the key to unlocking the future of voice-enabled experiences. But building with audio data is hard: audio poses incredibly rich scientific, engineering, and infrastructure challenges that are orders of magnitude harder than working with text. As a Data Scientist at Deepgram, you will tackle conversational audio at scale, establishing automated data streams that will power the next generation of Voice AI foundation models. The models we build will go beyond basic transcription and comprehension to capture nuanced meanings in complex conversations, adapt robustly to diverse speech patterns, and generate empathic responses with human-like, contextualized speech. Domain-specific expertise in speech or language AI is not required. Rather, we’re looking for seasoned scientists who have a track record of solving hard data problems while exploring research frontiers. Our startup environment offers a stunning growth trajectory for adventure-seeking individuals, providing a level of project ownership and on-the-ground connection with end customers that larger research labs simply cannot provide.

What You’ll Do

  • Build high-performance data acquisition, preparation, and synthesis pipelines and drive them to generate data for training foundational voice models across modalities and tasks

  • Develop advanced characterizations of complex conversational audio using a diverse toolkit of signal processing techniques and deep learning methods

  • Collaborate with DataOps and Engineering to create automated systems that scale the ability of human annotators to label high-value data and provide feedback on model outputs

  • Build advanced benchmarking methodologies for evaluating interactive, conversational agent systems

It’s Important To Us That You Have

  • Experience building data processing pipelines from a blank page and owning the entire data stack, including acquisition, characterization, cleaning, serving, and transformation

  • Experience applying statistical methods or deep learning models to understand complex data

  • Ability to design and carry out research programs independently and with minimal oversight

  • Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with PyTorch

  • Strong communication skills and the ability to translate complex concepts into simple terms, depending on the target audience

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.

We are happy to provide accommodations for applicants who need them.

Compensation Range: $150K - $220K

Top Skills

Python
PyTorch

The Company
109 Employees
Hybrid Workplace
Year Founded: 2015

What We Do

Legacy speech recognition tech is slow, inaccurate, and expensive. It’s time to stop settling for out-of-the-box solutions that don’t meet enterprise needs. Deepgram is the only true end-to-end Deep Learning ASR offering real-time transcription, built to scale. Use it alone or on top of your existing tech and see results in weeks, not months or longer. When speech recognition that’s “good enough for everyone” isn’t good enough for you, try Deepgram.

Deepgram is an NVIDIA partner and a Y Combinator company.

Why Work With Us

Our culture, like our product, is constantly learning and evolving, but the heart of our team is enduring. We are a self-motivated, positive, passionate, and competitive group of people. We have the best technology on the market and are determined to help customers leverage it.

Deepgram Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

We currently have a hybrid business model with a nationally distributed workforce and one physical office in Ann Arbor, MI.

Typical time on-site: Not Specified
US