Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (2024)

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (1)

Deepgram

Building foundational AI for speech transcription and understanding.

$150K - $210K

Location

Remote / Remote (US)

Job Type

Full-time

Experience

6+ years

Apply to

Deepgram

and hundreds of other fast-growing YC startups with a singleprofile.

Apply to role ›

About the role

Deepgram is a foundational AI company on a mission to transform human-machine interaction using natural language. We give any developer access to the fastest, most powerful voice AI platform including access to models for speech-to-text, text-to-speech, and spoken language understanding with just an API call. From transcription to sentiment analysis to voice synthesis, Deepgram is the preferred partner for builders of voice AI applications.

The Opportunity

Despite the proliferation of text-based communication, voice remains the preferred medium for humans to interact with machines. Delivering real-world voice AI solutions to our customers' most challenging problems ultimately drives our mission. At Deepgram, you will have the unique opportunity to innovate, experiment, and build -- significantly shaping our products and AI capabilities. We value tenacious problem-solving and the ability to iterate, learn and adapt. Domain-specific expertise in speech or language AI is not required. As such, you're encouraged to deepen your skills on-the-job, broadening your knowledge and expertise through constant iteration and invention. Our start-up environment offers a stunning growth trajectory due to a level of ownership and an on-ground connection with end-customers that larger research labs simply cannot provide. Embark on a journey to redefine voice technology with us at Deepgram.

The Role

Deepgram is currently looking for strong Research Scientists who have demonstrated experience in solving hard problems using deep learning. At Deepgram, you will apply your skills to uncover breakthroughs that define the future of voice-enabled applications and experiences. Your work will revolve around harnessing vast audio and text datasets to train foundation models that go beyond transcribing speech and comprehending text -- the models you’ll be building will unlock nuanced meanings in complex conversation, adapt robustly to diverse speech patterns, and generate empathic responses with human-like, contextualized speech. You will collaborate with product & engineering to help deploy these models in the most scalable voice API on the planet. We look forward to you bringing your whole self to work, sharing learnings from your latest experiments, and collaborating with us to advance the state of AI and voice technology.

What You’ll Do

  • Design and carry out experimental programs to build new speech and language AI foundation models across modalities and tasks, that solve critical problems for our customers.

  • Drive large-scale training jobs successfully on massive distributed computing infrastructure.

  • Optimize model architectures to make them as fast and memory-efficient as possible; deploy new models into production for use at massive scale.

  • Document and present results and complex technical concepts clearly for internal and external audiences

  • Stay up to date with the latest advances in deep learning with a particular eye towards their implications and applications within our products.

You’ll Love This Role If You

  • Are passionate about AI and interested in leveraging data to solve hard problems

  • Enjoy building from the ground up and love to create new systems from scratch

  • Are data-driven and prefer to solve problems using iterative experimentation

It’s Important To Us That You Have

  • PhD in Physics, Electrical Engineering, Computer Science or another related field

  • Prior experience in designing and conducting experimental programs aimed at understanding complex phenomena, with the ability to rapidly iterate and change course as needed.

  • Proven experience building models from a blank page and owning the entire deep learning stack including data curation, characterization and cleaning, architecture design and model building, distributed large-scale training, and model optimization for inference.

  • Strong communication skills and the ability to translate complex concepts in simple terms, depending on the target audience

  • Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with Pytorch.

It Would Be Great if You Had

  • Prior industry experience in building deep learning models to solve complex problems, with a solid understanding toward the applications and implications of different neural network types, architectures, and loss mechanisms.

  • Deep understanding and experience working with state-of-the-art network architectures including transformers.

  • Understanding of different parallelism paradigms for efficient distributed training.

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.

We are happy to provide accommodations for applicants who need them.

About Deepgram

Deepgram is a foundational AI company building state of the art, production-ready AI models that streamline human-computer interaction and amplify productivity. By enabling seamless communication between humans and machines, we believe we can harness the untapped potential of AI and help pave the way for a more productive future. We passionately believe in the potential of audio data to transform lives, businesses, and interactions across the globe - which is why Deepgram is trusted by well-respected companies like NASA, Twilio, Auth0, and Spotify to push the boundaries of what is possible in voice technology!

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!

Our tech advantage is end-to-end deep learning, but our strength lies in our diversity of people, ideas, and experiences that allow our company to create amazing STT products for people who are true innovators in the field. We believe every voice should be heard—and understood—from our transcriptions to our customers to our employees. Come join our revolution to unlock the power of voice technology for everyone. We want to hear what you’ve got to say. deepgram.com/careers.

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (2)

Deepgram

Founded:2015

Team Size:115

Location:San Francisco

Founders

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (3)

Scott Stephenson

CEO

Similar Jobs

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (4)

Terminal

Chief of Staff

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (5)

Moonvalley

Head of Growth

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (6)

Landeed

Operations Lead

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (7)

Lago

Solution Engineer - Post-Sales

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (8)

Candid Health

Revenue Cycle Customer Success

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (9)

Emerge Career

Program Manager

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (10)

Delfino AI

Healthcare Operations Associate - Part Time

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (11)

Tavus

Strategy and Operations Leader

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (12)

Rescale

Principal Project Manager - Japan

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (13)

Culdesac

Community Manager

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (14)

Hotplate

Frontend Lead

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (15)

Cyble

CTO- Cyber Security

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (16)

Pilot AI

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (17)

Scale AI

Account Executive: Global Accounts

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (18)

Curri

Director of Supply

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (19)

Enerjazz

Chief Operations

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (20)

Reality Defender

Senior Account Executive, Federal

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (21)

Quantstamp

Applied Cryptographer (Full Time, Anywhere)

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (22)

Tovala

Head of Data Analytics

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (23)

Imbue (formerly Generally Intelligent)

Tech Lead

Research Scientist - Voice AI Foundations at Deepgram | Y Combinator (2024)

References

Top Articles
Latest Posts
Article information

Author: Errol Quitzon

Last Updated:

Views: 5623

Rating: 4.9 / 5 (79 voted)

Reviews: 86% of readers found this page helpful

Author information

Name: Errol Quitzon

Birthday: 1993-04-02

Address: 70604 Haley Lane, Port Weldonside, TN 99233-0942

Phone: +9665282866296

Job: Product Retail Agent

Hobby: Computer programming, Horseback riding, Hooping, Dance, Ice skating, Backpacking, Rafting

Introduction: My name is Errol Quitzon, I am a fair, cute, fancy, clean, attractive, sparkling, kind person who loves writing and wants to share my knowledge and understanding with you.