An Overview of Modern Speech Recognition

NLP IL F2F Meetup at Intuit

A recap of the talk on modern speech recognition, which covered the key principles of automatic speech recognition, the impact of self-supervised learning on the field, and the current progress, research, and challenges in speech recognition technology.
meetup
nlp
Author

Oren Bochman

Published

Tuesday, November 1, 2016

Modified

Monday, May 18, 2026

Keywords

NLP, Intuit, Meetup, Long-Range Reasoning, Efficient Long-Text Understanding, Speech Recognition, SCROLLS, SLED

Session Video

An Overview of Modern Speech Recognition

Abstract

Automatic speech recognition has been impacted by advances in related fields like image processing and natural language processing in recent years. One notable achievement in these areas has been the use of self-supervised learning to improve performance in computer vision and NLP tasks. This led to the development of the first self-supervised language model for speech representations, which has demonstrated impressive results in various NLP tasks. In this talk, we will review the key principles of automatic speech recognition and discuss the current progress, research, and challenges in the field

Speaker

  • Gal Hever
    • Algorithm Developer, Vision Map
    • MSc in Data Science, with over a decade of accumulated expertise in Machine Learning & Data Analytics from 8200, academy, and industry. Deploying algorithms to production by applying data-driven Machine Learning & AI solutions end to end, starting from research to development and testing.

Slides

Overview

Overview

Conversational AI

Conversational AI

ASR

ASR

ASR input challenges

ASR input challenges

Signal & Noise

Signal & Noise

Ideal System

Ideal System

ASR Task

ASR Task

slide009

slide009

slide010

slide010

slide011

slide011

WER Metric

WER Metric

ASR History

ASR History

ASR Time Line

ASR Time Line

Augmentations

Augmentations

WER we are 21

WER we are 21

WER we are 2

WER we are 2

ASR challenges

ASR challenges

diversity challenge

diversity challenge

language is dynamic

language is dynamic

what’s next

what’s next

covid understanding challenges

covid understanding challenges

Non verbal communication 1

Non verbal communication 1

Non verbal communication 2

Non verbal communication 2

DataNights Cohort

DataNights Cohort

QR for ASR Course

QR for ASR Course

Questions

Questions
  • I’ve read a couple of books on the subject, but this shows more up to date results.

  • Show me the papers?

  • The Data Nights course should be worth taking

Citation

BibTeX citation:
@online{bochman2016,
  author = {Bochman, Oren},
  title = {An {Overview} of {Modern} {Speech} {Recognition}},
  date = {2016-11-01},
  url = {https://orenbochman.github.io/posts/2023/01-11-nlp-il-meetup-intuit/talk3.html},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2016. “An Overview of Modern Speech Recognition.” November 1. https://orenbochman.github.io/posts/2023/01-11-nlp-il-meetup-intuit/talk3.html.