The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G10L 15/19 (2013.01); G10L 15/22 (2006.01); G10L 25/60 (2013.01); G10L 15/18 (2013.01); G10L 15/30 (2013.01); G10L 15/04 (2013.01); G10L 15/26 (2006.01); G10L 15/20 (2006.01); G10L 15/01 (2013.01); G10L 15/02 (2006.01); G10L 15/06 (2013.01); G06F 3/0484 (2013.01);

U.S. Cl.

CPC ...

G10L 15/19 (2013.01); G10L 15/01 (2013.01); G10L 15/02 (2013.01); G10L 15/04 (2013.01); G10L 15/063 (2013.01); G10L 15/1815 (2013.01); G10L 15/20 (2013.01); G10L 15/22 (2013.01); G10L 15/265 (2013.01); G10L 15/30 (2013.01); G10L 25/60 (2013.01); G06F 3/0484 (2013.01);

Abstract

Maintaining adequate audio quality is very important for creating fast and accurate transcriptions, especially in a hybrid transcription setting, in which human transcribers review transcriptions generated by automatic speech recognition (ASR) systems. Some embodiments described herein involve detecting low-quality audio intended for transcription. In one embodiment, a server receives an audio recording that includes speech. The server generates feature values based on a segment of the audio recording and utilizes a model to calculate, based on the feature values, a certain value indicative of expected hybrid transcription quality of the segment. The model is generated based on training data that includes feature values generated based on previously recorded segments of audio, and values of transcription-quality metrics generated based on transcriptions of the previously recorded segments, which were generated at least in part by human transcribers. Optionally, an alert is provided responsive to the certain value being below a threshold.

Find Patent Forward Citations