The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 26, 2020

Filed:

Oct. 07, 2019
Applicant:

Verbit Software Ltd., Tel Aviv, IL;

Inventors:

Eric Ariel Shellef, Ramat Gan, IL;

Yaakov Kobi Ben Tsvi, Ramat Hasharon, IL;

Iris Getz, Ramat Hasharon, IL;

Tom Livne, Herzliya, IL;

Roman Himmelreich, Tel Aviv, IL;

Elisha Yehuda Rosensweig, Ra'anana, IL;

Assignee:

Verbit Software Ltd., Tel Aviv, IL;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/19 (2013.01); G10L 15/22 (2006.01); G10L 25/60 (2013.01); G10L 15/18 (2013.01); G10L 15/30 (2013.01); G10L 15/04 (2013.01); G10L 15/26 (2006.01); G10L 15/20 (2006.01); G10L 15/01 (2013.01); G10L 15/02 (2006.01); G10L 15/06 (2013.01); G06F 3/0484 (2013.01);
U.S. Cl.
CPC ...
G10L 15/19 (2013.01); G10L 15/01 (2013.01); G10L 15/02 (2013.01); G10L 15/04 (2013.01); G10L 15/063 (2013.01); G10L 15/1815 (2013.01); G10L 15/20 (2013.01); G10L 15/22 (2013.01); G10L 15/265 (2013.01); G10L 15/30 (2013.01); G10L 25/60 (2013.01); G06F 3/0484 (2013.01);
Abstract

Maintaining adequate audio quality is very important for creating fast and accurate transcriptions, especially in a hybrid transcription setting, in which human transcribers review transcriptions generated by automatic speech recognition (ASR) systems. Some embodiments described herein involve detecting low-quality audio intended for transcription. In one embodiment, a server receives an audio recording that includes speech. The server generates feature values based on a segment of the audio recording and utilizes a model to calculate, based on the feature values, a certain value indicative of expected hybrid transcription quality of the segment. The model is generated based on training data that includes feature values generated based on previously recorded segments of audio, and values of transcription-quality metrics generated based on transcriptions of the previously recorded segments, which were generated at least in part by human transcribers. Optionally, an alert is provided responsive to the certain value being below a threshold.


Find Patent Forward Citations

Loading…