The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 08, 2023

Filed:

Jun. 09, 2021
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Yuan Jin, Shanghai, CN;

Xi Xi Liu, Shanghai, CN;

Li ping Wang, Shanghai, CN;

Fan Xiao Xin, Shanghai, CN;

Zheng Ping Chu, Shanghai, CN;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/30 (2013.01); G10L 15/02 (2006.01); G06N 3/08 (2023.01); G10L 25/51 (2013.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/02 (2013.01); G06N 3/08 (2013.01); G10L 25/30 (2013.01); G10L 25/51 (2013.01);
Abstract

A computer-implemented method, system and computer program product for providing high quality speech recognition. A first speech-to-text model is selected to perform speech recognition of a customer's spoken words and a second speech-to-text model is selected to perform speech recognition of the agent's spoken words during a call. The combined results of the speech-to-text models used to process the customer's and agent's spoken words are then analyzed to generate a reference speech-to-text result. The customer speech data that was processed by the first speech-to-text model is reprocessed by multiple other speech-to-text models. A similarity analysis is performed on the results of these speech-to-text models with respect to the reference speech-to-text result resulting in similarity scores being assigned to these speech-to-text models. The speech-to-text model with the highest similarity score is then selected as the new speech-to-text model for performing speech recognition of the customer's spoken words during the call.


Find Patent Forward Citations

Loading…