The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 19, 2016

Filed:

Aug. 20, 2013
Applicant:

Cisco Technology, Inc., San Jose, CA (US);

Inventors:

Aparna Khare, San Jose, CA (US);

Neha Agrawal, San Jose, CA (US);

Sachin S. Kajarekar, Sunnyvale, CA (US);

Matthias Paulik, San Jose, CA (US);

Assignee:

Cisco Technology, Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/00 (2013.01); G10L 15/26 (2006.01); G10L 15/06 (2013.01); G10L 15/187 (2013.01); G10L 15/04 (2013.01); G10L 17/00 (2013.01); G10L 15/02 (2006.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/187 (2013.01); G10L 15/04 (2013.01); G10L 15/26 (2013.01); G10L 17/00 (2013.01); G10L 2015/025 (2013.01);
Abstract

An audio stream is segmented into a plurality of time segments using speaker segmentation and recognition (SSR), with each time segment corresponding to the speaker's name, producing an SSR transcript. The audio stream is transcribed into a plurality of word regions using automatic speech recognition (ASR), with each of the word regions having a measure of the confidence in the accuracy of the translation, producing an ASR transcript. Word regions with a relatively low confidence in the accuracy of the translation are identified. The low confidence regions are filtered using named entity recognition (NER) rules to identify low confidence regions that a likely names. The NER rules associate a region that is identified as a likely name with the name of the speaker corresponding to the current, the previous, or the next time segment. All of the likely name regions associated with that speaker's name are selected.


Find Patent Forward Citations

Loading…