
The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Patent No.:

US 11768961 B1

Date of Patent:

Sep. 26, 2023

Filed:

Oct. 28, 2021

System and method for speaker role determination and scrubbing identifying information

Applicant:

Microsoft Technology Licensing, LLC, Redmond, WA (US);

Inventors:

Yun-Cheng Ju, Bellevue, WA (US);

Ashwarya Poddar, Seattle, WA (US);

Royi Ronen, Tel Aviv, IL;

Oron Nir, Hertzeliya, IL;

Ami Turgman, Tel Aviv, IL;

Andreas Stolcke, Berkeley, CA (US);

Edan Hauon, Givatayim, IL;

Assignee:

MICROSOFT TECHNOLOGY LICENSING, LLC, Redmond, WA (US);

Attorney:

Weaver IP L.L.C.

Primary Examiner:

Kambiz Zand

Assistant Examiner:

Aubrey H Wyszynski

Int. Cl.

CPC ...

G06F 21/62 (2013.01); G06F 40/295 (2020.01); G10L 15/26 (2006.01); G10L 17/00 (2013.01); G10L 15/22 (2006.01);

U.S. Cl.

CPC ...

G06F 21/6254 (2013.01); G06F 40/295 (2020.01); G10L 15/26 (2013.01); G10L 17/00 (2013.01); G10L 2015/228 (2013.01);

Abstract

Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.
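The scrubbing half of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a speech recognizer has already produced a word-level transcript with timestamps, that the identifying text is the few words following a key phrase such as "my name is", and that silence is an acceptable form of "different audio data". The names `Word`, `find_spans`, `scrub`, the `window` parameter, and the `[REDACTED]` mask are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording


def find_spans(words, key_phrases, window=2):
    """Return merged (first, last) word-index spans to scrub.

    Assumption: the `window` words after a key phrase are the identifying text.
    """
    tokens = [w.text.lower().strip(".,!?") for w in words]
    spans = []
    for phrase in key_phrases:
        p = phrase.lower().split()
        for i in range(len(tokens) - len(p) + 1):
            if tokens[i:i + len(p)] == p:
                first = i + len(p)
                last = min(first + window, len(words)) - 1
                if first <= last:
                    spans.append((first, last))
    # Merge overlapping or adjacent spans so each word is scrubbed once.
    merged = []
    for first, last in sorted(spans):
        if merged and first <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], last))
        else:
            merged.append((first, last))
    return merged


def scrub(words, samples, sample_rate, key_phrases, window=2, mask="[REDACTED]"):
    """Replace identifying text with `mask` and the matching audio with silence."""
    out_words = list(words)
    out_samples = list(samples)
    # Work right to left so earlier word indices stay valid after replacement.
    for first, last in reversed(find_spans(words, key_phrases, window)):
        start, end = words[first].start, words[last].end
        lo = round(start * sample_rate)
        hi = min(round(end * sample_rate), len(out_samples))
        if lo < hi:
            out_samples[lo:hi] = [0.0] * (hi - lo)  # silence stands in for "different audio"
        out_words[first:last + 1] = [Word(mask, start, end)]
    return out_words, out_samples
```

Called with a transcript of "hello my name is Jane Doe calling" and the key phrase "my name is", `scrub` would return the words "hello my name is [REDACTED] calling" and zero out the samples spanning "Jane Doe". A real system would likely pick the portion to scrub with something more robust than a fixed word window, for example named-entity recognition.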
