The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Jun. 27, 2023
Filed:
Jun. 15, 2020
Applicant:
Tencent America LLC, Palo Alto, CA (US);
Inventors:
Shi-Xiong Zhang, Redmond, WA (US);
Yong Xu, Bellevue, WA (US);
Meng Yu, Bellevue, WA (US);
Dong Yu, Bothell, WA (US);
Assignee:
TENCENT AMERICA LLC, Palo Alto, CA (US);
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 40/16 (2022.01); G10L 21/0272 (2013.01); G10L 17/00 (2013.01); G06T 11/60 (2006.01); G06T 7/20 (2017.01); G06N 3/02 (2006.01); G06T 7/00 (2017.01); G06V 20/40 (2022.01);
U.S. Cl.
CPC ...
G10L 21/0272 (2013.01); G06N 3/02 (2013.01); G06T 7/0012 (2013.01); G06T 7/20 (2013.01); G06T 11/60 (2013.01); G06V 20/46 (2022.01); G06V 40/171 (2022.01); G10L 17/00 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/20084 (2013.01); G06T 2207/30201 (2013.01); G06T 2210/22 (2013.01);
Abstract
A method, computer program, and computer system for separating a target voice from among a plurality of speakers is provided. Video data associated with the plurality of speakers and audio data associated with each of the one or more speakers are received. Video feature data is extracted from the received video data. The target voice is identified from among the plurality of speakers based on the received audio data and the extracted video feature data.