The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent: Feb. 06, 2024

Filed: Jul. 25, 2019

Applicant: Nippon Telegraph and Telephone Corporation, Tokyo, JP;

Inventors:
Ryo Masumura, Tokyo, JP;
Takanobu Oba, Tokyo, JP;
Kiyoaki Matsui, Tokyo, JP;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 25/93 (2013.01); G10L 25/78 (2013.01); G10L 15/00 (2013.01); G10L 15/02 (2006.01); G10L 21/0208 (2013.01); G06N 20/20 (2019.01); G06N 3/044 (2023.01); G06N 3/09 (2023.01); G10L 17/00 (2013.01); G10L 25/84 (2013.01);
U.S. Cl.
CPC ...
G10L 25/93 (2013.01); G06N 20/20 (2019.01); G10L 15/00 (2013.01); G10L 15/02 (2013.01); G10L 21/0208 (2013.01); G10L 25/78 (2013.01); G06N 3/044 (2023.01); G06N 3/09 (2023.01); G10L 17/00 (2013.01); G10L 25/84 (2013.01); G10L 2015/025 (2013.01);
Abstract

A voice/non-voice determination device robust with respect to an acoustic signal in a high-noise environment is provided. The voice/non-voice determination device includes an acoustic scene classification unit including a first model which receives input of an acoustic signal and outputs acoustic scene information which is information regarding a scene where the acoustic signal is collected, a speech enhancement unit including a second model which receives input of the acoustic signal and outputs speech enhancement information which is information regarding the acoustic signal after enhancement, and a voice/non-voice determination unit including a third model which receives input of the acoustic signal, the acoustic scene information and the speech enhancement information and outputs a voice/non-voice label which is information regarding a label of either a speech section or a non-speech section.
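The data flow described in the abstract can be sketched as follows. This is only an illustration of how the three units connect, not the patented implementation: the abstract does not say how any of the models is built, so the three `toy_*` functions, their thresholds, the list-of-floats frame representation, and all class and function names are hypothetical stand-ins chosen to keep the example runnable.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Frame = Sequence[float]   # one frame of the acoustic signal (assumed representation)
Vector = List[float]


@dataclass
class VoiceNonVoiceDeterminer:
    """Wires the three units from the abstract together (names are hypothetical)."""
    scene_model: Callable[[Frame], Vector]         # first model: signal -> acoustic scene information
    enhancement_model: Callable[[Frame], Vector]   # second model: signal -> speech enhancement information
    decision_model: Callable[[Frame, Vector, Vector], str]  # third model: all three inputs -> label

    def label(self, frame: Frame) -> str:
        scene_info = self.scene_model(frame)
        enhancement_info = self.enhancement_model(frame)
        # The third model sees the raw signal plus both auxiliary outputs.
        return self.decision_model(frame, scene_info, enhancement_info)

    def label_sequence(self, frames: Sequence[Frame]) -> List[str]:
        return [self.label(frame) for frame in frames]


# --- Toy stand-ins (assumptions, NOT the models from the patent) ---

def toy_scene_model(frame: Frame) -> Vector:
    # Two-way [quiet, noisy] score derived from mean energy.
    energy = sum(x * x for x in frame) / len(frame)
    noisy = 1.0 if energy > 0.5 else 0.0
    return [1.0 - noisy, noisy]


def toy_enhancement_model(frame: Frame) -> Vector:
    # Crude noise-floor subtraction on sample magnitudes.
    floor = min(abs(x) for x in frame)
    return [max(abs(x) - floor, 0.0) for x in frame]


def toy_decision_model(frame: Frame, scene_info: Vector, enhancement_info: Vector) -> str:
    # Use a stricter threshold when the scene looks noisy.
    threshold = 0.3 if scene_info[1] > 0.5 else 0.1
    return "speech" if max(enhancement_info) > threshold else "non-speech"


detector = VoiceNonVoiceDeterminer(toy_scene_model, toy_enhancement_model, toy_decision_model)
labels = detector.label_sequence([[0.0, 0.01, 0.02], [0.1, 0.9, 0.2]])
print(labels)  # -> ['non-speech', 'speech']
```

The point of the structure is that the final decision is conditioned on the scene and enhancement outputs as well as the raw signal, which is what the abstract credits for robustness in high-noise environments.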

