The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 11568761 B1

Date of Patent:

Jan. 31, 2023

Filed:

Sep. 13, 2018

Pronunciation error detection apparatus, pronunciation error detection method and program

Applicant:

Nippon Telegraph and Telephone Corporation, Chiyoda-ku, JP;

Inventors:

Satoshi Kobashikawa, Yokosuka, JP;

Ryo Masumura, Yokosuka, JP;

Hosana Kamiyama, Yokosuka, JP;

Yusuke Ijima, Yokosuka, JP;

Yushi Aono, Yokosuka, JP;

Assignee:

NIPPON TELEGRAPH AND TELEPHONE CORPORATION, Chiyoda-ku, JP;

Attorney:

Oblon, McClelland, Maier & Neustadt, L.L.P.

Primary Examiner:

Abdelali Serrou

Int. Cl.

CPC ...

G10L 15/187 (2013.01); G09B 19/06 (2006.01); G09B 5/04 (2006.01); G10L 15/18 (2013.01); G10L 15/19 (2013.01);

U.S. Cl.

CPC ...

G09B 19/06 (2013.01); G09B 5/04 (2013.01); G10L 15/187 (2013.01); G10L 15/1815 (2013.01); G10L 15/19 (2013.01);

Abstract

The present invention provides a pronunciation error detection apparatus capable of following a text without the need for a correct sentence even when erroneous recognition such as a reading error occurs. The pronunciation error detection apparatus comprises: a speech recognition part that recognizes the speech in speech data based on a speech recognition model for a non-native speaker, and outputs speech recognition results, reliability and time information; a reliability determination part that outputs the speech recognition results with higher reliability than a predetermined threshold and the corresponding time information as the determined speech recognition results and the determined time information; and a pronunciation error detection part that outputs a phoneme as a pronunciation error when reliability for each phoneme in the speech recognition results using the native speaker speech recognition model under a weakly constraining grammar is greater than the reliability of the corresponding phoneme in the speech recognition results using the native speaker acoustic model under a constraining grammar in which the determined speech recognition results are correct for the speech data in a segment specified by the determined time information.

Find Patent Forward Citations