The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 14, 2021

Filed:

Jan. 26, 2018
Applicant:

Yutou Technology (Hangzhou) Co., Ltd., Hangzhou, CN;

Inventor:

Lichun Fan, Hangzhou, CN;

Assignee:
Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/06 (2013.01); G10L 15/02 (2006.01); G10L 15/14 (2006.01); G10L 15/16 (2006.01); G10L 25/21 (2013.01); G10L 25/24 (2013.01);
U.S. Cl.
CPC ...
G10L 15/063 (2013.01); G10L 15/02 (2013.01); G10L 15/142 (2013.01); G10L 15/16 (2013.01); G10L 25/21 (2013.01); G10L 25/24 (2013.01);
Abstract

The invention discloses a training method and a speech recognition method for a mixed frequency acoustic recognition model, which belongs to the technical field of speech recognition. The method comprises: obtaining a first-type speech feature of the first speech signal, and processing the first speech data to obtain corresponding first speech training data (S); obtaining the first-type speech feature of the second speech signal, and processing the second speech data to obtain corresponding second speech training data (S); obtaining a second-type speech feature of the first speech signal according to a power spectrum of the first speech signal, and obtaining the second-type speech feature of the second speech signal according to a power spectrum of the second speech signal (S); performing pre-training according to the first speech signal and the second speech signal, so as to form a preliminary recognition model of the hybrid frequency acoustic recognition model (S); and performing supervised parameter training on the preliminary recognition model according to the first speech training data, the second speech training data and the second-type speech feature, so as to form the hybrid frequency acoustic recognition model (S). The beneficial effects of the above technical solution are: the recognition model has better robustness and generalization.


Find Patent Forward Citations

Loading…