The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 10, 2023

Filed:

Nov. 25, 2019
Applicant:

Ai Speech Co., Ltd., Jiangsu, CN;

Inventors:

Hongbo Song, Suzhou, CN;

Chengya Zhu, Suzhou, CN;

Weisi Shi, Suzhou, CN;

Shuai Fan, Suzhou, CN;

Assignee:

AI SPEECH CO., LTD., Jiangsu, CN;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/00 (2013.01); G10L 15/22 (2006.01); G10L 15/30 (2013.01); G10L 15/18 (2013.01); G10L 15/183 (2013.01);
U.S. Cl.
CPC ...
G10L 15/22 (2013.01); G10L 15/30 (2013.01); G10L 15/183 (2013.01); G10L 15/1822 (2013.01); G10L 2015/225 (2013.01);
Abstract

An embodiment of the present invention provides a method of man-machine interaction, including: receiving first audio uploaded by a user through a client end, marking a start time and an end time of the first audio, and generating a first recognition result of the first audio using an audio decoder; determining whether the first audio is a short speech based on the start time and end time thereof, and in case of a short speech, generating a second recognition result of the second audio using the audio decoder upon receiving the second audio uploaded by the client end within a preset heartbeat protection time range, sending at least the first recognition result and the second recognition result to a language prediction model; and if it is determined that a combination of the recognition results constitutes a sentence, generating an answering instruction corresponding to the sentence, and sending the answering instruction together with a feedback time mark of the answering instruction to the client end. Unreasonable sentence segmentation in a full-duplex dialogue scenario and redundant replies in the dialogue can thereby be avoided.


Find Patent Forward Citations

Loading…