The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Feb. 14, 2023

Filed:
May 29, 2020

Applicant:

Ubtech Robotics Corp Ltd, Shenzhen, CN;

Inventors:

Li Ma, Shenzhen, CN;

Youjun Xiong, Shenzhen, CN;

Assignee:

UBTECH ROBOTICS CORP LTD, Shenzhen, CN;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 40/242 (2020.01); G06N 20/00 (2019.01); G06F 40/289 (2020.01); G06N 7/00 (2006.01);
U.S. Cl.
CPC ...
G06F 40/242 (2020.01); G06F 40/289 (2020.01); G06N 7/005 (2013.01); G06N 20/00 (2019.01);
Abstract

The present disclosure provides a corpus cleaning method and a corpus entry system. The method includes: obtaining an input utterance; generating a predicted value of an information amount of each word in the input utterance according to the context of the input utterance using a pre-trained general model; and determining redundant words according to the predicted value of the information amount of each word, and determining whether to remove the redundant words from the input utterance. In such a manner, the objectivity and accuracy of corpus cleaning can be improved.
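The steps named in the abstract can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: `toy_information_model` is a hypothetical stand-in for the pre-trained general model (it ignores context, which the real model would use), and the filler-word set, the 0.1/0.9 scores, and the 0.5 threshold are all invented for illustration.

```python
from typing import Callable, List, Tuple


def toy_information_model(words: List[str]) -> List[float]:
    """Hypothetical stand-in for the pre-trained general model.

    Returns a made-up information amount per word. Unlike the model
    described in the abstract, this toy version ignores context.
    """
    filler = {"um", "uh", "well", "please", "like"}  # assumed filler words
    return [0.1 if w.lower() in filler else 0.9 for w in words]


def clean_utterance(
    utterance: str,
    model: Callable[[List[str]], List[float]] = toy_information_model,
    threshold: float = 0.5,  # assumed cut-off, not taken from the patent
    remove: bool = True,
) -> Tuple[str, List[str]]:
    """Score each word, flag low-information words, optionally drop them."""
    words = utterance.split()          # step 1: obtain the input utterance
    scores = model(words)              # step 2: predict information amounts
    # Step 3: words scoring below the threshold are treated as redundant.
    redundant = [w for w, s in zip(words, scores) if s < threshold]
    if not remove:
        # Step 4 (decision): keep the utterance unchanged, report candidates.
        return utterance, redundant
    kept = [w for w, s in zip(words, scores) if s >= threshold]
    return " ".join(kept), redundant


if __name__ == "__main__":
    cleaned, dropped = clean_utterance("um please play some jazz")
    print(cleaned)
    print(dropped)
```

Passing `remove=False` returns the original utterance together with the flagged words, which mirrors the abstract's separate step of deciding whether to remove the redundant words at all.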

