The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Feb. 18, 2025
Filed:
Dec. 16, 2022
Applicant:
Zhejiang Gongshang University, Zhejiang, CN;
Inventors:
Xiaoning Jiang, Zhejiang, CN;
Kai Liu, Zhejiang, CN;
Yuhan Zhou, Zhejiang, CN;
Hongmin Xie, Zhejiang, CN;
Yukuan He, Zhejiang, CN;
Weijie Liu, Zhejiang, CN;
Jie Zhang, Zhejiang, CN;
Zhen Liu, Zhejiang, CN;
Assignee:
Zhejiang Gongshang University, Hangzhou, CN;
Abstract
A method and a system for filtering ill corpus are provided. The method includes the following steps: acquiring a text corpus to be recognized, and preprocessing it to obtain a basic text corpus; extracting entities from the basic text corpus, and performing a matching search on those entities against an ill-text knowledge graph to obtain a first recognition result; detecting and recognizing the basic text corpus with a corpus recognition model to obtain a second recognition result; and filtering the text corpus to be recognized according to the first and/or the second recognition result, and updating the ill-text knowledge graph according to the second recognition result. By drawing on the semantic-network nature and strong association capability of knowledge graph technology, candidate ill entities can be obtained, which facilitates filtering of obscure ill information in the form of pinyin, homophonic words, and split words.
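The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `IllTextKnowledgeGraph`, the functions `preprocess`, `toy_model`, and `filter_text`, the whitespace-based entity extraction, the variant lookup table, and the `***` mask are all hypothetical stand-ins chosen for illustration. In particular, a real system would presumably use a trained recognition model and a proper graph store rather than the toy rule and in-memory set shown here.

```python
import re
from dataclasses import dataclass, field


@dataclass
class IllTextKnowledgeGraph:
    """Toy stand-in for the ill-text knowledge graph (assumption, not the patent's design).

    `flagged` holds canonical ill entities; `variants` maps obscured spellings
    (e.g. pinyin or homophone forms) to their canonical entity.
    """
    flagged: set = field(default_factory=set)
    variants: dict = field(default_factory=dict)

    def match(self, entities):
        # First recognition result: canonical forms of entities found in the graph.
        hits = set()
        for entity in entities:
            canonical = self.variants.get(entity, entity)
            if canonical in self.flagged:
                hits.add(canonical)
        return hits

    def update(self, new_entities):
        # Feed model detections back into the graph.
        self.flagged.update(new_entities)


def preprocess(text):
    # Lowercase, drop punctuation, and collapse whitespace to get the "basic" corpus.
    text = re.sub(r"[^\w\s]", "", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def toy_model(basic_text):
    # Hypothetical placeholder for the corpus recognition model:
    # flags any token ending in "scam".
    return {tok for tok in basic_text.split() if tok.endswith("scam")}


def filter_text(raw_text, graph, model, mask="***"):
    basic = preprocess(raw_text)
    entities = basic.split()          # naive entity extraction (assumption)
    first = graph.match(entities)     # knowledge-graph matching
    second = model(basic)             # model-based recognition
    graph.update(second)              # update the graph from the second result
    flagged = first | second
    kept = [mask if graph.variants.get(tok, tok) in flagged else tok
            for tok in entities]
    return " ".join(kept), flagged


graph = IllTextKnowledgeGraph(flagged={"spam"}, variants={"sp4m": "spam"})
filtered, found = filter_text("Buy SP4M now, cryptoscam inside!", graph, toy_model)
print(filtered)  # buy *** now *** inside
```

In this sketch the filter masks a token when either recognition result flags it, which corresponds to the "and/or" combination in the abstract. Because the model's detections are written back into the graph, a later text containing the same token would be caught by the graph lookup alone.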