The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 18, 2025

Filed:

Dec. 16, 2022
Applicant:

Zhejiang Gongshang University, Zhejiang, CN;

Inventors:

Xiaoning Jiang, Zhejiang, CN;

Kai Liu, Zhejiang, CN;

Yuhan Zhou, Zhejiang, CN;

Hongmin Xie, Zhejiang, CN;

Yukuan He, Zhejiang, CN;

Weijie Liu, Zhejiang, CN;

Jie Zhang, Zhejiang, CN;

Zhen Liu, Zhejiang, CN;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/232 (2020.01); G06F 40/295 (2020.01); H04L 51/212 (2022.01);
U.S. Cl.
CPC ...
G06F 40/232 (2020.01); G06F 40/295 (2020.01); H04L 51/212 (2022.05);
Abstract

A method and a system for filtering ill corpus are provided. The method includes following steps: acquiring a text corpus to be recognized, and preprocessing the text corpus to be recognized to obtain a basic text corpus; extracting entities in the basic text corpus, and performing matching search on the entities of the basic text corpus according to an ill-text knowledge graph to obtain a first recognition result; detecting and recognizing the basic text corpus according to a corpus recognition model to obtain a second recognition result; and filtering the text corpus to be recognized according to the first or/and the second recognition result, and updating the ill-text knowledge graph according to the second recognition result. With semantic network essence and strong correlation ability of knowledge graph technology, candidate ill entities can be obtained, thus facilitating filtering of obscure ill information in forms of pinyin, homophonic words and split words.


Find Patent Forward Citations

Loading…