The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Apr. 30, 2024

Filed:
Jul. 06, 2023

Applicant:

Zhejiang Lab, Zhejiang, CN;

Inventors:

Jingsong Li, Hangzhou, CN;

Lixin Shi, Hangzhou, CN;

Ran Xin, Hangzhou, CN;

Zongfeng Yang, Hangzhou, CN;

Yu Tian, Hangzhou, CN;

Tianshu Zhou, Hangzhou, CN;

Assignee:

ZHEJIANG LAB, Hangzhou, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2019.01); G06F 40/169 (2020.01); G06F 40/284 (2020.01); G06F 40/295 (2020.01); G06F 40/30 (2020.01); G06F 40/40 (2020.01);
U.S. Cl.
CPC ...
G06F 40/295 (2020.01); G06F 40/169 (2020.01); G06F 40/284 (2020.01); G06F 40/30 (2020.01); G06F 40/40 (2020.01);
Abstract

Disclosed are a method and an apparatus for NER-oriented Chinese clinical text data augmentation. Unannotated data and label-linearized annotated data are obtained through data preprocessing. Using the unannotated data, part of the information in the text is concealed and the concealed part is predicted from the retained information; meanwhile, an entity word-level discrimination task is introduced for pre-training a span-based language model. In the fine-tuning stage, a plurality of decoding mechanisms are introduced: the relationship between a text vector and text data is obtained based on the pre-trained span-based language model, and linearized data with entity labels is converted into the text vector. In the prediction stage of the text generation model, text is generated through forward decoding and reverse decoding to obtain enhanced data with annotation information.
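
The abstract does not spell out how label linearization is performed. As an illustration only, the following minimal sketch shows one common way to linearize annotated NER data: each entity span in a BIO-tagged character sequence is wrapped in marker tokens, so that the labels travel inside the text a generation model sees and can be recovered from generated text afterwards. The function names `linearize` and `delinearize`, the `<TYPE>` ... `</TYPE>` marker format, and the `DIS` (disease) label are all assumptions made for this example and are not taken from the patent.

```python
from typing import List, Optional, Tuple


def linearize(chars: List[str], tags: List[str]) -> List[str]:
    """Wrap each BIO-tagged entity span in <TYPE> ... </TYPE> marker tokens.

    Hypothetical scheme for illustration; the patent's own format may differ.
    """
    out: List[str] = []
    open_type: Optional[str] = None  # entity type of the span currently open
    for ch, tag in zip(chars, tags):
        prefix, _, etype = tag.partition("-")  # "B-DIS" -> ("B", "-", "DIS")
        # Close the open span unless this tag continues it (I- of the same type).
        if open_type is not None and (prefix != "I" or etype != open_type):
            out.append(f"</{open_type}>")
            open_type = None
        # Open a new span on B-, or leniently on a stray I- with no open span.
        if prefix in ("B", "I") and open_type is None:
            out.append(f"<{etype}>")
            open_type = etype
        out.append(ch)
    if open_type is not None:
        out.append(f"</{open_type}>")
    return out


def delinearize(tokens: List[str]) -> Tuple[List[str], List[str]]:
    """Recover characters and BIO tags from a linearized token sequence."""
    chars: List[str] = []
    tags: List[str] = []
    open_type: Optional[str] = None
    first = False  # True until the first character of the open span is seen
    for tok in tokens:
        is_marker = len(tok) > 2 and tok.startswith("<") and tok.endswith(">")
        if is_marker and tok.startswith("</"):
            open_type = None
        elif is_marker:
            open_type = tok[1:-1]
            first = True
        else:
            chars.append(tok)
            if open_type is None:
                tags.append("O")
            else:
                tags.append(("B-" if first else "I-") + open_type)
                first = False
    return chars, tags


if __name__ == "__main__":
    # "The patient has a history of diabetes", with "糖尿病" tagged as a disease.
    chars = list("患者有糖尿病史")
    tags = ["O", "O", "O", "B-DIS", "I-DIS", "I-DIS", "O"]
    linear = linearize(chars, tags)
    print(" ".join(linear))
    assert delinearize(linear) == (chars, tags)
```

In a pipeline like the one the abstract describes, sequences of this kind would be the training input for the text generation model, and the inverse mapping would turn generated text back into annotated examples. How the patent combines forward and reverse decoding is not shown here.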

