The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 21, 2025

Filed:

Jul. 14, 2022
Applicant:

Beijing Baidu Netcom Science Technology Co., Ltd., Beijing, CN;

Inventors:

Tongyang Liu, Beijing, CN;

Shu Wang, Beijing, CN;

Wanli Chang, Beijing, CN;

Wei Zheng, Beijing, CN;

Zhifan Feng, Beijing, CN;

Chunguang Chai, Beijing, CN;

Yong Zhu, Beijing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/211 (2020.01); G06F 40/109 (2020.01); G06F 40/30 (2020.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06F 40/211 (2020.01); G06F 40/109 (2020.01); G06F 40/30 (2020.01); G06N 3/08 (2013.01);
Abstract

A method for generating a pre-trained language model, includes: obtaining sample files; obtaining typography structure information and text information of the sample files by parsing the sample files; obtaining a plurality of task models of a pre-trained language model; obtaining a trained pre-trained language model by jointly training the pre-trained language model and the plurality of task models according to the typography structure information and the text information; and generating a target pre-trained language model by fine-tuning the trained pre-trained language model according to the typography structure information and the text information.


Find Patent Forward Citations

Loading…