The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Jan. 21, 2025
Filed:
Jul. 14, 2022
Beijing Baidu Netcom Science Technology Co., Ltd., Beijing, CN;
Tongyang Liu, Beijing, CN;
Shu Wang, Beijing, CN;
Wanli Chang, Beijing, CN;
Wei Zheng, Beijing, CN;
Zhifan Feng, Beijing, CN;
Chunguang Chai, Beijing, CN;
Yong Zhu, Beijing, CN;
BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD., Beijing, CN;
Abstract
A method for generating a pre-trained language model, includes: obtaining sample files; obtaining typography structure information and text information of the sample files by parsing the sample files; obtaining a plurality of task models of a pre-trained language model; obtaining a trained pre-trained language model by jointly training the pre-trained language model and the plurality of task models according to the typography structure information and the text information; and generating a target pre-trained language model by fine-tuning the trained pre-trained language model according to the typography structure information and the text information.