The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 29, 2023

Filed:

Mar. 20, 2020
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

MD Arafat Sultan, Croton-on-Hudson, NY (US);

Vittorio Castelli, Croton-on-Hudson, NY (US);

Shubham Chandel, Jersey City, NJ (US);

Ramon Astudillo, White Plains, NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/088 (2023.01); G06F 16/2452 (2019.01); G06F 40/30 (2020.01); G06N 3/042 (2023.01);
U.S. Cl.
CPC ...
G06N 3/088 (2013.01); G06F 16/24522 (2019.01); G06F 40/30 (2020.01); G06N 3/042 (2023.01);
Abstract

Embodiments relate to an artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both accuracy and diversity of each synthetic question is leveraged to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity and maintaining accuracy of the generated synthetic questions.


Find Patent Forward Citations

Loading…