The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 08, 2025

Filed:

Sep. 15, 2021
Applicant:

Fujifilm Corporation, Tokyo, JP;

Inventor:

Makoto Ozeki, Tokyo, JP;

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 18/214 (2023.01); G06F 18/25 (2023.01); G06F 40/40 (2020.01); G06V 10/40 (2022.01); G06V 10/98 (2022.01); H04L 51/52 (2022.01);
U.S. Cl.
CPC ...
G06F 40/40 (2020.01); G06F 18/214 (2023.01); G06F 18/251 (2023.01); G06N 20/00 (2019.01); G06V 10/40 (2022.01); G06V 10/98 (2022.01); H04L 51/52 (2022.05);
Abstract

Provided are a learning method and device of a model used for processing of automatically generating a text from an image, a program, a learned model, and a text generation device. In a learning method according to an embodiment of the present disclosure, a learning model including a feature amount extraction unit that extracts an image feature amount from an image, a text generation unit that generates an estimated text from the image feature amount, and a statistic estimation unit that generates an estimated image statistic from the text is used, and a combination of a first image, first metadata, and a first text is used as learning data. A first error of a first estimated text estimated from a first image feature amount of the first image, a second error of a first estimated image statistic estimated from the first text, and a third error of a second estimated image statistic estimated from the first estimated text are calculated, and a parameter of the learning model is updated.


Find Patent Forward Citations

Loading…